The Stack That Seemed Like a Good Idea Every developer who processes content has built some version of the same stack. Puppeteer for PDF rendering and screenshots. Sharp or ImageMagick for image transformation.

Tesseract for OCR. Maybe wkhtmltopdf or LibreOffice thrown in for good measure. Each tool solves a real problem. Each tool works in isolation. And each tool becomes a maintenance liabilit