From:
programminghistorian.org
Working with batches of PDF files | Programming Historian
https://programminghistorian.org/en/lessons/working-with-batches-of-pdf-files
Learn how to perform OCR and text extraction with free command-line tools such as Tesseract and Poppler, and how to get an overview of large numbers of PDF documents using topic modeling.
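
As a rough illustration of the batch text-extraction step, the sketch below calls Poppler's `pdftotext` once for every PDF in a folder. It is a minimal sketch and not necessarily the lesson's own commands: the function names and folder layout are hypothetical, it assumes `pdftotext` is installed and on the PATH, and it only recovers text from PDFs that already contain a text layer. Scanned pages would need an OCR pass (for example with Tesseract) first.

```python
from __future__ import annotations

import shutil
import subprocess
from pathlib import Path


def pdftotext_command(pdf_path: Path, out_dir: Path) -> list[str]:
    """Build the pdftotext call that writes pdf_path's text into out_dir.

    Hypothetical helper: the output file reuses the PDF's base name.
    """
    txt_path = out_dir / (pdf_path.stem + ".txt")
    # -layout asks pdftotext to keep the physical layout of the page.
    return ["pdftotext", "-layout", str(pdf_path), str(txt_path)]


def extract_batch(pdf_dir: Path, out_dir: Path) -> list[Path]:
    """Run pdftotext on every *.pdf in pdf_dir and return the files written."""
    if shutil.which("pdftotext") is None:
        raise RuntimeError("pdftotext (part of Poppler) was not found on PATH")
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    for pdf_path in sorted(pdf_dir.glob("*.pdf")):
        cmd = pdftotext_command(pdf_path, out_dir)
        # check=True raises CalledProcessError if pdftotext fails on a file.
        subprocess.run(cmd, check=True)
        written.append(Path(cmd[-1]))
    return written
```

Calling `extract_batch(Path("pdfs"), Path("text"))` would leave one `.txt` file per PDF in `text/`, which could then serve as input for a topic-modeling tool. Keeping the command construction in its own function makes it easy to inspect or adjust the flags before running anything.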