PDF Text Extraction (2nd Wed Talk)

Hosted by Python Frederick
On Wed., Aug. 14, 2024 from 7pm to 9pm
At 122 E Patrick St, Frederick, MD 21701

\*Original Meeting date was April 10 but moved due to scheduling with the speaker.\*

Is your data locked up in portable document format (PDFs)? In this talk we're going to explore methods to extract text and other data from PDFs using readily-available, open-source Python tools (such as pypdf), as well as techniques such as OCR (optical character recognition) and table extraction. We will also discuss the philosophy of text extraction as a whole.

Also, we're going to have another "after hour." After the presentation is over, anyone who wants to stay and do a bit of coding or chatting is welcome to hang out. Think of it like a mini open workshop.

Visit the event scheduling website for all other details and how to RSVP.