Many PDFs don't contain any actual text (scanned pages are just images), so you can't reliably scrape information from them. Right from the start you are working with a bad data source.
Yes, you could attempt OCR, but unless the PDF has text you can copy and paste, my answer is no.
Besides, if it did have selectable text, you would select all, copy, and paste it into a text file, and from there it's the usual kind of text-processing code you wrote in your first programming classes.
I'm running into more and more folks who want an app but forget that we can code this up ourselves.
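For what it's worth, here is a rough sketch in Python of what that first-programming-class code might look like once the text is in a plain text file. Everything specific in it is my assumption, not something from the original question: the function names are made up, the patterns only cover simple email addresses and US-style phone numbers, and it guesses that the resumes are separated by a form feed character. It writes CSV rather than a native Excel file, since Excel opens CSV directly.

```python
import csv
import re

# Deliberately loose patterns (assumptions); real resumes will need tuning.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\(?\d{3}\)?[\s.-]?\d{3}[\s.-]?\d{4}")


def extract_contacts(text, separator="\f"):
    """Return one {'email', 'phone'} dict per chunk that has either field.

    The separator is a guess: swap in whatever actually divides
    one resume from the next in your text file.
    """
    contacts = []
    for chunk in text.split(separator):
        email = EMAIL_RE.search(chunk)
        phone = PHONE_RE.search(chunk)
        if email or phone:
            contacts.append({
                "email": email.group(0) if email else "",
                "phone": phone.group(0) if phone else "",
            })
    return contacts


def write_csv(contacts, out):
    """Write the contacts to a file-like object as CSV."""
    writer = csv.DictWriter(out, fieldnames=["email", "phone"])
    writer.writeheader()
    writer.writerows(contacts)
```

To use it, read the pasted text (say from a hypothetical resumes.txt), pass it to extract_contacts, then open an output file with newline="" and hand it to write_csv. It only picks up the first email and first phone number in each chunk, so expect to adjust it once you see what the real text looks like.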
Does anyone know of good resume parsing software that will pull specific information (namely contact information) from a PDF file into an Excel spreadsheet?
I have a single PDF containing hundreds of resumes, and I need to parse the contact information from it into an Excel spreadsheet.
Any insight would be helpful, thank you!