Question

Best OCR software for locating and converting documents?

Jul 29, 2014 5:28AM PDT

CNET, I am looking for the best software that will let my company batch-locate documents on the server and run OCR on them. Ideally we could run this daily to catch PDFs that have not been OCR'd yet. The initial run would need to scan the entire server and convert everything it finds. Suggestions?

Answer
Re: locate and ocr
Jul 29, 2014 5:34AM PDT

The best software would be a script you write yourself that scans the disk and converts the files it finds using your favorite OCR program. The only requirement for that OCR program is that it can be run from the command prompt and accepts the name of the file to convert as an argument.
I see no reason to expect a single program to do two such different things.

Kees
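
The approach described above can be sketched roughly as follows, in Python rather than a batch file. This is only a minimal sketch under assumptions: find_pdfs and convert_all are hypothetical helper names, and ocr_command stands in for whichever command-line OCR program is chosen. The only thing assumed about that program is that it accepts the filename as its last argument.

```python
import os
import subprocess


def find_pdfs(root):
    """Yield the path of every .pdf file under root, including subfolders."""
    for folder, _subfolders, filenames in os.walk(root):
        for name in filenames:
            if name.lower().endswith(".pdf"):
                yield os.path.join(folder, name)


def convert_all(root, ocr_command):
    """Run ocr_command once per PDF found, with the filename appended.

    ocr_command is a list such as ["my-ocr-tool", "--some-flag"].
    That tool name and flag are placeholders, not a real program.
    Returns the list of files whose conversion reported a failure.
    """
    failed = []
    for path in find_pdfs(root):
        result = subprocess.run(ocr_command + [path])
        if result.returncode != 0:
            failed.append(path)
    return failed
```

Returning the failed files instead of stopping at the first error lets a daily run finish the rest of the server and leaves a list to retry or inspect afterwards.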

locate and ocr
Jul 29, 2014 5:59AM PDT

Good point. Would I just use the basic command prompt (DOS) in Windows to write this script? And do you know which OCR programs will run from the command prompt?

Re: script
Jul 29, 2014 5:10PM PDT

1. Searching one directory can be done with basic DOS commands. Searching the whole disk is easier in VBScript or PowerShell, I think (although the built-in dir /s and for /r commands can also recurse into subfolders).
2. Sorry, no, I don't know the specifics of different OCR programs.

Kees
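
To make the difference between searching one directory and searching a whole tree concrete, here is a small sketch in Python. The function names pdfs_in_directory and pdfs_in_tree are made up for illustration; the point is only that the recursive case is a one-word change once a scripting language is used.

```python
from pathlib import Path


def pdfs_in_directory(directory):
    """PDFs directly inside one directory, ignoring subfolders."""
    return sorted(p for p in Path(directory).glob("*.pdf") if p.is_file())


def pdfs_in_tree(root):
    """PDFs anywhere under root, descending into every subfolder."""
    # Note: the "*.pdf" pattern follows the platform's case rules, so on
    # a case-sensitive file system it will not match names like "X.PDF".
    return sorted(p for p in Path(root).rglob("*.pdf") if p.is_file())
```

For an initial run over an entire server, pdfs_in_tree would be pointed at the top-level share; for a quick check of a single folder, pdfs_in_directory is enough.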

Answer
Small world.
Jul 30, 2014 2:39PM PDT

Let me share that one of the power tools we use for this kind of job is Tesseract OCR.

Why? Well, we needed OCR that was industrial strength, ran from the command line, and more. It's only one part of the software brew, but such an important one that I thought I'd share it. The price is right too: it is free and open source.
Bob
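
As a rough illustration of driving Tesseract from a script, the sketch below builds and runs one command line per image. It rests on several assumptions: the tesseract executable is installed and on the PATH, the installed version supports the "pdf" output config, and the input is an image file such as a TIFF or PNG. Tesseract reads images rather than PDFs, so scanned PDFs would first need to be converted to images with a separate tool. The names tesseract_command and ocr_image are hypothetical helpers, not part of Tesseract.

```python
import subprocess
from pathlib import Path


def tesseract_command(image_path, output_dir):
    """Build the argument list for one Tesseract run.

    The Tesseract command line has the form
        tesseract <input image> <output base> [configs]
    and the "pdf" config asks for a searchable PDF, which is written
    to <output base>.pdf.
    """
    image_path = Path(image_path)
    output_base = Path(output_dir) / image_path.stem
    return ["tesseract", str(image_path), str(output_base), "pdf"]


def ocr_image(image_path, output_dir):
    """Run Tesseract on one image and return the expected output path.

    Raises subprocess.CalledProcessError if Tesseract exits with an error.
    """
    subprocess.run(tesseract_command(image_path, output_dir), check=True)
    return Path(output_dir) / (Path(image_path).stem + ".pdf")
```

Keeping the command construction separate from the call that runs it makes it easy to print or log the exact command before anything touches the files.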