Scanned documents found...by Google

Google can now index scanned documents.

If you've ever had trouble finding scanned documents on Google, it's probably because it was not indexing them. On Thursday, this all changed. Google has announced that it is now indexing scanned documents.

Google is now able to perform optical character recognition (OCR) on any scanned document it finds stored in the PDF format. OCR technology is able to "read" a scanned document and covert it into words that can be searched and indexed.

OCR technology has always impressed me, I mean deciphering between a "0" and "O" is hard enough for a human, but for a computer? Now to apply it to all scanned PDF images on the Internet? Very impressive.

Here are a couple of examples:

Spin lock performance

Repairing aluminum wiring

About the author

Eric Franklin leads the CNET Reviews editors in San Francisco as managing editor. A 20-year industry veteran, Eric began his tech journey testing computers in the CNET Labs. When not at work he can usually be found at the gym, at the movies, or at the edge of his couch with a game controller in his hands.

 

Discuss Scanned documents found...by Google

Conversation powered by Livefyre

Show Comments Hide Comments