Apply OCR to Previously Saved or Scanned Documents;
Easily Convert PDF Files into Editable Formats
The articles in this blog are inspired by real life issues. This weekend, I received a call from a less-skilled user. All of his electronic files had "disappeared" from his PC, and he was busy trying to recreate them from hard copies. He had already scanned a bunch before he discovered the scanned files were not editable. Rather, he had a bunch of files that had basically been scanned into giant graphics. Was there anything he could do?
We discussed that, in the future, he needed to look for a "Scan to OCR" option in his scanning software. Most scanners include a Scan to OCR plug-in. (We also had a long conversation about disappearing files, and what that might mean, as well as setting up cloud backup through Backblaze to prevent these issues in the future.) But believe it or not, all was NOT lost. It is NOT too late to convert those scans. There is an easy way to apply OCR to saved files, using Google Drive and Google Docs.
What is OCR
OCR stands for Optical Character Recognition. OCR takes electronic information stored in an image format, (like GIF, JPG, PDF, or PNG,) and converts it into characters your programs can understand. When you copy or scan something, basically a device is taking a picture of whatever you lay on its bed. Electronically, it saves a series of dots, rather than a series of letters and numbers. If you wish to edit a scanned document, first you MUST convert it back from a bunch of dots to characters software can recognize and understand. This process is called OCR, or Optical Character Recognition.
Most popular scanners come with OCR software. If you are scanning a document, and you will want to edit it later, you should look for the “scan to OCR” option in your scanning software. (It may also be called "Scan to Doc.")
If you have a document that has already been scanned to an image-based format, you will need to convert it using OCR BEFORE you can edit it.. This is also true for files you have stored on drives and other media.
OCR Using Google Docs
Google Drive contains a free and easy way to convert image-based files to editable text. This is especially handy for PDF files, as well as files received as e-mail attachments. Google Docs has hidden OCR capabilities. You can open a file stored in Google Drive with Google Docs, and run it through an OCR plug-in while you're doing so. The steps are outlined below.
