Oct 31, 2008

Convert Scanned PDF Documents to Text with Google OCR

There are two types of PDF documents – those created by sending Office files, images, etc. to an Acrobat like PDF printer and those created by scanning physical paper like pages of a book, legal documents, etc.

google-ocr

Google could always index PDF documents created by conversion but now they also recognize text from PDFs that are generated by scanning paper documents using OCR software.

This is a scanned document and this is the html text view of that same document converted by Google.

Since scanned PDFs are nothing but images, don’t be surprised if Google adds a “search by text” function to their Image Search engine similar to OneNote or EverNote. That will surely be huge.

Convert Scanned PDFs to Text

Now if you have bunch of scanned PDF files on your hard drive and no OCR software, here’s what you can do to convert them into recognizable text.

Create a folder in your website (say abc.com/pdf) and upload all the PDF images to that folder. Now create a public web page that links to all the PDF files. Wait for the Google bots to spider your stuff.

Once done, type the query “site

.com/pdf filetype

” to see the PDF documents as HTML.

How to Handle OAuth Permissions in Google Add-ons

Google Apps Script now lets users grant partial permissions to add-ons. Learn how to detect missing OAuth scopes and prompt users to authorize the required permissions.

Amit Agarwal is a web geek, solo entrepreneur and loves making things on the Internet. Google recently awarded him the Google Developer Expert and Google Cloud Champion title for his work on Google Workspace and Google Apps Script.

Awards & Recognition

Google Developer Expert

Google Developer Expert

Google awarded us the Developer Expert title recogizing our work in Workspace

ProductHunt Golden Kitty

ProductHunt Golden Kitty

Our Gmail tool won the Lifehack of the Year award at ProductHunt Golden Kitty Awards

Microsoft MVP Alumni

Microsoft MVP Alumni

Microsoft awarded us the Most Valuable Professional title for 5 years in a row

Google Cloud Champion

Google Cloud Champion

Google awarded us the Champion Innovator award for technical expertise

Google Workspace Add-ons

Powerful tools to supercharge your productivity in Google Workspace

Mail Merge with Attachments

Send personalized bulk emails to your contacts directly from Google Sheets. Automate campaigns with custom templates, attach files, and track email opens in real-time.

Install Tutorials
Document Studio

Automatically generate pixel-perfect documents, invoices, and certificates from Google Sheets and Forms. Create PDFs, Google Docs, and slide presentations using custom templates.

Install Tutorials
Save Emails and Attachments

Automatically backup Gmail messages and attachments to Google Drive. Organize emails by labels, search by sender, and create searchable archives of your important conversations.

Install Tutorials
Google Forms Email Notifications

Send instant email notifications to form respondents and your team when Google Forms are submitted. Customize messages with form answers and add conditional logic.

Install Tutorials
Email Google Spreadsheets

Schedule and send Google Sheets as email attachments on autopilot. Share entire spreadsheets, specific cell ranges, or dynamic charts with your team at regular intervals.

Install Tutorials
Creator Studio for Google Slides

Convert Google Slides presentations into engaging animated GIFs and video files. Perfect for creating social media content, tutorials, and shareable visual stories.

Install Tutorials

Want to stay up to date?
Sign up for our email newsletter.