Google Drive's OCR Capabilities Expanded to Over 200 Languages

Advertisement
By Ketan Pratap | Updated: 8 May 2015 16:56 IST
Google Drive has been using OCR (optical character recognition) technology to allow scanned documents uploaded to the cloud storage service to be edited and indexed. Google on Wednesday expanded the OCR capabilities within Google Drive by adding support for over 200 languages.

Google stressed that the reason to expand the OCR capabilities is that most of the information in the world is still stored in physical forms (books, newspapers, and magazines among others) and not digital. For the uninitiated, optical character recognition converts a digital image with text into digital documents using computer algorithms. Images can be processed in (.jpg, .png, and .gif) files or in PDF documents.

Users can start using the OCR capabilities in Drive by uploading scanned document in PDF or image form after which they can right-click on the document in Drive to open with Google Docs. After choosing the option, a document with the original image alongside extracted text opens, which can be edited. Google notes that users will not be required to specify the language of the document as the OCR in Drive will automatically determine it. The OCR capability in Google Drive is also available in Drive for Android.

Advertisement

The company has also listed some limitations of the OCR technology, stating it will work best on cleanly scanned, high-resolution documents. While has claimed that it is still working to improve performance on poor quality scans and challenging text layouts. The company also lists that OCR will take longer than other uploads in Drive.

Google details how it's OCR technology works within Drive, "To make this possible, engineering teams across Google pursued an approach to OCR focused on broad language coverage, with a goal of designing an architecture that could potentially work with all existing languages and writing systems. We do this in part by using Hidden Markov Models (HMMs) to make sense of the input as a whole sequence, rather than first trying to break it apart into pieces. This is similar to how modern speech recognition systems recognise audio input."

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. New OTT Releases This Week: Jolly LLB 3, Kara, Spider-Noir, and More
  2. Vivo S60 With 7,200mAh Battery and 144Hz Display Arrives at This Price
  3. Motorola Edge 70 Pro+ to Launch in India With This MediaTek Chipset
  4. Blue Origin's New Glenn Rocket Destroyed in Fiery Explosion During Ground Test
  1. Faces Out on OTT: Know Where to Stream This Psychological Thriller Film Online
  2. Blue Origin’s New Glenn Rocket Explodes During Pre-Launch Test in Florida
  3. Activision to Shut Down Call of Duty: Warzone on PS4, Xbox One After Modern Warfare 4 Launch
  4. Vivo Over-Ear Noise-Cancelling Headphones Launched With Up to 75 Hours of Battery Life
  5. Motorola Edge 70 Pro+ Key Specifications Revealed Days Ahead of Launch in India on June 4
  6. Vivo TWS 5e Launched in China With 11mm Dynamic Drivers, Hybrid Adaptive ANC, Up to 55 Hours Battery Life
  7. Vivo S60 Launched With 7,200mAh Battery and 144Hz Display, Vivo S60 Vitality Edition Tags Along: Price, Specifications
  8. France's Financial Markets Authority Sets June 20 Deadline for Crypto Firms to Acquire MiCA Licence
  9. Sathi Leelavathi OTT Release: Where to Watch Lavanya Tripathi’s Romantic Drama?
  10. 007 First Light, IO Interactive's James Bond Title, Sells 1.5 Million Copies in Just 24 Hours of Launch
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.