Google's DeepMind AI Said to Outperform Professional Lip-Readers

Advertisement
By Shekhar Thakran | Updated: 24 November 2016 18:20 IST
Highlights
  • AI system developed by DeepMind and University of Oxford
  • Trained using videos with runtime over 5000 hours
  • Google Translate can now translate languages that it hasn't seen before

Artificial technology has taken significant leaps in recent times with companies such as Microsoft even making bold claims that its speech recognition system is as good as humans. Now, the latest achievements of Google's AI technology might help in significantly improving the lives of hearing-impaired people with a better lip-reading system, as well as improving the quality of translations using Multilingual Neural Machine Translation.

An AI system developed in collaboration with DeepMind, owned by Google's parent company Alphabet, and University of Oxford was trained by using news videos with runtime of over 5,000 hours, and over 118,000 sentences, according to a report by New Scientist.

Advertisement

The videos were taken from BBC shows that aired between 2010 and 2015. After training, the system was set to lip read programmes broadcast between March and September of this year. According to the report, the system can decipher entire phrases like "We know there will be hundreds of journalists here as well," by just looking at the speaker's lips.

While a professional lip-reader was able to register a success rate of just 12.4 percent in trying to decipher 200 randomly selected clips from the above data set, the AI system successfully recognised 46.8 percent of all words from the data set.

Advertisement

Google further announced that the company's translation tool - Google Translate - which recently received the support of Neural Machine Translation to recognise entire phrases instead of single words, will now make use of Multilingual Neural Machine Translation. The company says that Multilingual Neural Machine Translation improves upon the previous system and uses it as a base.

"Our proposed architecture requires no change in the base GNMT system, but instead uses an additional "token" at the beginning of the input sentence to specify the required target language to translate to. In addition to improving translation quality, our method also enables "Zero-Shot Translation" - translation between language pairs never seen explicitly by the system," the search giant said in a blog post.

Advertisement

This effectively means that Google Translate will now use one system for all translations instead of using individual systems for each language pair translation. It further allows the system to do translation between language pairs that it has never seen before.

Even though we will have to wait for the actual impact of these achievements by Google's AI technology, it has to be said that the technology's feat is impressive at the very least.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Samsung Galaxy A57 5G, Galaxy A37 5G Announced: What You Need to Know
  2. Vivo T5 Pro Tipped to Offer Battery and Display Upgrades Over Vivo T4 Pro
  3. Vivo X300 Ultra, X300s, and Vivo Pad 6 Pro Colours Teased Ahead of Launch
  4. Google Might Soon Let You Add Your 3D Avatars to Gemini
  5. Realme 16 5G Key Specifications, Features Revealed Ahead of India Launch
  6. Samsung Unveils Exynos 1680 SoC With 200-Megapixel Camera, 144Hz Display Supported
  7. Vivo Y21 5G, Vivo Y11 5G Arrive With 6,500mAh Batteries: See Price in India
  8. iQOO Neo 11 Pro, Neo 11 Pro+ Tipped With 2K Screen and 8,000mAh+ Battery
  9. Xiaomi 17T Pro Clears Key Regulatory Hurdle in Thailand; Might Launch Soon
  1. Gemini for Google TV Upgraded With Live Sports Scorecards and Interactive Educational Visuals
  2. Samsung Galaxy A57 5G, Galaxy A37 5G With Triple Rear Cameras, 5,000mAh Batteries Announced: Price, Specifications
  3. Court Drops Fraud Case Against CoinDCX Founders, Says No Evidence Found
  4. Google Is Reportedly Working on Adding 3D Avatars to Gemini
  5. Xiaomi 17T Pro Listing on Thailand's NBTC Certification Site Hints at Imminent Global Launch
  6. CFTC Launches Innovation Task Force to Regulate Crypto and AI Markets
  7. Samsung Unveils 4nm Exynos 1680 Chipset With 200-Megapixel Camera, 144Hz Display Support
  8. Vivo X300 Ultra, X300s and Vivo Pad 6 Pro Colour Options Revealed Ahead of China Launch
  9. The Hannah Montana 20th Anniversary Special Now on JioHotstar: Know Everything About The Show
  10. Razer Viper V4 Pro Wireless Gaming Mouse Launched Alongside Razer Gigantus V2 Pro Mouse Mat: Price, Features
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.