Microsoft Says Its Speech Recognition System Achieves New Accuracy Milestone

By Indo-Asian News Service | Updated: 21 August 2017 16:21 IST

Microsoft's conversational speech recognition system, designed to recognise the words in a conversation as accurately as humans do, has reached a 5.1 percent error rate, its lowest so far.

This milestone means that, for the first time, a computer can recognise the words in a conversation as well as a person would.

"Our research team reached that 5.1 percent error rate with our speech recognition system, a new industry milestone, substantially surpassing the accuracy we achieved last year," Microsoft said in a blog post late on Sunday.

In October last year, the team from Microsoft Artificial Intelligence and Research reported a speech recognition system that made the same number of errors as professional transcriptionists, or fewer.

At the time, the researchers reported a word error rate (WER) of 5.9 percent.
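
For readers unfamiliar with the metric, WER is conventionally calculated as the number of word substitutions, deletions and insertions needed to turn a system's transcript into the reference transcript, divided by the number of words in the reference. The short Python sketch below illustrates that standard definition. It is not Microsoft's evaluation code, and the function name and sample sentences are made up for illustration.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Return WER: word-level edit distance divided by reference length.

    Illustrative only; real benchmarks also normalise text
    (casing, punctuation, filler words) before scoring.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen
    # so far (excluding the current one) and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr

    return prev[-1] / len(ref)


if __name__ == "__main__":
    # Hypothetical transcripts: one of six reference words is dropped.
    rate = word_error_rate("the cat sat on the mat", "the cat sat on mat")
    print(f"WER: {rate:.1%}")
```

On this definition, a 5.1 percent error rate corresponds to roughly five word errors for every 100 words in the reference transcript.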

"Last year, Microsoft's speech and dialog research group announced a milestone in reaching human parity on the 'Switchboard' conversational speech recognition task, meaning we had created technology that recognised words in a conversation as well as professional human transcribers," said Xuedong Huang, Technical Fellow, Microsoft.

'Switchboard' is a corpus of recorded telephone conversations that the speech research community has used for more than 20 years to benchmark speech recognition systems.

The task involves transcribing conversations between strangers discussing topics such as sports and politics.

The team used "Microsoft Cognitive Toolkit 2.1" (CNTK), which Microsoft describes as the most scalable deep learning software available, to explore model architectures.

Additionally, Microsoft's investment in cloud compute infrastructure, specifically Azure GPUs, helped improve the effectiveness and speed with which the team could train models and test new ideas.

Reaching human parity in speech recognition accuracy has been a research goal for the last 25 years.

"Microsoft's willingness to invest in long-term research is now paying dividends for our customers in products and services such as Cortana, Presentation Translator, and Microsoft Cognitive Services," the post read.

"Moving from recognising to understanding speech is the next major frontier for speech technology," the post added.

 
