YouTube's New Automatic Caption Tool Can Now Describe Sound Effects

By Ketan Pratap | Updated: 27 March 2017 16:32 IST
Highlights
  • YouTube to soon automatically detect sound effects
  • The company is using machine learning
  • Initially, sound effects like applause, music, laughter will show up

YouTube introduced captions for its videos in 2007 and made automated captions for speech available a few years later. The company will soon also start describing sound effects in videos using machine learning. YouTube has developed a sound effect captioning system for its video platform in collaboration with the Sound Understanding and Accessibility teams. The system will identify and label sounds in a video without manual input.

With machine learning, YouTube will be able to automatically detect the presence of sound effects in a video and transcribe them into appropriate classes, or sound labels. YouTube will soon start showing sound effect captions such as [APPLAUSE], [MUSIC], and [LAUGHTER]. The company explains that "these were among the most frequent manually captioned sounds, and they can add meaningful context for viewers who are deaf and hard of hearing."
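As a rough illustration of what "transcribing sounds into labels" could involve, the sketch below turns per-frame class scores into timed caption labels. It is not YouTube's implementation: the function name `caption_events`, the 0.5 threshold, the one-second frame length, and the input format are all assumptions made for this example. Only the three labels come from the article.

```python
from typing import Dict, List, Tuple

# The three sound classes named in the article.
LABELS = ("APPLAUSE", "MUSIC", "LAUGHTER")


def caption_events(
    frame_scores: List[Dict[str, float]],
    threshold: float = 0.5,      # assumed cut-off, not a published value
    frame_seconds: float = 1.0,  # assumed frame length
) -> List[Tuple[float, float, str]]:
    """Merge consecutive frames whose score for a label meets the
    threshold into (start_sec, end_sec, "[LABEL]") caption events."""
    events = []
    for label in LABELS:
        start = None  # index of the first frame of the current run, if any
        for i, scores in enumerate(frame_scores):
            active = scores.get(label, 0.0) >= threshold
            if active and start is None:
                start = i
            elif not active and start is not None:
                events.append((start * frame_seconds, i * frame_seconds, f"[{label}]"))
                start = None
        if start is not None:  # run continues to the end of the clip
            end = len(frame_scores) * frame_seconds
            events.append((start * frame_seconds, end, f"[{label}]"))
    return sorted(events)


if __name__ == "__main__":
    # Hypothetical scores for a four-second clip.
    scores = [
        {"MUSIC": 0.9},
        {"MUSIC": 0.8, "APPLAUSE": 0.7},
        {"APPLAUSE": 0.6},
        {},
    ]
    for start, end, text in caption_events(scores):
        print(f"{start:.0f}s-{end:.0f}s {text}")
```

In this sketch, overlapping sounds each produce their own event, so music and applause can be captioned at the same time.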

YouTube stresses that the changes will help the 360 million people around the world who have hearing problems. The company has so far made several changes to cater to these users. It claims that the number of videos with automatic captions now exceeds 1 billion, and that people watch videos with automatic captions more than 15 million times per day.

"We started this project by taking on a wide variety of challenges, such as how to best design the sound effect recognition system and what sounds to prioritise. At the heart of the work was utilising thousands of hours of videos to train a deep neural network model to achieve high quality recognition results," said Noah Wang, Software Engineer, in a blog post.


The company adds that its new captioning tech is still in the early stages of recognising sound effects automatically. YouTube also lists further challenges that, once addressed, would improve the viewing experience for these users. "Future challenges might include adding other common sound classes like ringing, barking and knocking, which present particular problems -- for example, with ringing we need to be able to decipher if this is an alarm clock, a door or a phone."

 
