MIT Develops AI-Based System That Adds Sound to Silent Videos

Advertisement
By Matt McFarland, The Washington Post | Updated: 20 June 2016 11:42 IST
MIT researchers have developed a computer system that independently adds realistic sounds to silent videos. Although the technology is nascent, it's a step toward automating sound effects for movies.

In a series of videos of drumsticks striking things - including sidewalks, grass and metal surfaces - the computer learned to pair a fitting sound effect, such as the sound of a drumstick hitting a piece of wood or rustling leaves.

The findings are an example of the power of deep learning, a type of artificial intelligence whose application is trendy in tech circles. With deep learning, a computer system learns to recognize patterns in huge piles of data and applies what it learns in useful ways.

Advertisement

In this case, the researchers at MIT's Computer Science and Artificial Intelligence Lab recorded about 1,000 videos of a drumstick scraping and hitting real-world objects. These videos were fed to the computer system, which learns what sounds are associated with various actions and surfaces. The sound of the drumstick hitting a piece of wood is different than when it disrupts a pile of leaves.

Once the computer system had all these examples, the researchers gave it silent videos of the same drumstick hitting other surfaces, and they instructed the computer system to pair an appropriate sound with the video.

Advertisement

To do this, the computer selects a pitch and loudness that fits what it sees in the video, and it finds an appropriate sound clip in its database to play with the video.

To demonstrate their accomplishment, the researcher then played half-second video clips for test subjects, who struggled to tell apart whether the clips included an authentic sound or one that a computer system had added artificially.

Advertisement

But the technology is not perfect, as MIT PhD candidate Andrew Owens, the lead author on the research, acknowledged. When the team tried longer video clips, the computer system would sometimes misfire and play a sound when the drumstick was not striking anything. Test subjects immediately knew the audio was not real.

Advertisement

And the researchers were able to get the computer to produce fitting sounds only when they used videos with a drumstick. Creating a computer that automatically provides the best sound effect for any video - the kind of development that could disrupt the sound-effects industry - remains out of reach for now.

Although the technology world has seen significant strides of late in artificial intelligence, there are still big differences in how humans and machines learn. Owens wants to push computer systems to learn more similarly to the way an infant learns about the world: by physically poking and prodding its environment. He sees potential for other researchers to use sound recordings and interactions with materials such as sidewalk cement as a step toward machines' better understanding our physical world.

© 2016 The Washington Post

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. YouTube's 'Ask YouTube' AI Chatbot Offers Smart Replies With Videos, Shorts
  2. Anthropic's New Connectors Will Make Claude More Creative
  3. Stranger Things: Tales from '85 Now Available for Streaming on Netflix
  1. AirDrop via Quick Share Reportedly Expands to Oppo Find X9 Ultra, Vivo X300 Ultra
  2. OpenAI, Amazon Announce Multi-Year Strategic Partnership as Microsoft’s Exclusive Deal Ends
  3. US Judge Rejects Former FTX CEO Sam Bankman-Fried’s Bid for New Trial
  4. Valve Says It's 'Hard at Work' on Steam Deck 2
  5. OnePlus Nord CE 6, Nord CE 6 Lite Availability Details Announced Ahead of May 7 Launch Date
  6. Smartphone Buyers in India Prioritise AI and Real-World Usage, Flipkart Report Shows
  7. Google Pixel 11 Series’ Tensor G6 Chipset Could Be Significantly Faster Than Last Year’s Tensor G5 SoC, Leak Suggests
  8. Oppo Reno 16 Pro Key Specifications Leaked; Tipped to Launch in H2 2026
  9. Samsung Galaxy S27 Tipped to Arrive With Redesigned Camera Layout to Accomodate Qi2 Magnetic Charging
  10. Anthropic’s Claude Can Now Complete Creative Tasks in Adobe, Blender and Autodesk
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.