New Lip-Sync Technology Uses Audio Clips to Generate Realistic Video

Advertisement
By Press Trust of India | Updated: 12 July 2017 18:39 IST

Scientists have developed new computer algorithms that can turn audio clips into a realistic, lip-synced video of the person speaking those words.

The researchers successfully generated highly-realistic video of former US President Barack Obama talking about terrorism, fatherhood, job creation and other topics, using audio clips of those speeches and existing weekly video addresses that were originally on a different topic.

"These type of results have never been shown before," said Ira Kemelmacher-Shlizerman, an assistant professor at the University of Washington (UW) in the US.

Advertisement

"Realistic audio-to-video conversion has practical applications like improving video conferencing for meetings, as well as futuristic ones such as being able to hold a conversation with a historical figure in virtual reality by creating visuals just from audio," said Kemelmacher-Shlizerman.

Advertisement

In a visual form of lip-syncing, the system converts audio files of an individual's speech into realistic mouth shapes, which are then grafted onto and blended with the head of that person from another existing video.

Advertisement

The team chose Obama because the machine learning technique needs available video of the person to learn from, and there were hours of presidential videos in the public domain.

"In the future video, chat tools like Skype or Messenger will enable anyone to collect videos that could be used to train computer models," Kemelmacher-Shlizerman said.

Advertisement

Because streaming audio over the Internet takes up far less bandwidth than video, the new system has the potential to end video chats that are constantly timing out from poor connections.

"When you watch Skype or Google Hangouts, often the connection is stuttery and low-resolution and really unpleasant, but often the audio is pretty good," said Steve Seitz, professor at UW.

"So if you could use the audio to produce much higher-quality video, that would be terrific," he said.

By reversing the process - feeding video into the network instead of just audio - the team could also potentially develop algorithms that could detect whether a video is real or manufactured, researchers said.

The new machine learning tool makes significant progress in overcoming what is known as the "uncanny valley" problem, which has dogged efforts to create realistic video from audio.

When synthesised human likenesses appear to be almost real - but still manage to somehow miss the mark - people find them creepy or off-putting.

"People are particularly sensitive to any areas of your mouth that don't look realistic," said Supasorn Suwajanakorn, a doctoral graduate at UW's Allen School of Computer Science & Engineering.

"If you do not render teeth right or the chin moves at the wrong time, people can spot it right away and it is going to look fake. So you have to render the mouth region perfectly to get beyond the uncanny valley," Suwajanakorn said.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Further reading: Science, Barack Obama, Apps
Advertisement

Related Stories

Popular Mobile Brands
  1. Motorola Edge 70 Launched With Snapdragon 7 Gen 4 SoC, Slim 5.99mm Profile
  2. Lava Agni 4 Price Range, Features Leaked; Will Launch in These Colourways
  3. Moto G67 Power 5G Launched in India With 7,000mAh Battery: See Price
  4. Samsung Galaxy S26 Ultra Spotted in Leaked Renders With Rounder Corners
  5. Moto G Play (2026), Moto G (2026) With Dimensity 6300 SoC Launched
  6. Southern Taurid Meteor Shower 2025 Promises Bright Fireballs in a Rare Swarm Year
  7. Apple's Low-Cost MacBook Launch Timeline, Price Leaked Ahead of Debut
  8. OnePlus Ace 6 Pro Max Configurations Leaked; May Feature Up to 16GB of RAM
  9. WhatsApp's Apple Watch App Is Finally Out: Check Features, Compatibility
  10. How Hot Was the Universe 7 Billion Years Ago? Scientists Now Have an Answer
  1. Motorola Edge 70 Launched With Snapdragon 7 Gen 4 Chipset, Slim 5.99mm Profile: Price, Specifications
  2. Researchers Unveil How Atomic Entanglement Enhances Light Bursts
  3. Lava Agni 4 Confirmed to Launch in Two Colourways; Tipster Leaks Price Range, Key Features
  4. Google Proposes Play Store Reforms in Settlement With Fortnite Maker Epic Games
  5. Scientists Recreate Cosmic ‘Fireballs’ in Lab to Solve Mystery of Missing Gamma Rays
  6. Realme UI 7.0 Launched With Light Glass Design, AI Notify Brief and AI Gaming Coach: See Eligible Phones, Beta Release Schedule
  7. iOS 26.2 Beta 1 Rolled Out to Developers With Enhanced Safety Alerts, Reminder Alarms
  8. Samsung Galaxy S26 Ultra Spotted in Leaked Design Renders That Hint at Rounder Corners
  9. Call of Duty: Black Ops 7 PC Specifications, Preloading Times Revealed; Activision Confirms Handheld Support
  10. Silicon Carbide-Based Motor Drive Enables a Smaller, Lighter Electric Aircraft Engine
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.