ByteDance Develops OmniHuman, an AI Framework That Can Generate Realistic Videos of Humans

OmniHuman can generate realistic videos from a single human image and motion signals such as audio or video.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 7 February 2025 14:31 IST
Highlights
  • OmniHuman can generate full-body videos
  • The AI system was trained on 18,700 hours of human video data
  • It is a research work and the model is not available in the public domain

OmniHuman can match lip movement and gestures with speech or music

Photo Credit: Unsplash/Markus Winkler

ByteDance, the company behind TikTok, recently shared its research on a new artificial intelligence (AI) framework. Dubbed OmniHuman, it is a video-generation framework that can create realistic human videos with full-body movement and lip-syncing. The researchers stated that it requires a human image along with motion signals such as video or audio to generate output. Several demonstration videos generated using the AI model have also been shared, showcasing the realism of the final output. Notably, the company stated that the AI model is available in the public domain.

OmniHuman Can Generate Realistic Human Videos

The researchers shared several demonstrations and detailed the framework on its website. It is an end-to-end system that was built using a novel multimodality motion conditioning mixed training strategy, the post claimed. While the researchers did not share any benchmark metrics, they claimed that the AI model “significantly outperforms existing methods.”

Advertisement

OmniHuman can generate videos using an image of the person and a motion signal. Motion signals can be audio only, video only or a combination of audio and video. The AI model can generate realistic videos based on text prompts. These videos can be full-body where the limbs, facial expressions, and lip movement can be synced with the audio or music playing in the background. OmniHuman can generate videos in different aspect ratios, allowing flexibility to users.

OmniHuman output example
Photo Credit: OmniHuman

 

The use of motion signals is a novel technique, which the company is calling omni-conditions training. With this, the AI model is trained on different modalities, including text, image, audio, and video. Researchers said this allowed the model to learn mixed conditioning which overcame the scarcity of high-quality data.

Advertisement

Notably, the model was trained on 18,700 hours of human video data. The details about the training process have been documented in a paper published in the online pre-print journal arXiv.

The company also shared several demonstrations of videos generated using the model, and the results appear to be highly realistic with natural body movements, hand gestures, and lip movements. Such realism has also raised concerns about deepfakes. However, the company has specified that the AI model is currently not available to be downloaded, and there is no service people can use to access its capabilities.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement
Popular Mobile Brands
  1. Google's Personal Intelligence Is Now Rolling Out to More Users
  2. Vivo V70 FE Could Launch in India Next Month at This Price
  3. Xiaomi 17 Series Goes on Sale in India: See Price, Offers
  4. Vivo X300 Ultra, Vivo X300s Will Feature This New Colour Technology
  5. Garmin Now Lets You Use WhatsApp on These Smartwatch Models
  6. OpenAI's Faster GPT-5.4 Mini and Nano AI Models Are Here: Details
  7. Powerbeats Pro 2 Nike Edition Launched in India With Apple's H2 Chip, ANC
  8. OnePlus Nord 6 Could Launch in India at This Price
  9. Here's How Much the Samsung Galaxy A57 5G and Galaxy A37 5G Might Cost
  10. Jio Users Can Get Free Incoming SMS Abroad Using Wi-Fi Calling
  1. Google’s Personal Intelligence Is Now Rolling Out to More Users
  2. Dreame L40 Ultra AE Robot Vacuum With 19,000Pa Vormax Suction Launched in India, Dreame D20 Ultra Tags Along
  3. Fourth Floor OTT Release Date: When and Where to Watch it Online?
  4. BTS Return Documentary OTT Release Date: When and Where to Watch it Online?
  5. OnePlus Nord 6 Said to Be More Expensive Than Nord 5 in India; Geekbench Listing Hints at Snapdragon Chip Upgrade
  6. OpenAI Introduces GPT-5.4 Mini, Nano as Faster Models Optimised for Coding and AI Agents
  7. Karuppu OTT Release Reportedly Revealed Online: What You Need to Know
  8. US SEC Defines Crypto Securities, Signals Clarity for US Traders and Institutions
  9. Samsung Galaxy S26 FE, Galaxy M47 5G and Galaxy F70 Pro 5G Reportedly Surface on GSMA Database
  10. Vivo V70 FE Price in India, Launch Timeline Leaked Days After Global Debut: Expected Price, Specifications
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.