OpenAI Unveils Sora, an AI-Powered Text-to-Video Generator Capable of Creating One-Minute-Long Clips

OpenAI said Sora can create multiple shots within a single generated video.

Written by Akash Dutta, Edited by Manas Mitul | Updated: 16 February 2024 12:12 IST
Highlights
  • Sora can also generate multiple characters with specific motion
  • Currently, Sora is available to red teamers to assess harms or risks
  • The AI video generator is based on a diffusion model

Sora uses a transformer architecture similar to GPT models

Photo Credit: X/Sam Altman

OpenAI, the company behind ChatGPT, introduced Sora, its first artificial intelligence (AI)-powered text-to-video generation model, on Thursday. The company claims it can generate videos up to 60 seconds long. This is longer than what any of its competitors in the segment offer, including Google's Lumiere, which was unveiled last month. Sora is currently available to red teamers, cybersecurity experts who extensively test software to help companies improve it, and to some content creators. The AI firm also plans to include Coalition for Content Provenance and Authenticity (C2PA) metadata in the future, once the model is deployed in an OpenAI product.

Announcing the AI video generator in a post on X (formerly known as Twitter), the company said, “Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions.” Interestingly, the video length it claims to generate is more than ten times what its rivals offer. Google's Lumiere can generate 5-second-long videos, whereas Runway AI and Pika 1.0 can generate 4-second and 3-second-long videos, respectively.


The X accounts of OpenAI and CEO Sam Altman also shared multiple videos generated by Sora, along with the prompts used to create them. The resulting videos appear highly detailed with seamless motion, something other video generators in the market have struggled with. As per the company, Sora can generate complex scenes with multiple characters, multiple camera angles, specific types of motion, and accurate details of the subject and background. This is possible because the text-to-video model draws on both what the prompt asks for and “how those things exist in the physical world.”

Sora is essentially a diffusion model that uses a transformer architecture similar to GPT models. The data it consumes and generates is represented in units called patches, which are akin to tokens in text-generating models. Videos and images are represented as collections of these smaller units of data, as per the company. Representing visual data this way enabled OpenAI to train the video generation model on content of different durations, resolutions and aspect ratios. In addition to text-to-video generation, Sora can also take a still image and generate a video from it.
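
To make the idea of patches more concrete, here is a minimal sketch in Python of how a video might be cut into small blocks spanning a few pixels and a few frames, each flattened into a token-like list of values. This is an illustration only, not OpenAI's implementation: the function name patchify, the patch sizes, and the toy pixel values are all assumptions made for this example.

```python
def patchify(video, pt, ph, pw):
    """Split a video into non-overlapping blocks ("patches").

    video is a list of frames; each frame is a list of rows of pixel values.
    pt, ph, pw are the patch sizes in frames, rows and columns
    (hypothetical values chosen for illustration).
    """
    frames, height, width = len(video), len(video[0]), len(video[0][0])
    if frames % pt or height % ph or width % pw:
        raise ValueError("video dimensions must be divisible by the patch size")
    patches = []
    for t0 in range(0, frames, pt):
        for y0 in range(0, height, ph):
            for x0 in range(0, width, pw):
                # Flatten one block of pixels into a single token-like list.
                patches.append([
                    video[t][y][x]
                    for t in range(t0, t0 + pt)
                    for y in range(y0, y0 + ph)
                    for x in range(x0, x0 + pw)
                ])
    return patches


def toy_video(frames, height, width):
    """Build a toy video in which every pixel is a distinct integer."""
    return [[[(t * height + y) * width + x for x in range(width)]
             for y in range(height)] for t in range(frames)]


square = patchify(toy_video(4, 4, 4), pt=2, ph=2, pw=2)
wide = patchify(toy_video(2, 2, 6), pt=2, ph=2, pw=2)
print(len(square), len(wide))  # 8 3
```

In this sketch, clips of different lengths and shapes simply produce different numbers of patches of the same size, which is one way to see why a patch-based representation can accommodate varied durations, resolutions and aspect ratios.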


However, the model is not without flaws. OpenAI stated on its website, “The current model has weaknesses. It may struggle with accurately simulating the physics of a complex scene, and may not understand specific instances of cause and effect. For example, a person might take a bite out of a cookie, but afterwards, the cookie may not have a bite mark.”

To ensure the AI tool is not used for creating deepfakes or other harmful content, the company is building tools to help detect misleading content. It also plans to add C2PA metadata to the generated videos, having recently adopted the practice for its DALL-E 3 model. In addition, it is working with red teamers, especially domain experts in areas such as misinformation, hateful content, and bias, to improve the model.


At present, Sora is available only to red teamers and a small number of visual artists, designers, and filmmakers, from whom the company is gathering feedback about the product.

