OpenAI Unveils Sora, an AI-Powered Text-to-Video Generator Capable of Creating One-Minute-Long Clips

OpenAI said Sora can create multiple shots within a single generated video.

Advertisement
Written by Akash Dutta, Edited by Manas Mitul | Updated: 16 February 2024 12:12 IST
Highlights
  • Sora can also generate multiple characters with specific motion
  • Currently, Sora is available to red teamers to assess harms or risks
  • The AI video generator is based on a diffusion model

Sora uses a transformer architecture similar to GPT models

Photo Credit: X/Sam Altman

OpenAI, the company behind ChatGPT, introduced its first artificial intelligence (AI)-powered text-to-video generation model Sora on Thursday. The company claims it can generate up to 60-second-long videos. This is longer than any of its competitors in the segment, including Google's Lumiere, which was unveiled last month. Sora is currently available to red teamers, cybersecurity experts who extensively test software to help companies improve their software, and some content creators. The AI firm also plans to include Coalition for Content Provenance and Authenticity (C2PA) metadata in the future once the model is deployed in an OpenAI product.

Announcing the AI video generator in a post on X (formerly known as Twitter), the company said, “Sora can create videos of up to 60 seconds featuring highly detailed scenes, complex camera motion, and multiple characters with vibrant emotions.” Interestingly, the length of the video it claims to generate is more than ten times of what its rivals offer. Google's Lumiere can generate 5-second-long videos, whereas Runway AI and Pika 1.0 can generate 4-second and 3-second-long videos, respectively.

Advertisement

The X account of OpenAI and CEO Sam Altman also shared multiple videos generated by Sora, along with the prompts used to create them. The resulting videos appear highly detailed with seamless motion, something other video generators in the market have somewhat struggled with. As per the company, it can generate complex scenes with multiple characters, multiple camera angles, specific types of motion, and accurate details of the subject and background. This is possible because the text-to-video model uses both the prompt as well as “how those things exist in the physical world.”

Sora is essentially a diffusion model which uses a transformer architecture similar to GPT models. Similarly, the data it consumes and generates is represented in a term called patches, which is again akin to tokens in text-generating models. Patches are collections of videos and images, bundled in small portions, as per the company. Using this visual data enabled OpenAI to train the video generation model in different durations, resolutions and aspect ratios. In addition to text-to-video generation, Sora can also take a still image and generate a video from it.

Advertisement

However, it is not without flaws either. OpenAI stated on its website, “The current model has weaknesses. It may struggle with accurately simulating the physics of a complex scene, and may not understand specific instances of cause and effect. For example, a person might take a bite out of a cookie, but afterwards, the cookie may not have a bite mark.”

To ensure the AI tool is not used for creating deepfakes or other harmful content, the company is building tools to help detect misleading content. It also plans to use C2PA metadata in the generated videos, after adopting the practice for its DALL-E 3 model recently. It is also working with red teamers, especially domain experts in areas of misinformation, hateful content, and bias, to improve the model.

Advertisement

At present, it is only available to the red teamers and a small number of visual artists, designers, and filmmakers to gain feedback about the product.


Is the Samsung Galaxy Z Flip 5 the best foldable phone you can buy in India right now? We discuss the company's new clamshell-style foldable handset on the latest episode of Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.
Affiliate links may be automatically generated - see our ethics statement for details.
 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. OnePlus Buds Ace 3 Launched With Up to 54 Hours of Total Battery Life
  2. This Realme 16 Series Phone Could Launch in India Soon
  3. Top Budget Smartwatches with AMOLED Display Under Rs 3,000
  4. OpenAI Falls Short of Revenue and User Targets as It Races Toward IPO
  5. Steam Controller Will Launch on May 4: Check Price, Features
  6. Lenovo Idea Tab Pro Gen 2 Launched in India With 10,200mAh Battery
  7. Remake of First Assassin's Creed Game Said to Be in the Works at Ubisoft
  1. Apple Announces Monthly Payment Option for Annual Subscriptions on App Store
  2. Biker OTT Release Date Revealed: Know Everything About Plot, Cast, and More
  3. OpenAI Falls Short of Revenue and User Targets as It Races Toward IPO, WSJ Reports
  4. YouTube Tests 'Ask YouTube' AI Chatbot That Offers Smart Responses With Videos, Shorts
  5. Realme 16x 5G India Launch Seems Imminent as Storage Options, Colourways Surface Online
  6. Motorola Razr+ 2026 Leaked Renders Show Bigger Cover Screen, Design Changes
  7. Apple Reportedly Developing New AI-Powered Photo Editing Tools for iPhone, iPad, and Mac
  8. James Webb Space Telescope Reveals Cosmic Buckyballs in Distant Nebula
  9. OnePlus Buds Ace 3 Launched With Up to 55dB ANC, Up to 54 Hours of Total Battery Life: Price, Features
  10. Remake of First Assassin's Creed Game Said to Be in the Works at Ubisoft
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.