Search

Alibaba Releases Open-Source Wan 2.1 Suite of AI Video Generation Models, Claimed to Outperform OpenAI’s Sora

Alibaba’s Wan 2.1 T2V-1.3B video model can generate a 5-second 480p video using the Nvidia RTX 4090 in four minutes.

Advertisement
Highlights
  • Alibaba’s Wan 2.1 supports Chinese and English text prompts
  • It can generate videos using both text and image inputs
  • The team used a new 3D causal VAE architecture for the models
Alibaba Releases Open-Source Wan 2.1 Suite of AI Video Generation Models, Claimed to Outperform OpenAI’s Sora

The open-source Wan 2.1 video models are available with the Apache 2.0 license

Photo Credit: Reuters

Alibaba released a suite of artificial intelligence (AI) video generation models on Wednesday. Dubbed Wan 2.1, these are open-source models that can be used for both academic and commercial purposes. The Chinese e-commerce giant released the models in several parameter-based variants. Developed by the company's Wan team, these models were first introduced in January and the company claimed that Wan 2.1 can generate highly realistic videos. Currently, these models are being hosted on the AI and machine learning (ML) hub Hugging Face.

Alibaba Introduces Wan 2.1 Video Generation Models

The new Alibaba video AI models are hosted on Alibaba's Wan team's Hugging Face page. The model pages also detail the Wan 2.1 suite of large language models (LLMs). There are four models in total — T2V-1.3B, T2V-14B, I2V-14B-720P, and I2V-14B-480P. The T2V is short for text-to-video while the I2V stands for image-to-video.

The researchers claim that the smallest variant, Wan 2.1 T2V-1.3B, can be run on a consumer-grade GPU with as little as 8.19GB vRAM. As per the post, the AI model can generate a five-second-long video with 480p resolution using an Nvidia RTX 4090 in about four minutes.

While the Wan 2.1 suite is aimed at video generation, they can also perform other functions such as image generation, video-to-audio generation, and video editing. However, the currently open-sourced models are not capable of these advanced tasks. For video generation, it accepts text prompts in Chinese and English languages as well as image inputs.

Coming to the architecture, the researchers revealed that the Wan 2.1 models are designed using a diffusion transformer architecture. However, the company innovated the base architecture with new variational autoencoders (VAE), training strategies, and more.

Most notably, the AI models use a new 3D causal VAE architecture dubbed Wan-VAE. It improves spatiotemporal compression and reduces memory usage. The autoencoder can encode and decode unlimited-length 1080p resolution videos without losing historical temporal information. This enables consistent video generation.

Based on internal testing, the company claimed that the Wan 2.1 models outperform OpenAI's Sora AI model in consistency, scene generation quality, single object accuracy, and spatial positioning.

These models are available under the Apache 2.0 licence. While it does allow for unrestricted usage for academic and research purposes, commercial usage comes with multiple restrictions.

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

 
Show Full Article
Please wait...
Advertisement

Related Stories

Popular Mobile Brands
  1. Moto G96 5G Launched in India With 50-Megapixel Sony Lytia 700C Camera
  2. OnePlus Pad Lite With 11-Inch Display, 9,340mAh Battery Launched
  3. Realme 15 Pro 5G to Launch in India With Snapdragon 7 Gen 4 Chipset
  4. Oppo Reno 14 Gets a New Variant With a Colour Changing Rear Panel
  5. Samsung Galaxy Buds 3 Pro's Amazon Prime Day 2025 Offer Revealed
  6. OnePlus Nord CE 5 Review
  7. Samsung Galaxy Unpacked 2025 LIVE: Samsung Galaxy Z Fold 7, Flip 7 Expected
  8. WhatsApp's AI-Powered Chat Wallpaper Feature Is Coming to iOS
  9. Microsoft July 2025 Security Update Fixes One Zero-Day, 136 Other Flaws
  10. Google Pixel Phones Receiving Monthly Software Update for July 2025
  1. Dreame F10 Robot Vacuum Cleaner Launched in India With 300 Minutes of Run Time: Price, Specifications
  2. Apple Appoints Indian-Origin Sabih Khan as New COO; Jeff Williams Shifts Focus to Apple Watch
  3. Meta Invests $3.5 Billion in Ray-Ban Maker EssilorLuxottica in AI Glasses Push
  4. The Last of Us Part 2 Remastered Gets New Free Update That Allows Players to Experience Story Chronologically
  5. Gemini AI Upgraded to Support Google Home’s Broadcast Messages Feature
  6. Samsung W26 Foldable Phone Allegedly Spotted on China's 3C Site; Charging Speed Tipped
  7. Redmi 15C Leaked Renders Show Design, Colour Options; Reportedly Spotted on NBTC Site Alongside Poco C85
  8. Axiom 4 Mission Crew Settles Down at ISS, Begins Conducting Biomedical Research
  9. Microsoft Fixes One Zero-Day Vulnerability, 136 Other Flaws With July 2025 Windows Security Update
  10. Tata Motors Brings Dolby Atmos to Harrier.ev Powered by Harman JBL Black Audio System
Gadgets 360 is available in
Download Our Apps
App Store App Store
Available in Hindi
App Store
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.
Trending Products »
Latest Tech News »