Google Lumiere Multimodal AI Video Generation Tool Unveiled; Can Create 5-Second Videos From Text, Images

Google Lumiere supports text-to-video and image-to-video generation and has options to create stylised videos.

Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 29 January 2024 14:53 IST
Highlights
  • Google Lumiere uses the Space-Time U-Net diffusion model
  • It generates 80 frames for each 5-second-long video
  • Lumiere joins existing AI video models by Runway and Pika

Google Lumiere is currently not available to the public

Photo Credit: Inbar Mosseri/Google

Google unveiled its latest artificial intelligence (AI) model, Lumiere, last week. The new AI model is a multimodal video generation tool that can generate 5-second-long videos. It supports both text-to-video and image-to-video generation and joins existing AI models such as Runway Gen-2 and Pika 1.0. According to Google, Lumiere uses a Space-Time U-Net (STUNet) architecture that changes how motion is generated in an AI video, making it appear more realistic. The model is not yet open to the public.

In an accompanying preprint paper, the research team behind Lumiere explained that the major innovation in motion comes from generating the entire video in a single pass instead of putting together still frames. As a result, both the spatial (the objects in the video) and temporal (how things move around in the video) aspects of the video are generated simultaneously. For the layperson, this results in motion that looks closer to how it occurs in nature. To achieve this, Lumiere generates 80 frames for each 5-second video (16 frames per second), compared to Stable Video Diffusion's 25 frames.


“By deploying both spatial and (importantly) temporal down- and up-sampling and leveraging a pre-trained text-to-image diffusion model, our model learns to directly generate a full-frame-rate, low-resolution video by processing it in multiple space-time scales,” the paper added.
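As a rough illustration of what processing a clip "in multiple space-time scales" could look like, the Python sketch below only tracks how the shape of a video changes when it is down-sampled in both time and space and then brought back up. It is not Lumiere's implementation: the function name `space_time_pyramid`, the factor of 2 per level, the two levels, and the 128×128 resolution are all assumptions made for illustration. Only the 80-frame count comes from the article.

```python
from typing import List, Tuple

Shape = Tuple[int, int, int]  # (frames, height, width)


def space_time_pyramid(shape: Shape, levels: int, factor: int = 2) -> List[Shape]:
    """Return the clip shape at each down-sampling level (hypothetical helper).

    Every level shrinks the clip in time (frames) as well as in space
    (height, width), which is the idea the quoted passage describes.
    """
    shapes = [shape]
    for _ in range(levels):
        frames, height, width = shapes[-1]
        if frames % factor or height % factor or width % factor:
            raise ValueError("each dimension must be divisible by the factor")
        shapes.append((frames // factor, height // factor, width // factor))
    return shapes


# 80 frames is the figure from the article; 128x128 is an assumed low resolution.
clip = (80, 128, 128)

down_path = space_time_pyramid(clip, levels=2)  # contracting half of a U-Net
up_path = list(reversed(down_path))             # expanding half restores full size

# How many values per channel exist at the finest and the coarsest level.
full_size = clip[0] * clip[1] * clip[2]
coarse = down_path[-1]
coarse_size = coarse[0] * coarse[1] * coarse[2]

for level, level_shape in enumerate(down_path):
    print(f"level {level}: frames={level_shape[0]}, size={level_shape[1]}x{level_shape[2]}")
print(f"coarsest level holds 1/{full_size // coarse_size} of the full clip")
```

Under these assumptions the coarsest level holds far fewer values than the full clip, which suggests why working on a compressed space-time representation makes it feasible to handle all 80 frames at once rather than a handful of keyframes.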

While Google Lumiere cannot be tested at the moment, the website is live and enthusiasts can view various videos created using the AI model, along with the text prompts and input images used to create them. The model can also generate videos in various styles, create cinemagraphs in which only a selected part of an image is animated, and perform inpainting, in which the AI fills in a masked-out region of a video or image based on the prompt.


Google's latest AI video generation tool competes with existing AI models such as Runway Gen-2, which was launched in March 2023, and Pika Labs' Pika 1.0, both of which are accessible to the public. While Pika can create 3-second-long videos (which can be extended by 4 more seconds), Runway can generate videos up to 4 seconds long. Both models are multimodal and allow video editing as well.

