Google Introduces Gemini 3.1 Flash-Lite as Its Fastest and Most Cost-Efficient AI Model

The Gemini 3.1 Flash-Lite is rolling out in preview via the Gemini API in AI Studio and Vertex AI.

Advertisement
Written by Akash Dutta, Edited by Rohan Pal | Updated: 5 March 2026 15:32 IST
Highlights
  • The AI model costs $0.25 per million input tokens
  • Gemini 3.1 Flash-Lite costs $1.50 per million output tokens
  • Google claims the new model outperforms 2.5 Flash in response speed

Gemini 3.1 Flash-Lite achieved an Elo score of 1432 on the Arena.ai Leaderboard

Photo Credit: Google

Google introduced the Gemini 3.1 Flash-Lite artificial intelligence (AI) model on Thursday. Calling it the fastest and the most cost-efficient AI model in the Gemini 3 series, the Mountain View-based tech giant said it is designed for high-volume developer workloads. The model is currently not available to end users and has been reserved for developers and enterprises via specific channels. The company also claimed that the model's output speed is higher than that of the 2.5 series. Notably, the Gemini 3.1 Flash-Lite is currently only available in preview.

Gemini 3.1 Flash-Lite Is Here

In a blog post, the tech giant announced and detailed its latest Gemini 3.1 series large language model (LLM). Currently, the Gemini 3.1 Flash-Lite can be accessed in preview via the Gemini application programming interface (API) in Google AI Studio, and via Vertex AI for enterprises.

Coming to capabilities, the company said the 3.1 Flash-Lite outperforms 2.5 Flash with a “2.5X faster Time to First Answer Token,” and a 45 percent increase in output speed, citing the Artificial Analysis benchmark. It is also said to have achieved an Elo score of 1432 on the Arena.ai leaderboard. It is also claimed to outperform GPT-5 mini, Claude 4.5 Haiku, and Grok 4.1 Fast in terms of output speed.

Advertisement

In AI Studio and Vertex AI, developers will be able to access the LLM in standard and thinking modes, with the latter allowing users to control the thinking time for a task. Highlighting some use cases, Google said the model can handle high-volume translation and content moderation, and can also be used for complex tasks, such as generating user interfaces and dashboards, creating simulations, or just following instructions.

Advertisement

The company also claimed that the Gemini 3.1 Flash-Lite is a cost-efficient AI model, with one million input tokens priced at $0.25 (roughly Rs. 23) and output tokens priced at $1.5 (roughly Rs. 137) per million tokens. In comparison, the Gemini 2.5 Flash costs $0.3 (roughly Rs. 27.5) per million input and $2.5 (roughly Rs. 229) per million output tokens.

 

For details of the latest launches and news from Samsung, Xiaomi, Realme, OnePlus, Oppo and other companies at the Mobile World Congress in Barcelona, visit our MWC 2025 hub.

Advertisement

Related Stories

Popular Mobile Brands
  1. Moto Watch Review: The Best Smartwatch Under Rs. 6,000 in 2026?
  2. Vivo T5x 5G AnTuTu Score Exceeds 1 Million Points, Will Launch in India Soon
  3. Realme Narzo Power 5G With 10,001mAh Battery Launched in India: Price, Specifications
  4. Nothing Phone 4a, Phone 4a Pro Launched in India at This Price
  5. Vivo X300 FE Launched as Global Version of This Chinese Smartphone
  6. Nothing Phone 4a Pro Teaser Hints at the Presence of This Phone 3 Feature
  7. iPhone 17e to MacBook Neo: Here's Everything Apple Launched This Week
  8. Lava Bold 2 5G India Launch Teased; Company Teases Design Ahead of Debut
  9. Nothing Launches Headphone (a) With Adaptive ANC, Spatial Audio Support
  10. Just a Day After Releasing GPT-5.3 Instant, OpenAI Teases GPT-5.4 Model
  1. OpenAI Teases GPT-5.4 AI Model Launch Just a Day After Releasing GPT-5.3 Instant
  2. Nothing Headphone (a) Launched With Adaptive ANC, Customisable Controls: Price, Specifications
  3. Granny OTT Release Date: When and Where to Watch the Village Mystery Thriller Online?
  4. Andhaka OTT Release: Where to Watch the Telugu Drama-Thriller Online?
  5. Pookie OTT Release: When and Where to Watch Vijay Antony’s Romantic Drama Online?
  6. WhatsApp Plus Paid Subscription Reportedly in Development With Additional Customisation Options, Up to 20 Pinned Chats
  7. Samsung Patent Hints at Potential Clamshell-Style Foldable With Two Cover Displays
  8. Google Introduces Gemini 3.1 Flash-Lite as Its Fastest and Most Cost-Efficient AI Model
  9. Nothing Phone 4a Launched in India With Glyph Bar Interface Alongside Nothing Phone 4a Pro: Price, Specs
  10. Oppo Find N6 Key Features, Colour Options Leaked Ahead of Imminent China Launch
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.