Gemini 1.5 Flash-8B With Lowest Token Cost Among Gemini Family Now Available

Gemini 1.5 Flash-8B was first released last month as an experimental variant of Gemini 1.5 Flash.

Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 4 October 2024 14:22 IST
Highlights
  • Google has doubled the rate limits for Gemini 1.5 Flash-8B
  • The AI model costs $0.15 (roughly Rs. 12.5) per 1 million output tokens
  • Gemini 1.5 Flash-8B is said to be optimised for speed and efficiency

Developers can access Gemini 1.5 Flash-8B for free via Google AI Studio and the Gemini API

Photo Credit: Google

Gemini 1.5 Flash-8B, the latest entrant in the Gemini family of artificial intelligence (AI) models, is now generally available for production use. On Thursday, Google announced the general availability of the model, describing it as a smaller and faster version of Gemini 1.5 Flash, which was introduced at Google I/O. According to the company, the model offers low-latency inference and more efficient output generation. More importantly, the tech giant stated that the Flash-8B AI model has the “lowest cost per intelligence of any Gemini model”.

Gemini 1.5 Flash-8B Now Generally Available

In a developer blog post, the Mountain View-based tech giant detailed the new AI model. Gemini 1.5 Flash-8B was distilled from the Gemini 1.5 Flash AI model, which itself focuses on faster processing and more efficient output generation. The company said that Google DeepMind developed this even smaller and faster version of the AI model over the last few months.

Despite the model's smaller size, the tech giant claims that it “nearly matches” the performance of the 1.5 Flash model across multiple benchmarks. The areas covered include chat, transcription, and long-context language translation.


One major benefit of the AI model is its cost-effectiveness. Google said that Gemini 1.5 Flash-8B will offer the lowest token pricing in the Gemini family. Developers will pay $0.15 (roughly Rs. 12.5) per one million output tokens, $0.0375 (roughly Rs. 3) per one million input tokens, and $0.01 (roughly Rs. 0.8) per one million tokens on cached prompts.
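To put the per-token prices in perspective, here is a minimal Python sketch that estimates a bill from the three rates quoted above. The function name `estimate_cost_usd` and the workload figures in the usage note are hypothetical, chosen only for illustration; the sketch also assumes the quoted rates apply uniformly to every token, and actual billing terms may differ.

```python
from decimal import Decimal

# Per-million-token prices in USD, as quoted in the article.
PRICE_PER_MILLION = {
    "input": Decimal("0.0375"),
    "output": Decimal("0.15"),
    "cached": Decimal("0.01"),
}

ONE_MILLION = Decimal(1_000_000)


def estimate_cost_usd(input_tokens, output_tokens, cached_tokens=0):
    """Estimate the cost in USD for the given token counts.

    Hypothetical helper: assumes flat per-token rates with no tiers.
    """
    counts = {
        "input": input_tokens,
        "output": output_tokens,
        "cached": cached_tokens,
    }
    # Scale each per-million price by the number of tokens of that kind.
    return sum(
        PRICE_PER_MILLION[kind] * count / ONE_MILLION
        for kind, count in counts.items()
    )
```

For example, a hypothetical workload of 10 million input tokens and 2 million output tokens would come to `estimate_cost_usd(10_000_000, 2_000_000)`, which is 0.375 + 0.30 = $0.675 under these assumptions.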

Additionally, Google is doubling the rate limits of the 1.5 Flash-8B AI model. Developers can now send up to 4,000 requests per minute (RPM) while using this model. Explaining the decision, the tech giant stated that the model is suited for simple, high-volume tasks. Developers who wish to try out the model can do so free of charge via Google AI Studio and the Gemini API.
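As a rough illustration of what a 4,000 RPM ceiling means for a high-volume job, the following Python sketch works out the minimum spacing between evenly paced requests and the fewest whole minutes needed to send a batch. The helper names and the batch size in the usage note are hypothetical, and the sketch assumes the limit is enforced per calendar minute, which the article does not specify.

```python
RPM_LIMIT = 4000  # requests per minute, as quoted in the article


def min_interval_seconds(rpm=RPM_LIMIT):
    """Seconds between requests if they are spread evenly across a minute."""
    return 60.0 / rpm


def minutes_to_send(total_requests, rpm=RPM_LIMIT):
    """Fewest whole minutes needed to send a batch without exceeding rpm.

    Hypothetical helper: uses integer ceiling division.
    """
    return (total_requests + rpm - 1) // rpm
```

Under these assumptions, evenly paced requests would be about 15 milliseconds apart, and a hypothetical batch of 100,000 requests would take at least `minutes_to_send(100_000)`, or 25 minutes.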

 
