Gemini 1.5 Flash-8B With Lowest Token Cost Among Gemini Family Now Available

Gemini 1.5 Flash-8B is an experimental version of Gemini 1.5 Flash, first released last month.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 4 October 2024 14:22 IST
Highlights
  • Google has doubled the rate limits with Gemini 1.5 Flash-8B
  • The AI model costs $0.15 (roughly Rs. 12.5) per 1 million output tokens
  • Gemini 1.5 Flash-8B is said to be optimised for speed and efficiency

Developers can access Gemini-1.5 Flash-8B for free via Google AI Studio and the Gemini API

Photo Credit: Google

Gemini 1.5 Flash-8B, the latest entrant in the Gemini family of artificial intelligence (AI) models, is now generally available for production use. On Thursday, Google announced the general availability of the model, highlighting that it was a smaller and faster version of the Gemini 1.5 Flash which was introduced at Google I/O. Due to being fast, it has a low latency inference and more efficient output generation. More importantly, the tech giant stated that the Flash-8B AI model is the “lowest cost per intelligence of any Gemini model”.

Gemini 1.5 Flash-8B Now Generally Available

In a developer blog post, the Mountain View-based tech giant detailed the new AI model. The Gemini 1.5 Flash-8B was distilled from the Gemini 1.5 Flash AI model, which was focused on faster processing and more efficient output generation. The company now claims that Google DeepMind developed this even smaller and faster version of the AI model in the last few months.

Despite being a smaller model, the tech giant claims that it “nearly matches” the performance of the 1.5 Flash model across multiple benchmarks. Some of these include chat, transcription, and long context language translation.

Advertisement

One major benefit of the AI model is its price effectiveness. Google said that the Gemini 1.5 Flash-8B will offer the lowest token pricing in the Gemini family. Developers will have to pay $0.15 (roughly Rs. 12.5) per one million output tokens, $0.0375 (roughly Rs. 3) per one million input tokens, and $0.01 (roughly Rs. 0.8) per one million tokens on cached prompts.

Advertisement

Additionally, Google is doubling the rate limits of the 1.5 Flash-8B AI model. Now, developers can send up to 4,000 requests per minute (RPM) while using this model. Explaining the decision, the tech giant stated that the model is suited for simple, high-volume tasks. Developers who wish to try out the model can do so via Google AI Studio and the Gemini API free of charge.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Top OTT Releases This Week: Baramulla, Maharani Season 4, Bad Girl, and More
  2. Realme GT 8 Pro Will Launch in India on This Date
  3. Airtel Begins Transition to Dual 5G Network in India to Roll Out 5G Advanced
  4. Ray-Ban Meta Glasses Will Go on Sale via Amazon, Flipkart on This Date
  5. Huawei Mate 70 Air With 6.6mm Slim Form Factor Launched: See Price
  6. Samsung Galaxy S26 Ultra Tipped to Launch Without Major Camera Upgrades
  7. MeitY Reveals India's Big Plan to Govern Artificial Intelligence
  8. Canon EOS R6 Mark III With 7K Video Recording Support Launched in India
  1. Bank of England Plans to Match US Pace on Stablecoin Regulation: Report
  2. Indian Rhythm Action Game Suri: The Seventh Note Gets Gameplay Trailer; Launch Set for 2026
  3. Miami Mayor Francis Suárez Says His Bitcoin Salary Has Tripled in Value
  4. MeitY Unveils India AI Governance Framework with Key Principles and Roadmap
  5. Canon EOS R6 Mark III Mirrorless Camera With 32.5-Megapixel Sensor Launched in India: Price, Features
  6. iPhone Air 2 Spotted in Leaked Design Render That Hints at Dual Rear Camera Setup
  7. WhatsApp Username Feature Said to Roll Out in 2026, Business Accounts Could Also Get Access
  8. Samsung Galaxy S26 Ultra Camera, Charging Specifications Leaked Alongside Exynos 2600 Details
  9. Tinder Wants to See Through Your Camera Roll to Suggest Potential Matches
  10. Gemini Crypto Exchange Aims to Launch Regulated Prediction Market Contracts
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.