Fireworks.ai, the Generative AI Firm That Fine-Tunes and Customises Open-Source LLMs For Business Needs

Fireworks.ai offers custom AI models as ready-to-deploy APIs.

Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 27 March 2024 18:27 IST
Highlights
  • Fireworks.ai started its fine-tuning service in March 2024
  • The AI firm allows companies to experiment with multiple AI models
  • Fireworks.ai has reportedly raised $25 million (roughly Rs. 208 crores)

Fireworks.ai co-founder and CEO Lin Qiao previously worked at Meta

Photo Credit: Pexels/Pixabay

Fireworks.ai is a California-based artificial intelligence (AI) startup that takes a different approach to serving enterprises. The firm does not build large language models (LLMs) or foundation models from scratch. Instead, it fine-tunes open-source models and exposes them through an Application Programming Interface (API) so that businesses can deploy AI capabilities easily. Fine-tuning narrows the scope of a model and focuses it on a specific function, which helps reduce instances of AI hallucination and improves the model's performance on that task.

The AI firm was co-founded by Lin Qiao, who is also its CEO. After serving as Senior Director of Engineering at Meta, where she worked on AI frameworks and platforms, Qiao and her team founded the startup in October 2022, as per her LinkedIn profile. In a conversation with TechCrunch, she explained the business model of Fireworks.ai, highlighting the fine-tuning service it provides. She said, “It can be either off the shelf, open source models or the models we tune or the models our customer can tune by themselves. All three varieties can be served through our inference engine API.”

This puts the firm in an unusual position: it is not innovating at the foundation model level, but it bridges the gap between an LLM and a business-ready product. With a primary focus on building APIs, Fireworks.ai lets its enterprise clients plug and play any open-source AI model in its catalogue. As per the report, the company also lets businesses experiment with different AI models to choose the one that fits their needs.

At present, the startup claims to host 89 open-source models, including Mixtral MoE 8x7B Instruct, Meta's Llama 2 70B Chat, Google's Gemma 7B Instruct, and Stability AI's Stable Diffusion XL. The firm offers these models in two formats. The serverless format does not require businesses to configure hardware or deploy models. The on-demand format provides dedicated deployments served on reserved GPU configurations according to business needs.

For the on-demand format, Fireworks.ai has three payment plans: Developer, Business, and Enterprise. The Developer plan has a pay-per-usage structure and a rate limit of 600 requests per minute, while the Enterprise tier offers custom pricing and unlimited rate limits. The serverless format is billed per token, with the price varying by whether a model is text-only, image-only, or multimodal.
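
To give a sense of what a 600-requests-per-minute limit means for a developer, the following is a minimal client-side sketch in Python. It is an illustration only: the class name SlidingWindowLimiter and its parameters are hypothetical, and the article does not describe how Fireworks.ai enforces the limit on its side.

```python
from collections import deque
import time


class SlidingWindowLimiter:
    """Hypothetical client-side throttle: at most `limit` calls per `window` seconds."""

    def __init__(self, limit=600, window=60.0, clock=time.monotonic):
        self.limit = limit      # 600 matches the Developer plan figure quoted above
        self.window = window    # window length in seconds (one minute)
        self.clock = clock      # injectable clock, which makes the class easy to test
        self._stamps = deque()  # send times of requests still inside the window

    def try_acquire(self):
        """Record the call and return True if it fits the budget, else return False."""
        now = self.clock()
        # Discard timestamps that have aged out of the window.
        while self._stamps and now - self._stamps[0] >= self.window:
            self._stamps.popleft()
        if len(self._stamps) < self.limit:
            self._stamps.append(now)
            return True
        return False
```

An application could call try_acquire() before each API request and wait briefly whenever it returns False, so that bursts of traffic stay within the plan's limit rather than being rejected by the service.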


