AI Chip Startup Cerebras Releases Open Source ChatGPT-Like Models for Free: All Details

Silicon Valley-based Cerebras released seven models, all trained on its AI supercomputer called Andromeda.

Advertisement
By Reuters | Updated: 29 March 2023 12:02 IST
Highlights
  • OpenAI's chatbot ChatGPT has 175 billion parameters
  • Cerebras's seven models range from small parameters to large
  • Smaller models could be leveraged on smaller devices, smartphones
AI Chip Startup Cerebras Releases Open Source ChatGPT-Like Models for Free: All Details

Most of the AI models today like ChatGPT are trained on Nvidia's chips

Photo Credit: Cerebras

Artificial intelligence chip startup Cerebras Systems on Tuesday said it released open source ChatGPT-like models for the research and business community to use for free in an effort to foster more collaboration.

Silicon Valley-based Cerebras released seven models all trained on its AI supercomputer called Andromeda, including smaller 111 million parameter language models to a larger 13 billion parameter model.

"There is a big movement to close what has been open-sourced in AI...it's not surprising as there's now huge money in it," said Andrew Feldman, founder, and CEO of Cerebras. "The excitement in the community, the progress we've made, has been in large part because it's been so open."

Models with more parameters are able to perform more complex generative functions.

Advertisement

OpenAI's chatbot ChatGPT launched late last year, for example, has 175 billion parameters and can produce poetry and research, which has helped draw large interest and funding to AI more broadly.

Cerebras said the smaller models can be deployed on phones or smart speakers while the bigger ones run on PCs or servers, although complex tasks like large passage summarization require larger models.

Advertisement

However, Karl Freund, a chip consultant at Cambrian AI, said bigger is not always better.

"There's been some interesting papers published that show that (a smaller model) can be accurate if you train it more," said Freund. "So there's a trade off between bigger and better trained."

Advertisement

Feldman said his biggest model took a little over a week to train, work that can typically take several months, thanks to the architecture of the Cerebras system, which includes a chip the size of a dinner plate built for AI training.

Most of the AI models today are trained on Nvidia's chips, but more and more startups like Cerebras are trying to take share in that market.

The models trained on Cerebras machines can also be used on Nvidia systems for further training or customization, said Feldman.

© Thomson Reuters 2023


From smartphones with rollable displays or liquid cooling, to compact AR glasses and handsets that can be repaired easily by their owners, we discuss the best devices we've seen at MWC 2023 on Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.
Affiliate links may be automatically generated - see our ethics statement for details.
 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. ROG Xbox Ally and ROG Xbox Ally X Handhelds Unveiled at Xbox Games Showcase
  2. WWDC 2025: How to Watch the Apple Keynote Live and What to Expect
  1. iOS 26 to Feature “Liquid Glass” UI Elements in Anticipation of 2027 iPhone Models: Report
  2. Microsoft Unveils ROG Xbox Ally and ROG Xbox Ally X Handheld PCs at Xbox Games Showcase
  3. Xiaomi SU7 Ultra Coming to Gran Turismo 7 on PlayStation With a Future Update
  4. NASA-ISRO Launch Joint Space Biology Experiments on Axiom Mission 4
  5. Scientists Discover Clicking Sounds in Rig Sharks for the First Time
  6. WWDC 2025: How to Watch the Apple Keynote Live and What to Expect
  7. Scientists Discover Heaviest Proton-Emitting Nucleus After Nearly 30 Years
  8. Hubble Unveils Galactic ‘Cotton Candy’ in the Large Magellanic Cloud
  9. James Webb Telescope Maps Fiery Atmosphere of Turbulent Exoplanet WASP-121b
  10. SpaceX Launches 27 Starlink Satellites from California Using Veteran Falcon 9 Booster
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.