Stability AI and Arm Release Lightweight Tex-to-Audio Model Optimised for Fast On-Device Generation

The new text-to-audio AI model developed by Stability AI and Arm is called Stable Audio Open Small.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 15 May 2025 12:24 IST
Highlights
  • Stability AI has open-sourced the Stable Audio Open Small AI model
  • It is a 341 million parameter text-to-audio model
  • Stable Audio Open Small can produce up to 11 seconds of audio

Stability AI says the text-to-audio model can run locally on a smartphone

Photo Credit: Stability AI

Stability AI developed a new text-to-audio generation artificial intelligence (AI) model in partnership with Arm. Announced on Wednesday, the new model is dubbed Stable Audio Open Small, and it is said to generate short audio samples using text prompts. The London-based AI firm said that the model is lightweight and is optimised to run entirely on Arm CPUs. It is also said to have a fast generation time, making it useful for bulk use cases. The open-source audio model is available to download from GitHub and Hugging Face.

Stability AI Releases Stable Audio Open Small

In a newsroom post, the AI firm detailed the new large language model. It is a distilled version of the Stable Audio Open model, which was released in June 2024, and can generate up to 47 seconds of audio. The smaller text-to-audio model was designed with a focus on faster generation speed and smaller size.

Advertisement

The Stable Audio Open Small is a 341 million parameter model that can generate up to 11 seconds of audio. The company claims that it can generate an audio sample in less than eight seconds while running locally on a smartphone. Interestingly, Stability AI and Arm announced their collaboration for generative audio creation at Mobile World Congress (MWC) 2025.

Coming to the architecture and training, the Stable Audio Open Small is a latent diffusion model based on a transformer architecture. It is trained on a dataset of 4,86,492 audio recordings. The company said that all audio files are licensed. For text conditioning, a publicly available pre-trained T5 model was used. The AI firm used the Adversarial Relativistic-Contrastive (ARC) algorithm in the post-training phase to improve prompt adherence and increase the inference speed.

Advertisement

As per the company, this text-to-audio model is suited for creating drum loops, foley, instrument riffs, and ambient textures. Due to its small size, it can be deployed on Arm-powered smartphones as well as edge devices. The model can also be used in scenarios where real-time generation and responsiveness matter.

Stable Audio Open Small's model weights can be downloaded on the AI firm's Hugging Face listing, and the code base can be found on the GitHub listing. The AI model is available for commercial and non-commercial use under the permissive Stability AI Community Licence.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Vivo V70 Lite 5G Silently Launched in Select Markets With These Features
  2. Vivo X Fold 6 Launch Teased; Will Arrive with 'OriginOS 6 Fold' Skin
  3. How to Watch WWDC 2026 Live on YouTube, Apple TV, and More
  4. Samsung Galaxy A27 Spotted in Leaked Mint Colourway, Might Launch Soon
  1. Samsung Galaxy A27 Leaked in New Mint Colour Option Ahead of Anticipated Launch
  2. Vivo X Fold 6 Confirmed to Launch in China Soon With OriginOS 6 Fold Skin, New AI Features
  3. ChatGPT Gets Lockdown Mode to Protect Users From Prompt Injection Attacks, Reduce Data Theft Risks
  4. Vivo V70 Lite 5G Launched With 50-Megapixel Sony Camera, Dimensity 7400 Turbo SoC: Price, Specifications
  5. Ginny Wedss Sunny 2 OTT Release: When and Where to Watch Avinash Tiwary and Medha Shankr’s Rom-Com
  6. How to Watch WWDC 2026 Live on YouTube, Apple TV, and More: iOS 27, New Siri Expected
  7. SETI Scientists Searched Interstellar Comet 3I/ATLAS for Alien Signals
  8. 29 OTT Release Date: When and Where to Watch Vidhu and Preethi Asrani’s Romantic Drama Online
  9. Jimmi: Paisa Aur Paap Season 1 Now Available For Streaming Online: What You Need To Know
  10. Patriot Now Streaming on Zee5: Cast, Plot, Trailer, Release Date and More
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.