Meta’s New Open-Source SAM Audio AI Model Can Isolate Sounds From Audio Mixtures

Meta’s new SAM Audio AI model lets users isolate and edit sounds from mixed audio using text, visual or time prompts.

Advertisement
Written by Akash Dutta, Edited by Ketan Pratap | Updated: 17 December 2025 16:20 IST
Highlights
  • SAM Audio is currently available in the Segment Anything Playground
  • The open-source model can also be downloaded from GitHub
  • Meta says the model can be used for noise filtering and isolating sounds

Meta’s release of SAM Audio comes a month after it released SAM 3 and SAM 3D

Photo Credit: Meta

Meta has released another new artificial intelligence (AI) model in the Segment Anything Model (SAM) family. On Tuesday, the Menlo Park-based tech giant released SAM Audio, a large language model (LLM) that can identify, separate, and isolate particular sounds in an audio mixture. The model can handle audio editing based on either text prompts, visual signals, or time stamps, automating the entire workflow. Like the other models in the SAM series, it is also an open-source model that comes with a permissive licence.

Meta Introduces SAM Audio AI Model

In a newsroom post, the tech giant announced and detailed its new audio-focused AI model. SAM Audio is currently available to download either via Meta's website, GitHub listing, or Hugging Face. Those users who would prefer to use the model's capabilities without running it locally can visit the Segment Anything Playground to test it out. The website also allows users to access all the other SAM models. Notably, it is available under the SAM Licence, a custom, Meta-owned licence that allows both research-related and commercial usage.

Meta describes SAM Audio as a unified AI audio model that uses text-based commands, visual cues, and time-based instructions to identify and separate sounds from a complex mixture. Traditionally, audio editing, especially isolating individual sound elements, has required specialised tools and manual work, often with limited precision. Meta's latest entry in the SAM series addresses this gap.

Advertisement

The model supports three types of prompting. With text prompts, users can type descriptions, such as “drum beat” or “background noise.” Visual prompting allows users to click on an object or a human in a video, and if a sound is being produced from there, it will be isolated. Finally, time span prompting lets anyone mark a segment of the timeline to target a sound.

Advertisement

To highlight an example, imagine there is an audio file of a person speaking on the phone while music plays in the background, and children's voices can be heard playing at a distance. Users can isolate any of these audio sources, be it the primary voice, the music, or the ambient noise made by the children, with a single command. Gadgets 360 staff members briefly tested the model and found it to be both fast and efficient. However, we were not able to test it in real-world situations.

Under the hood, SAM Audio is a generative separation model that extracts both target and residual stems from an audio mixture. It is equipped with a flow-matching Diffusion Transformer and operates in a Descript Audio Codec - Variational Autoencoder Variant (DAC-VAE) space.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement
Popular Mobile Brands
  1. OnePlus 15R, OnePlus 15R Ace Edition Launch Today: All You Need to Know
  2. Realme 16 Pro+ 5G Listed on Certification Website With These Specifications
  3. Apple's iPhone 18 Pro, iPhone Fold May Feature a Relocated Selfie Camera
  4. Dhurandhar OTT Release Date: What We Know So Far
  5. OnePlus 15, Nord CE 5 Prices Slashed During Community Sale: See Offers
  6. GTA 6 Characters Guide: Know Every Character Rockstar Has Teased So Far
  7. Moto G Power (2026) Launched With MediaTek Dimensity 6300 SoC: Details
  8. Google Pay Brings Its First Co-Branded UPI-Powered Digital Credit Card
  9. Motorola Signature Phone Could Launch Soon: See Leaked Design, Colourways
  1. Flex By Google Pay: Google Partners With Axis Bank to Introduce UPI-Powered, Digital Credit Card
  2. Warner Bros. Plans to Reject Paramount Bid on Funding, Terms
  3. Amazon Pay Adds Support for Biometric Authentication for UPI Payments in India
  4. The Pitt Season 2 OTT Release Date Revealed: Know When and Where to Watch it Online
  5. iPhone 18 Pro, iPhone Fold to Feature Relocated Selfie Camera; iPhone 17e to Offer MagSafe Support: Report
  6. Development on The Elder Scrolls 6 Is 'Progressing Really Well', Says Bethesda Director Todd Howard
  7. Meta’s New Open-Source SAM Audio AI Model Can Isolate Sounds From Audio Mixtures
  8. Vivo V70 Stops By US FCC Database; Listing Reveals RAM and Storage Specifications
  9. Taskaree: The Smuggler’s Web OTT Release Date: When and Where to Watch Emraan Hashmi's Intense Crime Thriller
  10. Home Town Streaming Now Online: Know Where to Watch This American Reality Show
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.