Meta’s New Open-Source SAM Audio AI Model Can Isolate Sounds From Audio Mixtures

Meta’s new SAM Audio AI model lets users isolate and edit sounds from mixed audio using text, visual or time prompts.

Written by Akash Dutta, Edited by Ketan Pratap | Updated: 17 December 2025 16:20 IST
Highlights
  • SAM Audio is currently available in the Segment Anything Playground
  • The open-source model can also be downloaded from GitHub
  • Meta says the model can be used for noise filtering and isolating sounds

Meta’s release of SAM Audio comes a month after it released SAM 3 and SAM 3D

Photo Credit: Meta

Meta has released a new artificial intelligence (AI) model in the Segment Anything Model (SAM) family. On Tuesday, the Menlo Park-based tech giant released SAM Audio, an AI model that can identify, separate, and isolate particular sounds in an audio mixture. The model can edit audio based on text prompts, visual signals, or time stamps, automating a workflow that is usually done by hand. Like the other models in the SAM series, it is an open-source model that comes with a permissive licence.

Meta Introduces SAM Audio AI Model

In a newsroom post, the tech giant announced and detailed its new audio-focused AI model. SAM Audio is currently available to download via Meta's website, its GitHub listing, or Hugging Face. Users who would prefer to try the model without running it locally can visit the Segment Anything Playground. The website also offers access to all the other SAM models. Notably, the model is available under the SAM Licence, a custom, Meta-owned licence that allows both research-related and commercial usage.

Meta describes SAM Audio as a unified AI audio model that uses text-based commands, visual cues, and time-based instructions to identify and separate sounds from a complex mixture. Traditionally, audio editing, especially isolating individual sound elements, has required specialised tools and manual work, often with limited precision. Meta's latest entry in the SAM series addresses this gap.

The model supports three types of prompting. With text prompts, users can type descriptions, such as “drum beat” or “background noise.” Visual prompting allows users to click on an object or a person in a video; if that object or person is producing a sound, the model isolates it. Finally, time span prompting lets users mark a segment of the timeline where the target sound occurs.
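As a rough illustration of the three prompt types described above, the sketch below models them as a single Python record. The class name `SeparationPrompt` and its fields are hypothetical and are not taken from Meta's code or documentation; the sketch only shows how text, click, and time span prompts might be represented and validated.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class SeparationPrompt:
    """Hypothetical container for a prompt (not SAM Audio's real API)."""
    text: Optional[str] = None                       # e.g. "drum beat"
    click_xy: Optional[Tuple[int, int]] = None       # pixel clicked in a video frame
    time_span: Optional[Tuple[float, float]] = None  # (start, end) in seconds

    def kinds(self) -> List[str]:
        """Return which prompt types are present, after basic validation."""
        found = []
        if self.text:
            found.append("text")
        if self.click_xy is not None:
            found.append("visual")
        if self.time_span is not None:
            start, end = self.time_span
            if not 0.0 <= start < end:
                raise ValueError("time_span must satisfy 0 <= start < end")
            found.append("time")
        if not found:
            raise ValueError("at least one prompt type is required")
        return found


text_prompt = SeparationPrompt(text="background noise")
span_prompt = SeparationPrompt(time_span=(2.5, 7.0))
print(text_prompt.kinds(), span_prompt.kinds())
```

Keeping every prompt type in one record is only a design convenience for this sketch; the actual model may accept prompts in a different form.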

For example, imagine an audio file of a person speaking on the phone while music plays in the background and children can be heard playing in the distance. Users can isolate any of these audio sources, be it the primary voice, the music, or the ambient noise made by the children, with a single command. Gadgets 360 staff members briefly tested the model and found it to be both fast and efficient. However, we were not able to test it in real-world situations.

Under the hood, SAM Audio is a generative separation model that extracts both target and residual stems from an audio mixture. It uses a flow-matching Diffusion Transformer and operates in the latent space of a Descript Audio Codec Variational Autoencoder variant (DAC-VAE).
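To give a sense of what flow-matching generation involves, here is a minimal toy sketch in plain Python. It is not Meta's implementation: a real model would use a trained Diffusion Transformer, conditioned on the mixture and the prompt, to predict the velocity inside the DAC-VAE latent space. Here a hand-written `ideal_velocity` function (an assumption for illustration) stands in for that network, and a three-number list stands in for a latent.

```python
import random


def ideal_velocity(x, t, target):
    """Velocity of the straight-line path toward `target`.

    Stand-in for a trained network; a real model never sees the target.
    """
    return [(x1 - xi) / (1.0 - t) for xi, x1 in zip(x, target)]


def sample(target, steps=50, seed=0):
    """Integrate the velocity field from noise (t=0) to data (t=1) with Euler steps."""
    rng = random.Random(seed)
    x = [rng.gauss(0.0, 1.0) for _ in target]  # start from Gaussian noise
    dt = 1.0 / steps
    for k in range(steps):
        t = k * dt  # t stays below 1, so the division above is safe
        v = ideal_velocity(x, t, target)
        x = [xi + dt * vi for xi, vi in zip(x, v)]
    return x


toy_latent = [0.25, -1.0, 0.5]  # hypothetical "target stem" latent
print(sample(toy_latent))
```

Because the stand-in velocity already knows the destination, the sampler lands on the toy latent almost exactly. With a learned velocity field, the same loop would instead produce a new latent that a decoder turns back into audio.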
