Meta Releases 'Segment Anything Model 2' With AI-Powered Object Identification in Images and Videos

Meta’s Segment Anything Model (SAM 2) arrives a year after its predecessor was announce by the Facebook parent firm.

Advertisement
Written by Akash Dutta, Edited by David Delima | Updated: 30 July 2024 14:12 IST
Highlights
  • SAM 2 can segment and track any object in a video or image
  • Meta said the AI model can enable advanced video editing and generation
  • The open-source AI model is available on GitHub
Meta Releases 'Segment Anything Model 2' With AI-Powered Object Identification in Images and Videos

Segment Anything Model 2 is based on a simple transformer architecture with streaming memory

Photo Credit: Meta

Meta released a new artificial intelligence (AI) model on Monday that can perform complex computer vision tasks. Dubbed Segment Anything Model 2 (SAM 2), it follows after its predecessor that was launched last year and was incorporated in Instagram's Backdrop and Cutouts tools. The successor to the model now comes with advanced capabilities and the company said it can perform segment identification and tracking even on videos. Like most of Meta's large language models (LLMs), SAM 2 is also an open-source AI model.

Meta's Segment Anything Model 2 Unveiled

In a newsroom post, Meta announced the new AI model which focuses on segment analysis on videos primarily, while improving its image segmentation capabilities. Highlighting the accomplishments of its predecessor, Meta said the AI model was used in Instagram's Backdrop and Cutouts features, while marine scientists used it to “segment sonar images and analyse coral reefs, satellite imagery analysis for disaster relief, and in the medical field, segmenting cellular images and aiding in detecting skin cancer".

SAM 2 is capable of object segmentation in an image and video as well as track it across different frames of a video in real-time. The AI can also track and segment objects in scenarios where the objects move fast, change in appearance, or are concealed by other objects or an entirely different scene.

The foundation model for prompt-based visual segmentation is built on a simple transformer architecture. It has a streaming memory that allows it to process videos in real-time. The company also claimed that the model was trained on its largest video segmentation dataset dubbed SA-V dataset.

Advertisement

Meta said the AI model can help ease the process of video editing or AI-based video generation, as well as to power new experiences in the company's mixed-reality ecosystem. The object tracking capability in videos can also assist in faster annotation of visual data to train other computer vision systems, the company added.

Since it is an open-source AI model, the company has hosted its weights on its GitHub page. Interested individuals can download and test out the AI model. Notably, it is licenced under the Apache 2.0 licence which allows for research, academic, and non-commercial usage.

 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. Nothing Announces 'Now or Nothing' Sale in India: Check All Offers
  2. Vivo T4 Ultra Launched in India With 50-Megapixel Periscope Camera
  3. Google Releases Android 16 for Pixel Devices With These New Features
  4. Here's When the OnePlus Nord 5 and OnePlus Nord CE 5 Could Launch
  5. Itel Zeno 5G With 50-Megapixel Rear Camera Launched in India: See Price
  6. Android 16 Update Is Coming Soon - Here's What to Expect
  7. Nothing Phone 3 Leaked Render Suggests Design, Triple Rear Camera Unit
  8. Motorola Edge 60 With 5,500mAh Battery Launched in India: Price, Offers
  1. OpenAI Releases o3-Pro Reasoning-Focused AI Model, Comes With Improved Capabilities and Tool Use
  2. Google's June 2025 Pixel Drop Brings AI Sticker Generation to Gboard, Pixel VIPs Widget and Camera Hints
  3. Nintendo Switch 2 Sets Record, Sells Over 3.5 Million Units in First Four Days of Launch
  4. Vivo T4 Ultra With MediaTek Dimensity 9300+ SoC, 50-Megapixel Periscope Camera Launched in India
  5. Android 16 QPR1 Beta 2 Update With Support for Connected Displays, Flexible Window Tiling Released
  6. Android 16 With Support for Live Activities, Advanced Protection Rolling Out for Pixel Devices
  7. Itel Zeno 5G With MediaTek Dimensity 6300 SoC, 50-Megapixel Rear Camera Launched in India
  8. OnePlus Nord 5, OnePlus Nord CE 5 Launch Date Leaked: Expected Specifications
  9. NASA Slightly Raises Odds of Asteroid Hitting the Moon in 2032 After Updated JWST Data
  10. James Webb Space Telescope Captures Stunning Near-Infrared View of Sombrero Galaxy
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.