Meta Releases 'Segment Anything Model 2' With AI-Powered Object Identification in Images and Videos

Meta’s Segment Anything Model 2 (SAM 2) arrives a year after its predecessor was announced by the Facebook parent firm.

Written by Akash Dutta, Edited by David Delima | Updated: 30 July 2024 14:12 IST
Highlights
  • SAM 2 can segment and track any object in a video or image
  • Meta said the AI model can enable advanced video editing and generation
  • The open-source AI model is available on GitHub

Segment Anything Model 2 is based on a simple transformer architecture with streaming memory

Photo Credit: Meta

Meta released a new artificial intelligence (AI) model on Monday that can perform complex computer vision tasks. Dubbed Segment Anything Model 2 (SAM 2), it succeeds the original model, which was launched last year and incorporated into Instagram's Backdrop and Cutouts tools. The successor comes with advanced capabilities, and the company said it can identify and track segments in videos as well as images. Like most of Meta's large language models (LLMs), SAM 2 is an open-source AI model.

Meta's Segment Anything Model 2 Unveiled

In a newsroom post, Meta announced the new AI model, which focuses primarily on segmentation in videos while also improving on its predecessor's image segmentation capabilities. Highlighting the accomplishments of that predecessor, Meta said the AI model was used in Instagram's Backdrop and Cutouts features, while marine scientists used it to “segment sonar images and analyse coral reefs, satellite imagery analysis for disaster relief, and in the medical field, segmenting cellular images and aiding in detecting skin cancer”.


SAM 2 can segment an object in an image or video and track it across the frames of a video in real time. The AI can also track and segment objects in scenarios where they move fast, change in appearance, or are concealed by other objects or by an entirely different scene.
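To make the term concrete, segmentation means producing a mask: the set of pixels that belong to one object. The sketch below is only a toy illustration in Python, and it is not how SAM 2 works internally. It swaps the neural network for a plain flood fill, and the function name `segment_from_click` and the tiny grid "image" are hypothetical. It assumes the user points at an object with a single click, which is one common way of prompting segmentation models.

```python
from collections import deque


def segment_from_click(frame, row, col):
    """Toy stand-in for a segmentation model (flood fill, not SAM 2).

    Returns the set of (row, col) pixels that are connected to the
    clicked pixel and share its value.
    """
    target = frame[row][col]
    height, width = len(frame), len(frame[0])
    mask = set()
    queue = deque([(row, col)])
    while queue:
        r, c = queue.popleft()
        # Skip pixels already in the mask or outside the image.
        if (r, c) in mask or not (0 <= r < height and 0 <= c < width):
            continue
        # Skip pixels that do not look like the clicked object.
        if frame[r][c] != target:
            continue
        mask.add((r, c))
        queue.extend([(r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)])
    return mask


# A 3x4 "image" where 1 marks the object and 0 marks the background.
image = [
    [0, 0, 0, 0],
    [0, 1, 1, 0],
    [0, 1, 0, 0],
]
object_mask = segment_from_click(image, 1, 1)
print(sorted(object_mask))
```

A real model has to cope with texture, lighting and overlapping objects, which is why it relies on a trained network rather than a simple pixel comparison like this one.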

The foundation model for prompt-based visual segmentation is built on a simple transformer architecture. It has a streaming memory that allows it to process videos in real time. The company also claimed that the model was trained on its largest video segmentation dataset, dubbed SA-V.
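The idea of a streaming memory can be sketched in a few lines of Python. This is a loose illustration under stated assumptions, not Meta's implementation: the class `StreamingTracker`, its `memory_size` parameter and the one-dimensional "frames" are all hypothetical, and a simple position average replaces the transformer. What it shows is the general pattern of handling frames one at a time while keeping only a small, bounded record of recent results, which can then be used to estimate where an object is when it is briefly hidden.

```python
from collections import deque


class StreamingTracker:
    """Toy streaming tracker with a bounded memory (not SAM 2 itself)."""

    def __init__(self, memory_size=3):
        # Only the most recent positions are kept; older ones are dropped.
        self.memory = deque(maxlen=memory_size)

    def step(self, frame, object_value=1):
        """Process one frame and return the object's estimated position."""
        positions = [i for i, value in enumerate(frame) if value == object_value]
        if positions:
            # Object is visible: use the centre of its pixels.
            estimate = sum(positions) / len(positions)
        elif len(self.memory) >= 2:
            # Object is hidden: extrapolate from the last two remembered positions.
            estimate = self.memory[-1] + (self.memory[-1] - self.memory[-2])
        elif self.memory:
            # Only one remembered position: assume the object has not moved.
            estimate = self.memory[-1]
        else:
            # Nothing seen yet and nothing remembered.
            return None
        self.memory.append(estimate)
        return estimate


# Four frames of a 5-pixel "video"; the object is hidden in the third frame.
video = [
    [1, 0, 0, 0, 0],
    [0, 1, 0, 0, 0],
    [0, 0, 0, 0, 0],
    [0, 0, 0, 1, 0],
]
tracker = StreamingTracker(memory_size=3)
track = [tracker.step(frame) for frame in video]
print(track)
```

Because each frame is processed as it arrives and the memory never grows beyond a fixed size, the cost per frame stays constant, which is the property that makes real-time processing of long videos feasible.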


Meta said the AI model can ease the process of video editing and AI-based video generation, and power new experiences in the company's mixed-reality ecosystem. The object tracking capability in videos can also speed up the annotation of visual data used to train other computer vision systems, the company added.

Since it is an open-source AI model, the company has hosted its weights on its GitHub page. Interested individuals can download and test out the AI model. Notably, it is licensed under the Apache 2.0 licence, a permissive licence that allows research, academic, and commercial usage.

 
