Apple Releases Depth Pro, an Open Source Monocular Depth Estimation AI Model

Adding to the list, the Cupertino-based tech giant has now released a new AI model dubbed Depth Pro.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 7 October 2024 16:07 IST
Highlights
  • Apple’s AI model operates at the fixed resolution of 1536 x 1536
  • The Depth Pro model can synthesise depth maps of thin structures
  • The ViT image encoder also measures the focal length

The Apple Depth Pro AI model is available to download on GitHub

Photo Credit: Reuters

Apple has released several open-source artificial intelligence (AI) models this year. These are mostly small language models designed for a specific task. Adding to the list, the Cupertino-based tech giant has now released a new AI model dubbed Depth Pro. It is a vision model that can generate monocular depth maps of any image. This technology is useful in the generation of 3D textures, augmented reality (AR), and more. The researchers behind the project claim that the depth maps generated by AI are better than the ones generated with the help of multiple cameras.

Apple Releases Depth Pro AI Model

Depth estimation is an important process in 3D modelling as well as various other technologies such as AR, autonomous driving systems, robotics, and more. The human eye is a complex lens system that can accurately gauge the depth of objects even while observing them from a single-point perspective. However, cameras are not that good at it. Images taken with a single camera make it appear two-dimensional, removing depth from the equation.

So, for technologies where the depth of an object plays an important role, multiple cameras are used. However, modelling objects like this can be time-consuming and resource-intensive. Instead, in a research paper titled “Depth Pro: Sharp Monocular Metric Depth in Less Than a Second”, Apple highlighted how it used a vision-based AI model to generate zero-shot depth maps of monocular images of objects.

Advertisement

How the Depth Pro AI model generates depth maps
Photo Credit: Apple

Advertisement

 

To develop the AI model, the researchers used the Vision Transformer-based (ViT) architecture. The output resolution of 384 x 384 was picked, but the input and processing resolution was kept at 1536 x 1536, allowing the AI model more space to understand the details.

Advertisement

In the pre-print version of the paper, which is currently published in the online journal arXiv, the researchers claimed that the AI model can now accurately generate depth maps of visually complex objects such as a cage, a furry cat's body and whiskers, and more. The generation time is said to be one second. The weights of the open-source AI model are currently being hosted on a GitHub listing. Interested individuals can run the model on the inference of a single GPU.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Realme P4 Power 5G With 10,001mAh Battery Arrives in India: See Price
  2. Redmi Turbo 5 Max Launched With 9,000mAh Battery, Redmi Turbo 5 Tags Along
  3. NASA's TESS Captures First Images of Rare Interstellar Comet 3I/ATLAS
  4. Adobe Express Premium Is Now Free for One Year for All Airtel Users
  5. How to Change Your Mobile Number and Address Using New Aadhaar App
  6. CERN Experiments Confirm Early Universe Behaved Like a Near-Perfect Fluid
  7. Redmi Note 15 Pro Series 5G Launched in India With These Features
  8. Redmi Buds 8 Pro Launched With ANC, Hi-Res Audio at This Price
  9. Realme P4 Power 5G First Impressions
  1. CERN Experiments Confirm Early Universe Behaved Like a Near-Perfect Fluid
  2. NASA’s TESS Captures First Images of Rare Interstellar Comet 3I/ATLAS
  3. Daredevil: Born Again Season 2 OTT Release Date Confirmed: When and Where to Watch it Online?
  4. The Wrecking Crew Starring Jason Momoa and Dave Bautista Now Streaming: What You Need to Know
  5. Redmi Buds 8 Pro Launched With ANC, Hi-Res Audio and Up to 36 Hours of Total Battery Life
  6. Samsung Galaxy Tab S12+ Surfaces on IMEI Database, Could Launch Soon
  7. Champion OTT Release: Where To Watch Roshan Meka’s Telugu Sports Drama Online?
  8. Nothing Won't Launch a Flagship Model in 2026; Company to Focus on Nothing Phone 4a and Audio Products, Carl Pei Says
  9. Redmi Turbo 5 Max Launched With 9,000mAh Battery, Redmi Turbo 5 Tags Along: Price, Specifications
  10. Ponies Starring Emilia Clarke and Haley Lu Richardson Now Available for Streaming
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.