Apple Researchers Introduce Matrix3D, a Unified AI Model That Can Turn 2D Photos Into 3D Objects

Matrix3D can perform several photogrammetry subtasks, including pose estimation, depth prediction, and novel view synthesis.

Highlights
  • Matrix3D utilises a multimodal diffusion transformer (DiT)
  • The model was developed in partnership with Nanjing University and HKUST
  • It is an open-source model available for download on GitHub

Researchers said that Matrix3D was trained using the masked learning technique

Photo Credit: Reuters

Apple researchers have released a new artificial intelligence (AI) model that can generate 3D views from multiple 2D images. The model, dubbed Matrix3D, was developed by the company's Machine Learning team in collaboration with Nanjing University and the Hong Kong University of Science and Technology (HKUST). The Cupertino-based tech giant has made the AI model available to the open-source community, and it can be downloaded via Apple's listing on GitHub. With Matrix3D, the researchers have unified the 3D generation pipeline into a single model to reduce the risk of errors that a multi-stage pipeline can introduce.

Apple's Matrix3D Innovates Multi-Task Photogrammetry

In a post, the tech giant detailed the research behind the Matrix3D AI model. While several 3D rendering models already exist, this one stands out by unifying the pipeline used to create 3D views. Instead of relying on multiple models and components, a single model performs several photogrammetry subtasks, such as pose estimation, depth prediction, and novel view synthesis.

Notably, photogrammetry is the technique of obtaining accurate measurements and 3D information about physical objects and environments by analysing images. It is commonly used to create maps, 3D models, and measurements from 2D images taken from different angles.

The researchers have also published a paper about the new model on the preprint server arXiv. According to the researchers, Matrix3D is based on a multimodal diffusion transformer (DiT) architecture. It can integrate data across multiple modalities, such as image data, camera parameters, and depth maps.

In the paper, Apple researchers highlight that the model was trained using a masked learning strategy, in which part of the image is obscured and the AI model is trained to predict the pixels that fill the gap.
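
As a rough illustration of the masked learning idea described above, the toy Python sketch below hides a random half of a list of "patches", lets a stand-in predictor fill the gaps, and scores the result only on the hidden positions. This is not Apple's training code: the function names `make_mask` and `masked_mse`, the 50 percent mask ratio, the patch values, and the mean-value predictor are all assumptions made for the example.

```python
import random


def make_mask(num_patches, mask_ratio, rng):
    """Return a list of booleans; True marks a patch hidden from the model."""
    num_masked = round(num_patches * mask_ratio)
    hidden = set(rng.sample(range(num_patches), num_masked))
    return [i in hidden for i in range(num_patches)]


def masked_mse(target, prediction, mask):
    """Mean squared error computed only over the hidden patches."""
    errors = [(t - p) ** 2 for t, p, m in zip(target, prediction, mask) if m]
    return sum(errors) / len(errors) if errors else 0.0


rng = random.Random(0)  # fixed seed so the example is repeatable
patches = [0.1, 0.5, 0.9, 0.3, 0.7, 0.2, 0.8, 0.4]  # toy per-patch values (assumed)

mask = make_mask(len(patches), mask_ratio=0.5, rng=rng)

# What the model is allowed to see: hidden patches are replaced by None.
visible = [None if m else v for v, m in zip(patches, mask)]

# Stand-in "model" (an assumption): fill every gap with the mean of the visible patches.
seen = [v for v in visible if v is not None]
fill = sum(seen) / len(seen)
prediction = [fill if m else v for v, m in zip(patches, mask)]

# Training would adjust the model to push this number down.
loss = masked_mse(patches, prediction, mask)
print(f"hidden patches: {sum(mask)}, loss on hidden patches: {loss:.4f}")
```

In a real system, the mean-value predictor would be replaced by a neural network, and the loss on the hidden region would be used to update its weights.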

The researchers found that the model can generate an entire 3D object or scene view with just three images from different angles. While the dataset used to train the model was not disclosed, the model itself is available to download, modify, and redistribute via a permissive Apple licence on the company's GitHub listing.

© Copyright Red Pixels Ventures Limited 2025. All rights reserved.