Nvidia Research Introduces DiffUHaul, an AI Tool That Allows Object Relocation in Images

The DIffUHaul is a training-free technique, meaning the tool is not required to be pre-trained on any datasets.

Advertisement
Written by Akash Dutta, Edited by Manas Mitul | Updated: 2 December 2024 18:59 IST
Highlights
  • Nvidia researchers published a paper on DiffUHaul
  • The AI tool works on the principle of text-to-image diffusion
  • Nvidia’s DiffUHaul uses BlogGEN model for spatial understanding
Nvidia Research Introduces DiffUHaul, an AI Tool That Allows Object Relocation in Images

The researchers highlighted that existing AI models struggle with object relocation

Photo Credit: Nvidia

Nvidia researchers introduced a new artificial intelligence (AI) model Monday that can relocate objects in an image. Dubbed DiffUHaul, the tool can spatially understand the context of an image to move an object from one place to another without impacting the background or the shape of the image. The unique aspect of this technique is that it is training-free, meaning no pre-training data was used to build this tool. The new technology was showcased by the company at the Special Interest Group on Computer Graphics and Interactive Techniques (SIGGRAPH) Asia 2024 conference.

Nvidia Researchers Introduce DiffUHaul AI Tool

In a research paper, Nvidia researchers detailed the new AI tool. The technology was developed in collaboration with The Hebrew University of Jerusalem, Tel Aviv University, and Reichman University. With the new tool, the researchers aimed to solve a prominent issue with AI image generation models – the problem of relocating objects in an image with spatial awareness.

The paper highlights that this particular editing task has remained a bottleneck for AI scientists due to AI models lacking spatial reasoning. Existing visual models can understand the context of an image, but are unable to move objects as they do not understand how a movement in a 2D environment would be perceived spatially.

With DiffUHaul, Nvidia claims this issue can be solved. Based on image diffusion architecture, the tool uses attention masking in the denoising step. This is done to preserve the high-level object appearance. The AI tool uses BlobGEN, a new technique that integrates spatial understanding into the AI tool. Further, new techniques were used to reconstruct real images with the localised model in the designated place.

Advertisement

On the front end, users will be able to type a text prompt highlighting the object they want changed and the AI can spatially readjust the object while adjusting the background accordingly. In demonstrations shown by the company, it could not be determined if the AI editing tool can understand the shape changes that come with spatial movement. For instance, if an air-borne balloon is moved to the ground, its shape is also changed. However, the AI might not be able to capture that due to a lack of training.

 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. iQOO Neo 10 Pro+ Battery and Charging Details Revealed Ahead of Debut
  2. Vivo S30, S30 Pro Mini, Pad 5, TWS Air 3 Launch Date, Key Features Confirmed
  3. OnePlus 13s With Snapdragon 8 Elite Chip to Launch in India on This Date
  4. Samsung Galaxy S25 FE Tipped to Retain Galaxy S24 FE Rear Cameras
  5. iPhone 17 Air Leak Suggests Battery Capacity, Thickness and Weight
  6. Coinbase Faces Multiple Lawsuits After User Data Breach: Report 
  7. HP Launches OmniBook 5 Series AI PCs With Snapdragon X Series Chipsets
  8. Apple AirPods With Built-in Camera Tipped to Launch Next Year
  9. Realme GT 7T Design, Specifications Leaked Ahead of May 27 Launch
  1. Sun Unleash a 600,000-Mile Filament in Fiery Eruption
  2. New Study Sets Stronger Mass Limit on Ultralight Bosonic Dark Matter
  3. NASA’s Perseverance Captures Deimos Before Dawn in Striking Martian Sky Image
  4. Huawei MateBook Fold Ultimate Design With 18-Inch Double-Layer Flexible OLED Display Launched: Price, Features
  5. Huawei Nova 14 Ultra, Nova 14 Pro, Nova 14 With 5,500mAh Battery, 100W Charging Launched: Price, Specifications
  6. Coinbase Faces Multiple Lawsuits After User Data Breach: Report 
  7. Dubai's VARA Sets June 19 Deadline for Crypto Firms to Comply With Updated Activity-Based Rulebooks
  8. Acer AI TransBuds With Ear-Hook Design Unveiled at Computex 2025
  9. Nintendo Switch 2 to Support Text-to-Speech in GameChat, VRR Support Limited to Handheld Mode
  10. Honor 400 Series China Launch Date Revealed; Confirmed to Offer Battery Upgrade Over Predecessors
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.