Google Can Reportedly Use Content to Train Search AI Even If Publishers Opt Out

A Google executive reportedly testified in court that it can use content to train AI for search products, even if publishers have opted out.

Advertisement
Written by Akash Dutta, Edited by David Delima | Updated: 6 May 2025 13:19 IST
Highlights
  • Google DeepMind AI models reportedly respect opt-outs
  • The AI products of Google Search include AI Overviews and AI Mode
  • Google Search manages content via the robots.txt web standard

The executive’s testimony was part of the US Justice Department’s antitrust case against Google

Photo Credit: Pexels/Pixabay

Google Search products can reportedly use content from publishers even if they have opted out of artificial intelligence (AI) training. As per the report, a Google DeepMind executive revealed the information during a testimony in the company's ongoing antitrust case against the US Justice Department. The executive reportedly highlighted that such content is not used in the AI models developed by DeepMind. The Mountain View-based tech giant reportedly explained that content for search is managed by a separate mechanism that uses the robots.txt web standard.

Update: After publishing the story, Google's global PR team reached out to Gadgets 360 with additional details. As per the statement, the company's publisher opt-out rule only applied to its Google-Extend product, and has never applied to Google Search. Google-Extended is a standalone product token that web publishers can use to manage whether content Google crawls from their sites may be used for training future generations of Gemini models that power Gemini Apps and Vertex AI API for Gemini. Google-Extended does not impact a site's inclusion in Google Search nor is it used as a ranking signal in Google Search.

Google Follows Different for AI Models, Search Products

According to a Bloomberg report, Eli Collins, the Vice President of Product at Google DeepMind, confirmed that the rules for adhering to publishers' decision to opt out from AI training are different for AI models from DeepMind and the company's Search products.

Advertisement

Google-Extended, which does not and has never applied to Google Search.

Advertisement

Attorney representing the Department of Justice in the antitrust case, Diana Aguilar, reportedly produced a document highlighting that 80 billion out of 160 billion tokens used to train Google's AI models came from content that publishers had opted out of AI training. Collins reportedly responded that DeepMind's models do not use the content once a publisher has opted out of AI training.

However, when Aguilar reportedly questioned if the Gemini AI model could use the same content if it was put inside the Search product, Collins confirmed that as “correct,” as long as the use case was within Search. Notably, this would include Gemini models powering Google's AI Overviews and recently launched AI Mode.

Advertisement

This means traditional opt-out methods aren't enough to keep Google from using content from publishers. The tech giant had updated its privacy policy in June 2023 to reflect that it will use all publicly available Internet data to train its language models. Here, publicly available Internet data refers to any website that does not have a paywall or mandatory sign-up pages, restricting its access to the public.

A Google spokesperson later told Bloomberg that the rules for Search-based AI tools are different, as publishers can “only decline having their data used in Search AI if they opt out of being indexed for search.” Publishers can do this by disabling the robots.txt web standard that allows Google's crawler bots to access the content to index it in search results.

Advertisement

However, this would also ensure that these web pages do not show up when a user uses Google's search engine to search for a topic. This effectively leaves publishers with no option but to accept the company training its AI models on said data.

The ongoing antitrust case is attempting to prove that Google has a monopoly in the search and AI space. Amit Mehta, a US District Judge presiding over the case, is being urged by the Department of Justice to force the tech giant to sell Google Chrome and to share the data that it uses to generate search results. However, no such measure has been suggested for the company's AI products.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. This Is How You Can Get ChatGPT Go Subscription for Free
  2. Red Magic 11 Pro Launched in Global Markets With Slightly Smaller Battery
  3. Realme GT 8 Pro Aston Martin F1 Limited Edition Launch Date Revealed
  4. Dude OTT Release Date: When and Where to Watch it Online?
  5. Here Are the Best Smartphones Under Rs 20,000 With AMOLED Display
  6. Poco F8 Pro, F8 Ultra Set for Global Launch 'Really Soon', Tipster Claims
  7. Apple Enters List of Top 5 Phone Makers in India in Q3 2025: Counterpoint
  8. Iran Tackles Illegal Bitcoin Mining Devices in Fresh Crackdown
  9. Samsung Galaxy A57 Spotted on Company's Test Server With This Model Number
  10. German Scientists Develop Laser Drill to Explore Icy Moons' Hidden Oceans
  1. OpenAI’s ChatGPT Go Plan Is Now Available for Free: Know How to Get It
  2. Ghostly Neutrinos May Hold the Answer to Why Matter Exists in Our Universe
  3. German Scientists Develop Laser Drill to Explore Icy Moons’ Hidden Oceans
  4. Japan’s Akatsuki Spacecraft Declared Inoperable, Marking End of Dedicated Venus Missions
  5. NASA’s JWST Produces First-Ever 3D Map of Distant Planet WASP-18b
  6. Bad Girl OTT Release Date Revealed: Know When and Where to Watch This Tamil Movie Online
  7. Dhoolpet Police Station OTT Release: Know When and Where to Watch This Upcoming Crime Series Online
  8. Rockstar Games Co-Founder Says GTA Games Won't Work if Set Outside the US
  9. Iran Tackles Unauthorised Crypto Mining After 95 Percent of Bitcoin Mining Devices Found Operating Illegally
  10. Red Magic 11 Pro Launched Globally With Snapdragon Elite Gen 5, Slightly Smaller Battery: Price, Specifications
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.