Reddit to Update Web Standard to Block Automated Data Scraping From Its Website

Reddit said that researchers and organizations such as the Internet Archive will continue to have access to its content for non-commercial use.

Advertisement
By Reuters | Updated: 26 June 2024 15:51 IST
Highlights
  • AI startups have reportedly been bypassing rules to gather content
  • Reddit said that it would update the Robots Exclusion Protocol
  • The platform will also block unknown bots and crawlers from data scraping

AI firms have been accused of plagiarizing content from publishers

Photo Credit: Reuters

Social media platform Reddit said on Tuesday it will update a Web standard used by the platform to block automated data scraping from its website, following reports that AI startups were bypassing the rule to gather content for their systems.

The move comes at a time when artificial intelligence firms have been accused of plagiarizing content from publishers to create AI-generated summaries without giving credit or asking for permission.

Reddit said that it would update the Robots Exclusion Protocol, or "robots.txt," a widely accepted standard meant to determine which parts of a site are allowed to be crawled.

Advertisement

The company also said it will maintain rate-limiting, a technique used to control the number of requests from one particular entity, and will block unknown bots and crawlers from data scraping - collecting and saving raw information - on its website.

Advertisement

More recently, robots.txt has become a key tool that publishers employ to prevent tech companies from using their content free-of-charge to train AI algorithms and create summaries in response to some search queries.

Last week, a letter to publishers by the content licensing startup TollBit said that several AI firms were circumventing the web standard to scrape publisher sites.

Advertisement

This follows a Wired investigation which found that AI search startup Perplexity likely bypassed efforts to block its Web crawler via robots.txt.

Earlier in June, business media publisher Forbes accused Perplexity of plagiarizing its investigative stories for use in generative AI systems without giving credit.

Advertisement

Reddit said on Tuesday that researchers and organizations such as the Internet Archive will continue to have access to its content for non-commercial use.

© Thomson Reuters 2024


Is the Samsung Galaxy Z Flip 5 the best foldable phone you can buy in India right now? We discuss the company's new clamshell-style foldable handset on the latest episode of Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.
Affiliate links may be automatically generated - see our ethics statement for details.
 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Further reading: Reddit, AI
Advertisement

Related Stories

Popular Mobile Brands
  1. Red Magic 11 Air Launched in Global Markets With ICE Cooling System
  2. Sarvam Maya OTT Release Date: When and Where to Watch it Online?
  3. Vivo X200T vs Motorola Signature vs OnePlus 15R: Price, Features Compared
  4. Samsung Galaxy Tab S12+ Surfaces on IMEI Database, Could Launch Soon
  5. Champion OTT Release: Where To Watch Roshan Meka's Telugu Sports Drama Online?
  1. CERN Experiments Confirm Early Universe Behaved Like a Near-Perfect Fluid
  2. NASA’s TESS Captures First Images of Rare Interstellar Comet 3I/ATLAS
  3. Daredevil: Born Again Season 2 OTT Release Date Confirmed: When and Where to Watch it Online?
  4. The Wrecking Crew Starring Jason Momoa and Dave Bautista Now Streaming: What You Need to Know
  5. Redmi Buds 8 Pro Launched With ANC, Hi-Res Audio and Up to 36 Hours of Total Battery Life
  6. Samsung Galaxy Tab S12+ Surfaces on IMEI Database, Could Launch Soon
  7. Champion OTT Release: Where To Watch Roshan Meka’s Telugu Sports Drama Online?
  8. Nothing Won't Launch a Flagship Model in 2026; Company to Focus on Nothing Phone 4a and Audio Products, Carl Pei Says
  9. Redmi Turbo 5 Max Launched With 9,000mAh Battery, Redmi Turbo 5 Tags Along: Price, Specifications
  10. Ponies Starring Emilia Clarke and Haley Lu Richardson Now Available for Streaming
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.