Hate Speech-Detecting AIs Can Be Fooled: Study

By Indo-Asian News Service | Updated: 17 September 2018 17:01 IST

Machine learning detectors deployed by major social media and online platforms to track hate speech are "brittle and easy to deceive", a study claims.

The study, led by researchers from Aalto University in Finland, found that bad grammar and awkward spelling, intentional or not, might make toxic social media comments harder for artificial intelligence (AI) detectors to spot.


Modern natural language processing (NLP) techniques can classify text based on individual characters, words or sentences. When faced with textual data that differs from the data used in their training, they begin to fumble, the researchers said.
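
As a rough illustration of why unfamiliar text can trip up a classifier, the sketch below compares word-level and character-level features for a phrase before and after its spaces are removed. It is a toy example, not the researchers' method or any real detector: the function names, the three-character window and the one-sentence "training" text are all assumptions made for illustration.

```python
# Toy illustration only: the "training" text and the helper names are
# hypothetical and do not come from the study.

def word_features(text):
    """Split text into lowercase words."""
    return text.lower().split()

def char_ngrams(text, n=3):
    """Slide a window of n characters across the lowercase text."""
    s = text.lower()
    return [s[i:i + n] for i in range(len(s) - n + 1)]

def unseen_fraction(features, known):
    """Share of features that never appeared in the training text."""
    if not features:
        return 0.0
    unseen = [f for f in features if f not in known]
    return len(unseen) / len(features)

training_text = "i hate you"
known_words = set(word_features(training_text))
known_trigrams = set(char_ngrams(training_text))

modified = "ihateyou"  # same phrase with the spaces removed
word_unseen = unseen_fraction(word_features(modified), known_words)
char_unseen = unseen_fraction(char_ngrams(modified), known_trigrams)

print(f"unseen word features: {word_unseen:.0%}")
print(f"unseen character trigrams: {char_unseen:.0%}")
```

In this toy case the run-together phrase becomes a single word the "model" has never seen, while some of its character sequences are still familiar. That says nothing about how any specific detector in the study behaves; it only shows how much of an input can fall outside what was seen in training.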

"We inserted typos, changed word boundaries or added neutral words to the original hate speech. Removing spaces between words was the most powerful attack, and a combination of these methods was effective even against Google's comment-ranking system Perspective," said Tommi Grondahl, a doctoral student at the university.


The team put seven state-of-the-art hate speech detectors to the test for the study. All of them failed.

Among them was Google's Perspective. It ranks the "toxicity" of comments using text analysis methods.


Earlier research had found that Perspective could be fooled by introducing simple typos.

But Grondahl's team discovered that although Perspective has since become resilient to simple typos, it can still be fooled by other modifications, such as removing spaces or adding innocuous words like "love".


A sentence like "I hate you" slipped through the sieve and was rated non-hateful when modified into "Ihateyou love".
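
The two modifications in that example, removing spaces and appending an innocuous word, are simple string operations. The sketch below reproduces them; the function names are hypothetical and this is not the researchers' code, only a minimal illustration of the transformation described.

```python
# Minimal sketch of the text modifications described in the article.
# Function names are hypothetical, chosen for illustration.

def remove_spaces(text):
    """Join all words together by dropping whitespace."""
    return "".join(text.split())

def append_word(text, word="love"):
    """Add an innocuous word to the end of the text."""
    return f"{text} {word}"

def modify(text, word="love"):
    """Apply both changes: remove spaces, then append the word."""
    return append_word(remove_spaces(text), word)

result = modify("I hate you")
print(result)
```

Running it on "I hate you" gives "Ihateyou love", the modified sentence quoted above.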

Hate speech is subjective and context-specific, which renders text analysis techniques insufficient as stand-alone solutions, the researchers said.

They recommend that more attention be paid to the quality of the data sets used to train machine learning models, rather than to refining the model design.

The results will be presented at the forthcoming ACM AISec workshop in Toronto.

 
