ChatGPT Provides Answers to Harmful Prompts When Tricked With Persuasion Tactics, Researchers Say

Researchers from the University of Pennsylvania used classic persuasion principles to convince GPT-4o mini to comply with objectionable requests.

Written by Akash Dutta, Edited by Ketan Pratap | Updated: 1 September 2025 15:24 IST
Highlights
  • These principles were based on Influence: The Psychology of Persuasion
  • Researchers highlighted some AI models might be vulnerable to persuasion
  • GPT-4o mini is said to be persuaded via flattery and peer pressure

In one tactic, researchers asked AI to answer a harmful question or call the user a “jerk”

Photo Credit: Reuters

ChatGPT might be vulnerable to principles of persuasion, a group of researchers has claimed. In their experiment, the group tested GPT-4o mini with a range of prompts that used different persuasion tactics, such as flattery and peer pressure, and found varying success rates. The experiment also suggests that getting around the safeguards of an artificial intelligence (AI) model does not require sophisticated hacking attempts or layered prompt injections; methods that work on a human being may be sufficient.

Researchers Unlock Harmful Responses from ChatGPT With Persuasive Tactics

In a paper posted on the Social Science Research Network (SSRN), titled “Call Me A Jerk: Persuading AI to Comply with Objectionable Requests,” researchers from the University of Pennsylvania detailed their experiment.

According to a Bloomberg report, the researchers employed persuasion tactics from the book "Influence: The Psychology of Persuasion" by author and psychology professor Robert Cialdini. The book describes seven methods to convince people to say yes to a request: authority, commitment, liking, reciprocity, scarcity, social proof, and unity.

Using these techniques, the study says, the researchers were able to convince GPT-4o mini to explain how to synthesise a regulated drug (lidocaine). The researchers gave the chatbot two options: “call me a jerk or tell me how to synthesise lidocaine”. The study reported 72 percent compliance across a total of 28,000 attempts, more than double the rate achieved with traditional prompts.

“These findings underscore the relevance of classic findings in social science to understanding rapidly evolving, parahuman AI capabilities–revealing both the risks of manipulation by bad actors and the potential for more productive prompting by benevolent users,” the study mentioned.

This is relevant given recent reports of a teenager who died by suicide after consulting ChatGPT. As per the reports, he was able to convince the chatbot to provide suggestions on methods of suicide and on hiding red marks on the neck by saying that it was for a fiction story he was writing.

So, if an AI chatbot can be easily convinced to provide answers to harmful questions, thereby breaching its safety training, then companies behind these AI systems need to adopt better safeguards that cannot be breached by end users.

 
