Grok 4.1 AI Model Tends to Show Sycophancy and Deception More Than Its Predecessor

Grok 4.1’s model card states that it scored higher on the dishonesty and sycophancy parameters compared to Grok 4.

Advertisement
Written by Akash Dutta, Edited by Ketan Pratap | Updated: 20 November 2025 18:30 IST
Highlights
  • Grok 4.1 Thinking scored 0.49 on deception and 0.19 on sycophancy
  • In contrast, Grok 4 scored 0.43 and 0.07, respectively
  • This means the AI model will agree with the user even if they’re wrong

Higher sycophancy in an AI model makes it more likely to show people-pleasing traits

Photo Credit: xAI

Grok 4.1 was released on Monday by Elon Musk's xAI. At launch, the artificial intelligence (AI) firm highlighted that the model now displays higher emotional intelligence and improved creative writing capabilities. However, its model card now shows a concerning problem. The large language model (LLM) scores higher on deception and sycophancy than its predecessor, Grok 4, which could result in it displaying people-pleasing traits. The model also has a false-negative rate of 0.20 for biology via prompt injection.

Grok 4.1 Model Card Raises Flags for Deceptive and Sycophant Behaviour

The model card of Grok 4.1 (first spotted by the Decoder) highlights several concerning facts about the AI model. For the unaware, a model card contains all the technical details (or specifications) of a model, which is gauged by various internal testing. It highlights both how performant an AI model is and how strong its safety guardrails are.

Advertisement

xAI says the fourth-generation Grok model was upgraded to improve its emotional intelligence, and during our testing, we found that it performs slightly better than GPT-5.1 in general conversations and creative writing. However, this improved performance comes at a cost.

The model card shows that Grok 4.1 performs worse on the deception and sycophancy metrics. In the MASK benchmark, its deception rate was noted as 0.49 for the thinking variant and 0.46 for the non-thinking variant. On the other hand, Grok 4's deception was lower at 0.43. Similarly, the sycophancy score goes up from 0.07 in Grok 4 to 0.19 and 0.23 in the thinking and non-thinking variants, respectively.

Advertisement

In a real-world scenario, this would mean that the chatbot powered by the AI model will try harder to please the user, agreeing with them even when it knows they are wrong. It might also manipulate the user after providing an inaccurate response.

It should be highlighted that the scores are high, but AI companies also add external guardrails (not part of the AI model itself but built into the chatbot's system) that often suppress these tendencies. However, a possibility remains that Grok might agree with a user's delusions or paranoia and end up amplifying their belief.

Advertisement

Separately, it also has a false negative rate of 0.20 for biology-related prompt injections, which means one out of five malicious prompts around the topic can slip past the guardrails, and the AI model will respond to the query.

Notably, it is still too early to gauge how these numbers on paper will translate into the real world. It is also possible that xAI developers are already working on fine-tuning techniques to minimise the risks associated with the model. However, the numbers do highlight the need to be careful when interacting with Grok, especially when sharing sensitive information with it.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. CMF Phone 3 Pro Launch Timeline Leaks as Tipster Reveals Key Specs
  2. iQOO 15T Launches in China With These Features
  3. Vivo X500 Pro Max Might Launch This Year With an 8,000mAh Battery
  4. Apple's Big Health Move in India: Watch, AirPods Pro Get New Features
  5. Redmi Turbo 6 Max Tipped to Launch With a Notably Larger Battery
  6. Microsoft Surface for Business (2026) Series Debuts Globally at This Price
  7. Samsung Launches Odyssey G8 6K Monitor Alongside OLED G7, ViewFinity S8
  8. Motorola Razr Fold Goes on Sale in India With These Offers
  9. iQOO Pad 6 Pro, iQOO TWS 5i Debut at These Prices: See Features
  10. Apple Wants AI to Make iPhone More Accessible With These Features
  1. Samsung Odyssey G8 6K Monitor Announced Alongside New Odyssey OLED G7 and ViewFinity S8 Monitors
  2. Android 17 to Introduce 'Continue On' Feature That Lets Users Resume Apps Across Phones, Tablets
  3. Vivo X500 Pro Max Chipset and Battery Details Leaked; Could Arrive With Largest Battery in the Vivo X500 Lineup
  4. CMF Phone 3 Pro May Launch Later Than Previously Anticipated; Key Specifications Tipped
  5. Apple’s Big Health Move in India: Watch Gets Sleep Apnea Alerts, AirPods Pro Gain Hearing Test Feature
  6. SpaceX Launches 24 More Starlink Satellites, Nears 10,500 in Orbit
  7. Apple Announces New AI-Powered Accessibility Features Across iPhone, Mac and Vision Pro
  8. Lenovo Legion 5 15IAX11 With 15.3-Inch OLED Display, Up to Intel Core Ultra 9 CPU Listed Online
  9. iQOO Pad 6 Pro Launched With Snapdragon 8 Elite Gen 5 SoC, 13.2-Inch 4K Display, iQOO TWS 5i Tags Along: Price, Features
  10. Warhorse Studios Announces Open-World Middle-Earth RPG and New Kingdom Come Adventure
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.