Grok 4.1 AI Model Tends to Show Sycophancy and Deception More Than Its Predecessor

Grok 4.1’s model card states that it scored higher on the dishonesty and sycophancy parameters compared to Grok 4.

Advertisement
Written by Akash Dutta, Edited by Ketan Pratap | Updated: 20 November 2025 18:30 IST
Highlights
  • Grok 4.1 Thinking scored 0.49 on deception and 0.19 on sycophancy
  • In contrast, Grok 4 scored 0.43 and 0.07, respectively
  • This means the AI model will agree with the user even if they’re wrong

Higher sycophancy in an AI model makes it more likely to show people-pleasing traits

Photo Credit: xAI

Grok 4.1 was released on Monday by Elon Musk's xAI. At launch, the artificial intelligence (AI) firm highlighted that the model now displays higher emotional intelligence and improved creative writing capabilities. However, its model card now shows a concerning problem. The large language model (LLM) scores higher on deception and sycophancy than its predecessor, Grok 4, which could result in it displaying people-pleasing traits. The model also has a false-negative rate of 0.20 for biology via prompt injection.

Grok 4.1 Model Card Raises Flags for Deceptive and Sycophant Behaviour

The model card of Grok 4.1 (first spotted by the Decoder) highlights several concerning facts about the AI model. For the unaware, a model card contains all the technical details (or specifications) of a model, which is gauged by various internal testing. It highlights both how performant an AI model is and how strong its safety guardrails are.

xAI says the fourth-generation Grok model was upgraded to improve its emotional intelligence, and during our testing, we found that it performs slightly better than GPT-5.1 in general conversations and creative writing. However, this improved performance comes at a cost.

Advertisement

The model card shows that Grok 4.1 performs worse on the deception and sycophancy metrics. In the MASK benchmark, its deception rate was noted as 0.49 for the thinking variant and 0.46 for the non-thinking variant. On the other hand, Grok 4's deception was lower at 0.43. Similarly, the sycophancy score goes up from 0.07 in Grok 4 to 0.19 and 0.23 in the thinking and non-thinking variants, respectively.

Advertisement

In a real-world scenario, this would mean that the chatbot powered by the AI model will try harder to please the user, agreeing with them even when it knows they are wrong. It might also manipulate the user after providing an inaccurate response.

It should be highlighted that the scores are high, but AI companies also add external guardrails (not part of the AI model itself but built into the chatbot's system) that often suppress these tendencies. However, a possibility remains that Grok might agree with a user's delusions or paranoia and end up amplifying their belief.

Advertisement

Separately, it also has a false negative rate of 0.20 for biology-related prompt injections, which means one out of five malicious prompts around the topic can slip past the guardrails, and the AI model will respond to the query.

Notably, it is still too early to gauge how these numbers on paper will translate into the real world. It is also possible that xAI developers are already working on fine-tuning techniques to minimise the risks associated with the model. However, the numbers do highlight the need to be careful when interacting with Grok, especially when sharing sensitive information with it.

 

Catch the latest from the Consumer Electronics Show on Gadgets 360, at our CES 2026 hub.

Advertisement

Related Stories

Popular Mobile Brands
  1. Redmi Note 15 5G Launched in India With 108-Megapixel Camera at This Price
  2. Realme 16 Pro Series With 7,000mAh Battery Debuts in India: See Price
  3. Motorola Unveils Razr Fold as its First Book-Style Foldable at CES
  4. Redmi Pad 2 Pro 5G With 12,000mAh Battery Arrives in India: See Price
  5. Motorola Unveils Signature Phone With Four 50-Megapixel Cameras
  6. Realme 16 Pro+, Realme 16 Pro Review: A New Dawn for Realme
  7. Vivo X200T Said to Launch in India With 'Aggressive' Pricing
  8. Realme Buds Air 8 Launched in India With Up to 58 Hours of Total Battery Life
  9. Redmi Note 15 5G First Impressions
  10. Samsung Galaxy Z Fold 8, Galaxy Z Flip 8 Listed on IMEI Database: Report
  1. Motorola Unveils Signature Phone With Snapdragon 8 Gen 5 Chip and 50-Megapixel Sony LYTIA Cameras: Price, Specifications
  2. CES 2026: Motorola Razr Fold Announced With 2K LTPO Inner Display, 50-Megapixel Triple Cameras
  3. Self-Driving Cars Could Prevent Over 1 Million Road Injuries Across the U.S. by 2035
  4. Astronomers Measure Mass and Distance of a Rogue Planet for the First Time in History
  5. The Rip OTT Release Date: When and Where to Watch it Online?
  6. Netflix’s One Last Adventure Takes Fans Inside the Making of Stranger Things 5
  7. Heer Express Streaming Now on JioHotstar: Know Everything About This Romance Comedy Film
  8. Akhanda 2: Thaandavam OTT Release Date Reportedly Postponed: What You Need to Know
  9. Naai Sekar Streaming Now on SunNXT: Know Everything About This Tamil Comedy Drama Film
  10. Samsung Galaxy Z Fold 8, Galaxy Z Flip 8 Reportedly Listed on IMEI Database Months Ahead of Anticipated Launch
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.