Anthropic Releases New Open-Source Tool That Evaluates How AI Models Behave

Dubbed Bloom, the AI tool creates a series of scenarios to test an AI model for a particular behavioural trait.

Written by Akash Dutta, Edited by Ketan Pratap | Updated: 23 December 2025 13:41 IST
Highlights
  • Researchers can tell Bloom which behaviour to test
  • The AI tool automates a lengthy and complex process
  • Bloom can be downloaded from GitHub

Anthropic also released a benchmark of four behaviours tested by the AI tool Bloom

Photo Credit: Anthropic

Anthropic released a new artificial intelligence (AI) tool last week that can test and gauge how an AI model behaves under normal and stressful circumstances. Dubbed Bloom, it is designed to automate the testing of a model's behavioural traits by generating a detailed set of scenarios as prompts and evaluating the responses. The San Francisco-based AI firm's tool is also open-source, meaning any interested developer or AI lab can download it to test models across various traits.

Anthropic Introduces Bloom to Test Model Behaviour

In a post, the Claude maker introduced and detailed the new AI tool. Anthropic says that testing an AI model's behaviour is important as it helps researchers learn whether the model is prone to becoming biased, prioritising self-preservation, or indulging in sycophancy. However, testing model behaviour has so far been a manual process, in which researchers create a detailed set of prompts to stress-test models and then evaluate the responses. The company says this process is lengthy and complex.

This is where Bloom comes in. Based on the specific behaviour a researcher requests, the tool creates sample evaluations locally, which can be refined until they capture the trait. Then, it runs these scenarios on the target model. Anthropic said that Bloom integrates with Weights & Biases, an experiment-tracking platform, for experiments at scale. It also exports “Inspect-compatible” transcripts, and transcripts can be viewed within the tool.

The functioning of the AI tool can be broken down into four broad stages. First, the tool analyses the requested behaviour and any example transcripts shared with it to build an understanding of the trait. Then, it ideates evaluation scenarios that can effectively capture and measure the trait. “Each scenario specifies the situation, simulated user, system prompt, and interaction environment,” the post mentioned. Interestingly, Bloom generates new scenarios every time, instead of relying on fixed sets.

Then, all scenarios are rolled out in parallel, with an AI agent simulating both the user and the tool responses to trigger the desired behaviour in the model. Finally, a judge model scores each transcript for the presence of the behaviour, and a meta-judge produces an analysis of the scores and data. Anthropic added that researchers can configure Bloom's behaviour by adjusting the length and modality of the interactions.
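
To make the four stages concrete, here is a minimal Python sketch of how such a pipeline could be wired together. It is not Bloom's actual code or API: every name in it (Scenario, understand, ideate, rollout, judge, meta_judge, run_pipeline, echo_target) is hypothetical, and simple placeholder functions stand in for the AI models that the real tool would call at each stage.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from statistics import mean


@dataclass
class Scenario:
    # The four fields the post says each scenario specifies.
    situation: str
    simulated_user: str
    system_prompt: str
    environment: str


def understand(behaviour, examples):
    # Stage 1 (placeholder): summarise the behaviour and example transcripts.
    return f"{behaviour} ({len(examples)} example transcripts)"


def ideate(understanding, n):
    # Stage 2 (placeholder): a real tool would ask a model to write scenarios.
    return [
        Scenario(
            situation=f"scenario {i} probing {understanding}",
            simulated_user="hypothetical user persona",
            system_prompt="hypothetical system prompt",
            environment="chat",
        )
        for i in range(n)
    ]


def rollout(scenario, target):
    # Stage 3: play one scenario against the target and record a transcript.
    return [scenario.situation, target(scenario.situation)]


def judge(transcript, behaviour):
    # Stage 4a (placeholder): 1.0 if the final reply names the behaviour.
    return 1.0 if behaviour in transcript[-1] else 0.0


def meta_judge(scores):
    # Stage 4b (placeholder): summarise the per-transcript scores.
    return {"mean_score": mean(scores), "n": len(scores)}


def run_pipeline(behaviour, examples, target, n=4):
    scenarios = ideate(understand(behaviour, examples), n)
    with ThreadPoolExecutor() as pool:  # rollouts run in parallel
        transcripts = list(pool.map(lambda s: rollout(s, target), scenarios))
    scores = [judge(t, behaviour) for t in transcripts]
    return meta_judge(scores)


def echo_target(prompt):
    # Stand-in target model that simply echoes the prompt back.
    return f"reply to: {prompt}"


report = run_pipeline("sycophancy", [], echo_target)
print(report)
```

In this sketch, echo_target could be swapped for a function that calls an actual model, and the keyword-matching judge for a model-based scorer; the stage boundaries would stay the same.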

Besides the tool, Anthropic has also released benchmark results from Bloom across four behaviours: delusional sycophancy, instructed long-horizon sabotage, self-preservation, and self-preferential bias. The company tested 16 different AI models, a mix of in-house and third-party models.

Since Bloom is open-source, interested individuals can download the AI tool from Anthropic's GitHub listing. The tool is available under a permissive MIT licence for both academic and commercial use cases.

 
