OpenAI Develops CriticGPT Model Capable of Spotting GPT-4 Code Generation Errors

CriticGPT is currently under development and has not been released to the public.

Advertisement
Written by Akash Dutta, Edited by David Delima | Updated: 28 June 2024 19:53 IST
Highlights
  • OpenAI has published a paper on its new CriticGPT model
  • CriticGPT was trained on short responses from ChatGPT
  • OpenAI used the RLHF framework to train its CriticGPT AI model

OpenAI has found that CriticGPT still hallucinates

Photo Credit: Pexels/Shantanu Kumar

OpenAI published a study about a new artificial intelligence (AI) model on Thursday that can catch GPT-4's mistakes in code generation. The AI firm stated that the new chatbot was trained using the reinforcement learning from human feedback (RLHF) framework and was powered by one of the GPT-4 models. The under-development chatbot was designed to improve the quality of the AI-generated code that users get from the large language models. At present, the model is not available to users or testers. OpenAI also highlighted several limitations of the model.

OpenAI Shares Details about CriticGPT

The AI firm shared details of the new CriticGPT model in a blog post, stating that it was based on GPT-4 and designed to identify errors in code generated by ChatGPT. "We found that when people get help from CriticGPT to review ChatGPT code they outperform those without help 60 percent of the time,” the company claims. The model was developed using the RLHF framework and the findings have been published in a paper.

Advertisement

RLHF is a machine learning technique that combines machine output with humans to train AI systems. In such a system, human evaluators provide feedback to the AI's performance. This is used to adjust and improve the model's behaviour. Humans who provide feedback to the AI are called AI trainers.

CriticGPT was trained on a large volume of code data that contained errors. The AI model was tasked with finding these mistakes and to critique the code. For this, AI trainers were asked to write the mistakes in the code on top of the naturally occuring mistakes, and then write example feedback as if they had caught those errors.

Advertisement

Once the CriticGPT shared its multiple variations of its critique, the trainers were asked to spot if the errors they inserted was caught by the AI alongside the naturally occurring errors. OpenAI, in its research, found that CriticGPT performed 63 percent better than ChatGPT in catching errors.

However, the model still has certain limitations. CriticGPT was trained on short strings of code generated by OpenAI. The model is yet to be trained on long and complex sets of tasks. The AI firm also found that the new chatbot continues to hallucinate (generate incorrect factual responses). Further, the model has not been tested in scenarios where multiple errors are dispersed in the code.

Advertisement

This model is unlikely to be made public as it is designed to help OpenAI better understand training techniques that can generate higher quality outputs. If CriticGPT does make it to public, it is believed to be integrated within ChatGPT.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement
Popular Mobile Brands
  1. Haier Launches HQLED P7 Pro Series With Google TV, Dolby Atmos
  2. Redmi Turbo 5 With 7,540mAh Battery Goes on Sale in India: Price, Offers
  3. Nothing Is Now Teasing the Launch of a Mysterious "b" Product Series
  4. Athiradi Now Available for Streaming on OTT: Where to Watch the Malayalam Action Comedy
  5. Samsung Galaxy M47 5G India Launch Teased, Will Go on Sale via Amazon
  6. New OTT Releases of the Week: Drishyam 3, Thukra ke Mera Pyar S2, and More
  7. Tecno Camon Slim Confirmed to Launch Soon; Design, Colours Teased
  8. GTA 6 Website Shows New Look at Vice City, Removes Release Date Mention
  9. Samsung Galaxy S27 Leak Shows No Major Camera, Display Upgrades
  10. Own an iPhone XS or iPhone 11? This Exploit Could Put Your Device at Risk
  1. JWST Watches HD 80606 bExoplanet Heat Up by 1,100 Degrees in Hours
  2. Reliance's Jio Platforms Files for Record $4 Billion IPO
  3. Nothing Teases Launch of Mysterious New “b” Product Series in India
  4. WhatsApp Begins Testing Online Indicator, New Feature to Manage Chat Backups on Android
  5. Rockstar Games Shares New Look at Vice City on GTA 6 Website, Removes Release Date Mentions
  6. UAE Reportedly Cracks Down on Social Media Use for Children Under 15, Mandates Age Verification
  7. Malta Seeks to Bring DAOs Under New DeFi Rules Aligned With MiCA
  8. Unpatchable Hardware Vulnerability Leaves Owners of Older iPhone XS, iPhone XR and iPhone 11 Models at Risk
  9. Haier HQLED P7 Pro Series Smart TVs Launched in India With Dolby Atmos, 50W Speakers
  10. Instagram Rolls Out Support for Multiple Captions on Carousel Posts
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.