AI Model GPT-3 Found to Reason as Well as College Undergraduate Students

"Surprisingly, not only did GPT-3 do about as well as humans but it made similar mistakes as well," researchers said.

Advertisement
By Press Trust of India | Updated: 2 August 2023 18:02 IST
Highlights
  • GPT-3 was asked to solve reasoning problems typical tests such as the SAT
  • Scientists asked 40 UCLA students to solve the same problems
  • AI did worse than students in solving analogies based on short stories

The AI tool was found to perform better than the humans' average score in SAT analogies

Photo Credit: Reuters

GPT-3, the popular AI-powered tool, was found to reason as well as college undergraduate students, scientists have found.

The artificial intelligence large language model (LLM) was asked to solve reasoning problems that were typical of intelligence tests and standardised tests such as the SAT, used by colleges and universities in the US and other countries to make admissions decisions.

The researchers from the University of California - Los Angeles (UCLA), US, asked GPT-3 to predict the next shape which followed a complicated arrangement of shapes. They also asked the AI to answer SAT analogy questions, all the while ensuring that the AI would have never encountered these questions before.

Advertisement

They also asked 40 UCLA undergraduate students to solve the same problems.

Advertisement

In the shape prediction test, GPT-3 was seen to solve 80 percent of the problems correctly, between the humans' average score of just below 60 percent and their highest scores.

"Surprisingly, not only did GPT-3 do about as well as humans but it made similar mistakes as well," said UCLA psychology professor Hongjing Lu, senior author of the study published in the journal Nature Human Behaviour.

Advertisement

In solving SAT analogies, the AI tool was found to perform better than the humans' average score. Analogical reasoning is solving never-encountered problems by comparing them to familiar ones and extending those solutions to the new ones.

The questions asked test-takers to select pairs of words that share the same type of relationships. For example, in the problem "'Love' is to 'hate' as 'rich' is to which word?," the solution would be "poor".

Advertisement

However, in solving analogies based on short stories, the AI did less well than students. These problems involved reading one passage and then identifying a different story that conveyed the same meaning.

"Language learning models are just trying to do word prediction so we're surprised they can do reasoning," Lu said. "Over the past two years, the technology has taken a big jump from its previous incarnations." Without access to GPT-3's inner workings, guarded by its creator, OpenAI, the researchers said they were not sure how its reasoning abilities worked, that whether LLMs are actually beginning to "think" like humans or are doing something entirely different that merely mimics human thought.

This, they said, they hope to explore.

"GPT-3 might be kind of thinking like a human. But on the other hand, people did not learn by ingesting the entire internet, so the training method is completely different.

"We'd like to know if it's really doing it the way people do, or if it's something brand new - a real artificial intelligence - which would be amazing in its own right," said UCLA psychology professor Keith Holyoak, a co-author of the study.


Samsung launched the Galaxy Z Fold 5 and Galaxy Z Flip 5 alongside the Galaxy Tab S9 series and Galaxy Watch 6 series at its first Galaxy Unpacked event in South Korea. We discuss the company's new devices and more on the latest episode of Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.
Affiliate links may be automatically generated - see our ethics statement for details.
 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Further reading: AI, GPT 3, AI Model, OpenAI
Advertisement

Related Stories

Popular Mobile Brands
  1. Bridgerton Season 4 Premieres in Two Parts on Netflix: See Details
  2. Sister Midnight Streaming Online: Everything You Need to Know
  3. All the Details About Kunal Khemu's Comedy Drama 'Single Papa'
  4. Scientists Track Glowing Green Comet 3I/ATLAS as It Nears Earth
  5. Nandamuri Balakrishna's Akhanda 2 Arrives on OTT in 2026
  1. Early Earth’s Deep Mantle May Have Held More Water Than Previously Believed, Study Finds
  2. Nandamuri Balakrishna's Akhanda 2 Arrives on OTT in 2026: When, Where to Watch the Film Online?
  3. Single Papa Now Streaming on OTT: All the Details About Kunal Khemu’s New Comedy Drama Series
  4. Scientists Study Ancient Interstellar Comet 3I/ATLAS, Seeking Clues to Early Star System Formation
  5. Bridgerton Season 4 to Release in Two Parts on OTT: When and Where to Watch It Online?
  6. Spider-Like Scar on Jupiter’s Moon Europa Could Indicate Subsurface Salty Water
  7. Wake Up Dead Man: A Knives Out Mystery Now Streaming on Netflix: Everything You Need to Know
  8. Secret Rain Pattern May Have Driven Long Spells of Dry and Wetter Periods Across Horn of Africa: Study
  9. Sister Midnight Out on OTT: Know Where to Watch This Radhika Apte-Starrer Online
  10. JWST Detects Thick Atmosphere on Ultra-Hot Rocky Exoplanet TOI-561 b
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.