Has This Artificial Intelligence Model Invented Its Own Secret Language?

The fact that AI language models do not interpret the text in the same manner humans do supports this theory.

Advertisement
By Edited by Gadgets 360 Newsdesk | Updated: 7 June 2022 15:23 IST
Highlights
  • It is difficult to determine exactly how AIs arrive at their conclusions
  • The research was conducted by Giannis Daras and Alexandros G. Dimakis
  • DALL-E 2 is unlikely to feature a hidden language

DALL-E 2 employs byte-pair encoding (BPE), which is a halfway solution.

Photo Credit: Twitter / Giannis Daras

Based on a written cue, a new generation of artificial intelligence (AI) models can make “creative” visuals on demand. Imagen, MidJourney and DALL-E 2 are just a few examples of how new technologies are changing the way creative content is created, with ramifications for copyright and intellectual property. While the output from these models is frequently impressive, it is difficult to determine exactly how they arrive at their conclusions. Researchers in the United States claimed last week that the DALL-E 2 model may have established its own hidden language to communicate about objects.

The research was conducted by Giannis Daras and Alexandros G. Dimakis, both students at the University of Texas at Austin. By asking the AI to create photos with text captions and then feeding the captions back into the system, the researchers discovered that DALL-E 2 thinks 'Apoploe vesrreaitais' means 'birds', 'contarra ccetnxniams luryca tanniounons' means 'bugs or pests', 'vicootes' means 'vegetables' and 'wa ch zod rea' means 'sea creatures that a whale might eat'.

DALLE-2 has a secret language.
"Apoploe vesrreaitais" means birds.
"Contarra ccetnxniams luryca tanniounons" means bugs or pests.

Advertisement

The prompt: "Apoploe vesrreaitais eating Contarra ccetnxniams luryca tanniounons" gives images of birds eating bugs.

Advertisement

A thread (1/n)???? pic.twitter.com/VzWfsCFnZo

— Giannis Daras (@giannis_daras) May 31, 2022

These statements are intriguing, and if accurate, they could have significant ramifications for the security and interpretability of this type of huge AI model. DALL-E 2 is unlikely to feature a hidden language.

Advertisement

"It might be more accurate to say it has its own vocabulary – but even then we can't know for sure," wrote Daras in a report published in The Conversation.

To begin with, it's difficult to validate any claims made regarding DALL-E 2 and other huge AI models at this point because only a few researchers and creative practitioners have access to them. Daras added that any photographs that are publicly posted should be taken with a grain of salt, as they have been cherry-picked by a human from a vast number of AI output images.

Advertisement

One theory is that the gibberish sentences are derived from the non-English vocabulary. Apoploe, for example, which appears to conjure images of birds, is related to Apodidae, the scientific name of a family of bird species in Latin. DALL-E 2, for example, was trained on a wide range of data scraped from the internet, including a large number of non-English terms.

The fact that AI language models do not interpret the text in the same manner humans do supports this theory. Instead, before analysing the text, they break it down into 'tokens', said Daras. Treating each word as a token may seem straightforward, but it might be problematic when identical tokens have various meanings. For example, 'match' signifies different meanings when playing tennis and when lighting a fire, Daras pointed out.

Treating each character as a token, on the other hand, results in a lower number of viable tokens, but each one transmits far less relevant information.

DALL-E 2 employs byte-pair encoding (BPE), which is a halfway solution. Examining the BPE representations for some of the gibberish words reveals that this could be a key aspect in deciphering the code. In any case, none of these possibilities are complete explanations for what's going on. When individual characters are removed from these sentences, for example, the resultant visuals appear to be corrupted in very precise ways. Individual gibberish words don't always combine to form logical compound visuals, it appears.

Overall, DALL-E 2's hidden language poses questions about interpretability. The researchers, through their latest report, want these models to act like humans, but seeing organised output in response to gibberish defies their expectations.

However, another Twitter thread has rejected the recent claims, by stating that 'Contarra ccetnxniams luryca tanniounons' into DALL-E 2 does not limit the search to bugs and pests, but also display images of other animals. 


How is Alexa faring in India? We discuss this on Orbital, the Gadgets 360 podcast. Orbital is available on Spotify, Gaana, JioSaavn, Google Podcasts, Apple Podcasts, Amazon Music and wherever you get your podcasts.
Affiliate links may be automatically generated - see our ethics statement for details.
 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. Nothing Announces Offers on Phones, Wearables During Flipkart Sale
  2. Vivo Y31 Series With 6,500mAh Battery Launched in India: See Price
  3. [Exclusive] Noise to Launch Flagship Master Series Over-Ear Headphones Soon
  4. Samsung Begins Rolling Out One UI 8 Update to the Galaxy S25 Series
  5. Flipkart Big Billion Days Sale: Discounts on Motorola Phones Announced
  6. iOS 26 Released Alongside iPadOS 26, macOS Tahoe: Here's How to Download It
  7. iQOO 15 Live Image Leaked; Company Reveals Display Details
  8. Samsung Galaxy M36 Review: All Style, No Substance?
  9. Xiaomi 17 Pro Max Tipped to Come With a Secondary Display
  1. iOS 26 Update Released Alongside iPadOS 26 and macOS Tahoe: Check Eligible Models, How to Download
  2. Scientists Propose Space Missions to Chase Down Interstellar Comets
  3. Iceland Plume Discovery Reveals Ancient Volcanic Funnels Across North Atlantic
  4. Huawei Watch Ultimate 2 Design Renders Leaked, Could Launch Soon
  5. Marvel's Wolverine Will Reportedly Launch in 2026; Insomniac's Venom Game in 'Active Development'
  6. US President Donald Trump Challenges Block on Removing US Fed’s Lisa Cook
  7. iPhone 17 Series Outpaces iPhone 16 in Demand While iPhone 17 Pro Max Tops Pre-Orders, Analyst Says
  8. iPhone 16 Remained Top Selling Smartphone For Second Consecutive Quarter Globally: Report
  9. Samsung Galaxy S25 FE Launched in India With 6.7-Inch AMOLED Screen, 50-Megapixel Camera: Price, Features
  10. iPhone 18 Series Tipped to Feature Smaller Dynamic Island, Might Launch Without Under-Display Face ID
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.