Grab Superapp Says AI Models Struggle to Understand Asian Languages

Grab said that it had to develop an in-house AI model due to the unreliability of both proprietary and open-source AI models.

Advertisement
Written by Akash Dutta, Edited by Ketan Pratap | Updated: 4 November 2025 13:41 IST
Highlights
  • Grab has now built a specialised vision LLM for the eKYC process
  • The model is extracting information from user-submitted documents
  • Grab used both online and synthetic datasets to train the model

AI models struggle to understand non-English languages due to the limited datasets

Photo Credit: Unsplash/Rohan Solankurkar

Grab, the Singapore-based superapp company, highlighted on Monday that it was forced to develop an in-house artificial intelligence (AI) model for internal use. It is a lightweight vision large language model (LLM) that can scan documents and extract information from them. The company said the decision to develop the model was made as both proprietary and open-source models were not good at understanding Southeast Asian languages. The company's statement has raised fresh concerns around the accessibility of frontier models by Google, OpenAI, and Anthropic.

AI Models' Struggle With Non-English Languages

In a blog post detailing the architecture and training process of their in-house vision model, Grab highlighted the shortcomings they experienced when they tried to outsource the technology. “While powerful proprietary Large Language Models (LLMs) were an option, they often fell short in understanding SEA languages, produced errors, hallucinations, and had high latency. On the other hand, open-sourced Vision LLMs were more efficient but not accurate enough for production,” the post mentioned.

Advertisement

AI models' struggle with non-English languages is not a new finding. For years, researchers have pointed it out, and AI players have tried to fix the issue. However, despite gaining basic competence in popular foreign languages such as Hindi, Japanese, Spanish (Latin America and Spain), and Chinese, the models have yet to understand the lexicon enough to differentiate between the nuances. So, they might be useful in general conversations, but for enterprise or research-based needs, the applicability falls short.

For instance, a paper published earlier this year found that even AI models developed by Chinese companies are as bad in Chinese minority languages as are Western models. And the issue persists in both proprietary models from Google, OpenAI, Meta, and Anthropic, as well as in open-source models.

Advertisement

The reason behind this struggle is the lack of readily available, adequate datasets to train the model on these languages. This is one of the reasons major AI companies are partnering with Indian companies and institutions to collect more Indic language datasets. In July, Google teamed up with IIT Bombay to develop Indic language AI speech models. Meta is reportedly paying $55 an hour to contractors to train its models in the Hindi language, and OpenAI has announced a research collaboration with IIT Madras, backed by $500,000 from the ChatGPT maker.

While collecting data this way is expensive, it is still possible to eventually build large enough datasets in prominent Asian and other languages. However, the minority languages, such as the non-scheduled Indian languages, will still be a struggle for these models to gain competence in. And unless they can learn these languages, accessibility and functionality will always be limited.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Nothing Phone 4a Pro Review: A Big Leap
  2. Here's When the Redmi 15A 5G Will Be Launched in India
  3. Samsung Will Launch New Galaxy A-Series Smartphones in India on This Date
  4. Have a Large Following on TikTok, YouTube or Instagram? Facebook Will Pay You
  5. Samsung Galaxy Forever Offers Easy Upgrade, Return Option in India
  6. OnePlus 15T Camera Features Teased Ahead of March 24 Launch
  7. OnePlus Nord 6 Specifications Surface as Tipster Leaks Photo of Retail Box
  1. Huawei MatePad 11.5, MatePad SE 11 Full Specifications List Revealed via Flipkart Listings
  2. Boat Valour Watch 1R Launched in India With 1.43-Inch AMOLED Display, Up to 10-Day Battery Life: Price, Features
  3. Xiaomi SU7 EV Refreshed With 902km CLTC Range, Nvidia Thor AGX-Powered Assisted Driving Features
  4. Samsung Sets March 25 Launch Date for New Galaxy A-Series in India; Galaxy A37, Galaxy A57 Design Tipped Again
  5. Anthropic Study Finds People Don’t Really Want AI for Creative Work
  6. Bitcoin Trades Near $71,000 as Crypto Market Weathers Ongoing Macroeconomic Pressures
  7. Redmi 15A 5G India Launch Date Announced; Design and Specifications Teased
  8. World Happiness Report 2026: Heavy Social Media Use Linked to Lower Life Satisfaction Among Teenagers
  9. Oppo K14 5G With 7,000mAh Battery, 50-Megapixel Camera Goes on Sale in India: Price, Offers
  10. Oppo Find X9 Ultra, Find X9s Appearance on SIRIM Certification Database Signals Imminent Launch
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.