DeepSeek-OCR Open-Source AI Model Changes How AI Models Read and Process Plain Text

DeepSeek-OCR AI model brings a new approach to compressing long context text via optical 2D mapping.

Written by Akash Dutta, Edited by Ketan Pratap | Updated: 21 October 2025 17:21 IST
Highlights
  • The DeepSeek model is currently available on GitHub
  • Within 24 hours of release, it received over 6K likes
  • The model turns text into pixels to improve its context memory

DeepSeek-OCR can compress a 1,000-word article into 100 visual tokens

Photo Credit: Reuters

DeepSeek on Monday released a new open-source artificial intelligence (AI) model that changes how such models analyse and process plain text. Dubbed DeepSeek-OCR, it uses optical 2D mapping to render text as pixels, compressing long context into a more manageable size. The AI startup claims that large language models (LLMs) process pixels more efficiently than text, and that the compression lets them capture more relevant information when generating a response. The new approach is also said to produce more accurate results than traditional methods.

DeepSeek-OCR Introduces Novel Technique to Process Text

Based on optical character recognition (OCR) technology, the latest DeepSeek AI model uses a new method to process information. It first converts plain text into images, and then analyses those images to generate responses. The promise is that, by reading text as an image, the model can also compress and store large chunks of a document in a form that is easier to remember and reason over.

At its core, the model introduces “Context Optical Compression,” an approach that turns long pages of text into images and then has the model convert those images into a highly condensed “vision token” representation, which is much smaller than the usual text-token representation. To illustrate the compression, the makers say that a 1,000-word article could be processed with just 100 vision tokens.
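To put the 1,000-word example in perspective, here is a rough back-of-the-envelope sketch in Python. The 100 vision tokens come from the claim above; the figure of about 1.3 text tokens per English word is only a common rule of thumb assumed here for illustration, and both function names are hypothetical, not part of DeepSeek-OCR.

```python
def estimate_text_tokens(word_count: int, tokens_per_word: float = 1.3) -> int:
    """Estimate how many text tokens a document needs.

    tokens_per_word is an assumed rule-of-thumb average, not a measured value.
    """
    return round(word_count * tokens_per_word)


def compression_ratio(text_tokens: int, vision_tokens: int) -> float:
    """Return how many text tokens each vision token stands in for."""
    if vision_tokens <= 0:
        raise ValueError("vision_tokens must be positive")
    return text_tokens / vision_tokens


article_words = 1000          # the article's example document size
article_vision_tokens = 100   # the article's claimed vision-token count

article_text_tokens = estimate_text_tokens(article_words)
ratio = compression_ratio(article_text_tokens, article_vision_tokens)
print(f"~{article_text_tokens} text tokens vs {article_vision_tokens} vision tokens "
      f"(about {ratio:.0f}x fewer)")
```

Under that assumption, the claimed example works out to roughly an order-of-magnitude reduction in tokens. The real ratio depends on the tokeniser and on the document.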


How the model works is also interesting. First, an image of the document is captured. A vision encoder, a custom module built by the researchers, then analyses the image and breaks it into smaller patches. These patches are compressed into a much smaller number of vision tokens. Finally, a decoder takes the vision tokens and reconstructs the textual meaning.
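The patch-then-compress steps described above can be sketched with a toy example. This is not DeepSeek's encoder: the real module is a learned neural network, whereas the sketch below simply averages neighbouring patches to show how the token count shrinks. The image size, patch size, group size, and function names are all assumptions chosen for illustration, and the decoder step is left out.

```python
from typing import List

Image = List[List[float]]   # 2D grayscale image, one float per pixel
Vector = List[float]        # a flattened patch or a compressed token


def split_into_patches(image: Image, patch_size: int) -> List[Vector]:
    """Cut the image into non-overlapping square patches, in row-major order."""
    height, width = len(image), len(image[0])
    if height % patch_size or width % patch_size:
        raise ValueError("image dimensions must be multiples of patch_size")
    patches = []
    for top in range(0, height, patch_size):
        for left in range(0, width, patch_size):
            patches.append([
                image[row][col]
                for row in range(top, top + patch_size)
                for col in range(left, left + patch_size)
            ])
    return patches


def compress_patches(patches: List[Vector], group_size: int) -> List[Vector]:
    """Average each run of group_size patches into one token.

    Plain averaging is a stand-in for the learned compression step.
    """
    if len(patches) % group_size:
        raise ValueError("patch count must be a multiple of group_size")
    tokens = []
    for start in range(0, len(patches), group_size):
        group = patches[start:start + group_size]
        tokens.append([sum(values) / group_size for values in zip(*group)])
    return tokens


# A synthetic 64x64 checkerboard stands in for a rendered page of text.
page = [[float((row + col) % 2) for col in range(64)] for row in range(64)]

patches = split_into_patches(page, patch_size=16)
vision_tokens = compress_patches(patches, group_size=4)
print(len(patches), "patches ->", len(vision_tokens), "vision tokens")
```

In this toy setup, 16 patches collapse into 4 tokens. A decoder would then have to recover the text from those few tokens, which is the part that requires a trained model.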

Because the AI model is working with far fewer tokens, the downstream language model (or reasoning module) has less memory burden and can handle longer content or bigger documents.
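A small sketch can show why fewer tokens ease the load. It assumes standard full self-attention, where every token is compared with every other token, so the number of attention scores grows with the square of the sequence length. The token counts are illustrative assumptions, not measurements of DeepSeek-OCR.

```python
def attention_score_count(sequence_length: int) -> int:
    """Number of pairwise attention scores per head and layer in full self-attention."""
    return sequence_length * sequence_length


long_text_tokens = 1300     # assumed text-token count for a long article
short_vision_tokens = 100   # assumed vision-token count for the same article

text_cost = attention_score_count(long_text_tokens)
vision_cost = attention_score_count(short_vision_tokens)
print(f"text: {text_cost:,} scores, vision: {vision_cost:,} scores, "
      f"{text_cost // vision_cost}x fewer")
```

Memory that grows linearly with token count, such as a key-value cache, would shrink in direct proportion instead, so the saving depends on which cost dominates.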

Andrej Karpathy, Co-Founder of OpenAI and former Director of AI at Tesla, praised DeepSeek-OCR for its novel implementation of vision tokens. He said that the approach could lead to higher efficiency and has the potential for bidirectional attention. He also said that this method could lead to the elimination of the tokeniser, which would make models more efficient.


For those who want to try out DeepSeek-OCR, the model is currently hosted on GitHub, where it received more than 6,700 likes within 24 hours of release. It is available under the permissive MIT licence for both academic and commercial use.

 
