Anthropic Introduces PDF Image Understanding With Claude 3.5 Sonnet AI Model

Claude can now understand complex PDFs filled with charts and graphics.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 4 November 2024 16:30 IST
Highlights
  • The PDF image understanding feature is available in open beta currently
  • Earlier, Claude could understand the text in PDF files
  • Claude can also answer queries based on these images

The Claude AI feature is available to all users

Photo Credit: Anthropic

Anthropic released another new artificial intelligence (AI) feature for its chatbot Claude on Friday. The feature, dubbed PDF image understanding, now allows Claude to see and process images embedded within PDF files including charts and graphics. This capability has been added to the recently released Claude 3.5 Sonnet AI model. The company claims that this ability will allow the chatbot to accurately understand complex documents and offer better analysis of the data. The Anthropic application packaging interface (API) also supports PDF inputs. This feature is available in beta.

Anthropic Releases PDF Image Understanding for Claude

In its support documents, Anthropic detailed the new PDF support feature. The image understanding capability in PDF has been added to the Claude 3.5 Sonnet version 20241022, and it can process images in PDF as well as support PDF inputs.

Breaking down the first capability, Claude can now see and process images, charts, and graphics added to a PDF to perform a deeper analysis of the document. Once done, users can ask the AI queries about the particular images and it can answer with relevant information.

Advertisement

So far, Claude accepted images as input and could answer queries about them, however, it could not process images attached to a document. With this feature, Anthropic now allows users to get responses about PDFs in greater detail. The feature is likely aimed at the enterprise users of the chatbot who use it to analyse sales and marketing documents as well as other such files.

Advertisement

Claude 3.5 Sonnet now also accepts PDF as an input, which means users can now upload PDF files directly and let users ask queries about them. This brings Claude's capabilities on par with Google's NotebookLM, which is a dedicated platform for PDF and other file types.

Currently, the maximum file size of a PDF uploaded to Claude can be 32MB with a maximum page count of 1,000. Additionally, the chatbot cannot process PDFs which are password-protected or have encryption on them. Anthropic will make the feature available on Amazon Bedrock and Google Vertex AI soon.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Sony Could Finally Launch the PS5 Pro in India, BIS Listing Suggests
  2. Vivo T5x 5G Will Launch in India Next Week With These Features
  3. Xiaomi Pad 8 Launched in India With Snapdragon 8s Gen 4 SoC, 9,200mAh Battery
  4. iQOO Z11 Teased With 165Hz Display, 9,020mAh Battery; China Launch Expected Soon
  5. OnePlus 15T White Colourway, Key Display Features Revealed
  1. Jupiter Resumes Direct Motion This March as the Gas Giant Hits Peak Visibility for 2026 Skywatchers
  2. Samsung Testing 12,000mAh, 18,000mAh Batteries With Dual Cell and Triple Cell Designs, Leaked Reports Show
  3. OnePlus 15T White Colourway, Key Display Features Revealed as Company Opens Pre-Orders in China
  4. Microsoft Could Reportedly Price Next-Gen Xbox 'Project Helix' at $1,000 or More
  5. Ravam: Sound of Soul Streaming on AhaVideo: What You Need to Know About This New Horror Thriller
  6. Thailand Targets Crypto Mule Accounts Linked to Scams, Illegal Transfers as Authorities Freeze 10,000 Wallets
  7. Infinix GT 50 Pro 5G Real-Life Images Surface Online as Smartphone Arrives on BIS Database
  8. Microsoft’s New Copilot Cowork Can Take Actions and Autonomously Complete Tasks
  9. Mardaani 3 Set for OTT Release Soon: What You Need to Know About Shivani Shivaji Roy’s Return
  10. Lenovo Tab Plus Gen 2 Spotted in Leaked Renders That Point to Significant Design Overhaul
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.