Microsoft Introduces Maia 200 Chipset for AI Inference, Will Power OpenAI’s GPT-5.2

Microsoft is also inviting developers and AI startups to explore model and workload optimisation with the new Maia 200 SDK.

Advertisement
Written by Akash Dutta, Edited by Ketan Pratap | Updated: 27 January 2026 13:57 IST
Highlights
  • The Maia 200 chipset features more than 140 billion transistors
  • It delivers more than 10 petaflops performance in 4-bit precision
  • It is the successor to the Maia 100 chipset, released in 2023

Microsoft says Maia 200 can run the largest AI models in existence, with room for bigger future models

Photo Credit: Microsoft

Microsoft unveiled its newest artificial intelligence (AI) accelerator, Maia 200 chip, on Monday. It is a purpose-built chipset design for faster AI inference, and is said to cut the cost of running large language models (LLMs) at scale. The new enterprise-focused processor is the successor to the Maia 100, which was launched in 2023. The Maia 200 is currently being deployed in Microsoft's Azure cloud data centres, starting in the US. The company highlighted that its new chip will power the latest models, such as OpenAI's GPT 5.2.

Microsoft Unveils Maia 200 Chipset for AI Workloads

In a blog post, the Redmond-based tech giant announced and detailed its latest AI chipset. Maia 200 is built on Taiwan Semiconductor Manufacturing Corporation's (TSMC) 3nm process, and each chip contains more than 140 billion transistors. Microsoft said the chips will also feature a custom memory and communication architecture tailored specifically for inference workloads. The advanced design helps maximise the speed at which the chip can process data and keep AI models “fed” with information.

Advertisement

A key part of Maia 200's performance comes from its support for low-precision compute formats such as 4-bit (FP4) and 8-bit (FP8) operations. These formats allow AI models to generate responses more quickly and with lower energy use compared with traditional higher-precision computing. Microsoft said Maia 200 delivers in excess of 10 petaFLOPS (quadrillions of floating-point operations per second) in FP4 mode and over 5 petaFLOPS in FP8 mode, making it well-suited for modern LLMs and other AI systems that are used in real-time applications.

Maia 200 also includes 216GB of high-bandwidth memory (HBM3e) with 7TBps bandwidth and 272MB of on-chip SRAM. High-bandwidth memory lets the chip quickly access and move large amounts of data, which is a common bottleneck in AI workloads. The addition of on-chip SRAM helps reduce delays when models need frequent access to smaller, critical data sets, improving responsiveness for inference tasks.

Advertisement

At the system level, Microsoft has designed Maia 200 to scale efficiently across large clusters. Each chip supports 2.8TBps bi-directional bandwidth, and groups of up to 6,144 accelerators can be connected together using standard Ethernet networking. This scalable architecture allows data centre operators to deploy many Maia 200 chips in a rack or across nodes, increasing the throughput available for demanding AI services while keeping power use and costs under control.

One of the central goals behind Maia 200 is to improve performance per dollar, a key metric for inference infrastructure where organisations pay for both compute and energy. Microsoft said Maia 200 delivers around 30 percent better performance per dollar than the hardware the company currently uses in its fleet.

Advertisement

Microsoft is currently previewing a Maia software development kit (SDK) that includes tools such as a Triton compiler, PyTorch support, an optimised kernel library and low-level programming support, enabling developers to build and tune models for the Maia 200 platform.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. Samsung's One UI 9 Beta Is Now Available to Test on the Galaxy S26 Series
  2. Pine Labs Says AI Agents Can Now Complete UPI Payments Without MPIN
  3. Nothing's Ear 3a Could Arrive With Familiar Price Tag, New Colourway
  4. Nothing CEO Carl Pei Warns Smartphone Prices Could Rise Further in 2026
  5. New OTT Releases This Week: Bhooth Bangla, Raakh, Dridam, Karuppu, and More
  6. Samsung Galaxy S25 Edge Now Listed at Half of Its Launch Price in India
  7. Oppo Reno 16 Series Price, Storage Variants Leak Ahead of Launch
  8. Realme Narzo Days Sale Brings Discounts on These Narzo Series Phones
  1. Astronomers Discover Why Massive Galaxies Died Early in the Universe
  2. Nothing CEO Carl Pei Predicts Smartphones May Not Get Major Discounts During Sales Due to Ongoing Chip Shortage
  3. Samsung Galaxy S25 Edge Price in India Drops to All-Time Low: Specifications, Features
  4. Citi Debuts Blockchain-Based Marketplace Focused on Private Company Shares: Report
  5. Pine Labs Launches P3P Agentic Payment Protocol for Autonomous UPI Transactions in India
  6. Nothing Ear 3a to Arrive With Familiar Price Tag, New Colourway: Report
  7. The Evil Lawyer Out on OTT: Know Where to Stream This Thai Legal Thriller Web Series Online
  8. Honor X80 Pro Max Surfaces Online via Leaked Live Images; Tipster Reveals Key Specifications
  9. OnePlus Nord Buds 4 India Launch Teased as Company Reveals Colour, Design
  10. Sweet Magnolias Season 5 Now Streaming Online: What You Need to Know
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.