Microsoft Introduces Maia 200 Chipset for AI Inference, Will Power OpenAI’s GPT-5.2

Microsoft is also inviting developers and AI startups to explore model and workload optimisation with the new Maia 200 SDK.

Advertisement
Written by Akash Dutta, Edited by Ketan Pratap | Updated: 27 January 2026 13:57 IST
Highlights
  • The Maia 200 chipset features more than 140 billion transistors
  • It delivers more than 10 petaflops performance in 4-bit precision
  • It is the successor to the Maia 100 chipset, released in 2023

Microsoft says Maia 200 can run the largest AI models in existence, with room for bigger future models

Photo Credit: Microsoft

Microsoft unveiled its newest artificial intelligence (AI) accelerator, Maia 200 chip, on Monday. It is a purpose-built chipset design for faster AI inference, and is said to cut the cost of running large language models (LLMs) at scale. The new enterprise-focused processor is the successor to the Maia 100, which was launched in 2023. The Maia 200 is currently being deployed in Microsoft's Azure cloud data centres, starting in the US. The company highlighted that its new chip will power the latest models, such as OpenAI's GPT 5.2.

Microsoft Unveils Maia 200 Chipset for AI Workloads

In a blog post, the Redmond-based tech giant announced and detailed its latest AI chipset. Maia 200 is built on Taiwan Semiconductor Manufacturing Corporation's (TSMC) 3nm process, and each chip contains more than 140 billion transistors. Microsoft said the chips will also feature a custom memory and communication architecture tailored specifically for inference workloads. The advanced design helps maximise the speed at which the chip can process data and keep AI models “fed” with information.

Advertisement

A key part of Maia 200's performance comes from its support for low-precision compute formats such as 4-bit (FP4) and 8-bit (FP8) operations. These formats allow AI models to generate responses more quickly and with lower energy use compared with traditional higher-precision computing. Microsoft said Maia 200 delivers in excess of 10 petaFLOPS (quadrillions of floating-point operations per second) in FP4 mode and over 5 petaFLOPS in FP8 mode, making it well-suited for modern LLMs and other AI systems that are used in real-time applications.

Maia 200 also includes 216GB of high-bandwidth memory (HBM3e) with 7TBps bandwidth and 272MB of on-chip SRAM. High-bandwidth memory lets the chip quickly access and move large amounts of data, which is a common bottleneck in AI workloads. The addition of on-chip SRAM helps reduce delays when models need frequent access to smaller, critical data sets, improving responsiveness for inference tasks.

Advertisement

At the system level, Microsoft has designed Maia 200 to scale efficiently across large clusters. Each chip supports 2.8TBps bi-directional bandwidth, and groups of up to 6,144 accelerators can be connected together using standard Ethernet networking. This scalable architecture allows data centre operators to deploy many Maia 200 chips in a rack or across nodes, increasing the throughput available for demanding AI services while keeping power use and costs under control.

One of the central goals behind Maia 200 is to improve performance per dollar, a key metric for inference infrastructure where organisations pay for both compute and energy. Microsoft said Maia 200 delivers around 30 percent better performance per dollar than the hardware the company currently uses in its fleet.

Advertisement

Microsoft is currently previewing a Maia software development kit (SDK) that includes tools such as a Triton compiler, PyTorch support, an optimised kernel library and low-level programming support, enabling developers to build and tune models for the Maia 200 platform.

 

Get your daily dose of tech news, reviews, and insights, in under 80 characters on Gadgets 360 Turbo. Connect with fellow tech lovers on our Forum. Follow us on X, Facebook, WhatsApp, Threads and Google News for instant updates. Catch all the action on our YouTube channel.

Advertisement

Related Stories

Popular Mobile Brands
  1. HMD Vibe 2 5G India Launch Teased: Expected Design, Specifications
  2. Here's When Samsung's Galaxy S25 Series Might Get the One UI 8.5 Update
  3. Xiaomi 17T Key Specifications Revealed Via Benchmarking Site
  4. Vivo Y600 Pro With 10,200mAh Battery Arrives at This Price
  5. MediaTek Dimensity 7450, Dimensity 7450X Released: All You Need to Know
  1. Vivo Y500s Launched With 7,200mAh Battery, 50-Megapixel Main Camera: Price, Spcifications
  2. Samsung Galaxy S25 Series Likely to Begin Receiving Stable One UI 8.5 Update Soon: Report
  3. Banking Circle Launches Stablecoin Settlement Services After Bagging CASP Licence
  4. Qualcomm Sets Snapdragon for India Event for May 7 as OnePlus Gears Up for Nord CE 6 Launch
  5. Meta AI Business Assistant Expanded to Global Markets, to Let Advertisers Optimise Marketing Campaigns
  6. Vivo Y600 Pro Launched With 10,200mAh Battery, 50-Megapixel Camera: Price, Specifications
  7. Huawei Mate XT 2 Tipped to Launch in October With Upgraded Hinge, Kirin 9050 Pro Chip
  8. Samsung Galaxy Z Fold 8, Z Flip 8 and Z Wide Fold Dummy Units Hint at Design, Wireless Charging Support
  9. HMD Vibe 2 5G India Launch Teased: Expected Design, Key Specifications
  10. Aave Labs Urges Arbitrum DAO to Release $73 Million in Frozen ETH for rsETH Recovery
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2026. All rights reserved.