Amazon Nova Sonic Audio Generation AI Model Released, Can Process Speech in Real-Time

Amazon Nova Sonic AI model comes with a context window of 32,000 tokens.

Advertisement
Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 9 April 2025 11:59 IST
Highlights
  • The model offers up to eight minutes of speech generation per session
  • Amazon says Nova Sonic can understand the context behind the input speech
  • Currently, it only supports the English language in multiple accents
Amazon Nova Sonic Audio Generation AI Model Released, Can Process Speech in Real-Time

Amazon Nova Sonic can be accessed via the company’s Bedrock console via an API

Photo Credit: Amazon

Amazon introduced a new artificial intelligence (AI) model in its flagship Nova family of models on Tuesday. Dubbed Amazon Nova Sonic, it is a voice generation model capable of generating human-like speech. However, it is not a text-to-speech (TTS) tool; instead, it can process voice input in real time and respond to it. The Seattle-based tech giant says developers can use the model to build conversational AI chatbots and similar tools. Notably, the Amazon Nova Sonic AI model also supports functional calling and tool use, making it compatible with agentic application developments as well.

Amazon Nova Sonic Is Available As an API

In a blog post, the tech giant announced the release of the Amazon Nova Sonic. The company said traditional approaches to voice-enabled applications use a complex with multiple models such as text recognition, speech-to-text conversion, data processing, and TTS models. This often leads to an increase in latency, and failure in preserving linguistic context, the post added.

Amazon said its approach with the Nova Sonic model was to unify speech understanding and speech generation components. The AI model is said to be able to process data and generate speech in real time, giving it a conversation-like experience. This unified system also allows the model to better understand the pace and timbre of input speech to contextualise the intent of the user.

Additionally, the AI model can understand different speaking styles as well as separate masculine and feminine-sounding voices in different accents. It can also understand when a user misspeaks, mumbles, or pauses while speaking. Amazon says the model can pick up speech even in a noisy setting.

Advertisement

In response generation, the company claims the model can be more expressive and human-like, and can adjust its response style to match the context of the conversation. Currently, the AI model only supports the English language. Amazon said support for more languages will be added soon. The model supports a context window of 32,000 tokens for audio, with an additional window to handle longer conversations. It has a default session limit of eight minutes.

To use the Nova Sonic model, developers can head to Amazon Bedrock and find it under the model access option. It can also be accessed via a bidirectional streaming application programming interface (API) that can both process audio input and generate output.

 

For the latest tech news and reviews, follow Gadgets 360 on X, Facebook, WhatsApp, Threads and Google News. For the latest videos on gadgets and tech, subscribe to our YouTube channel. If you want to know everything about top influencers, follow our in-house Who'sThat360 on Instagram and YouTube.

Advertisement

Related Stories

Popular Mobile Brands
  1. BSNL Announces Flash Sale in India With Free Data, Discounts
  2. Samsung Galaxy M36 5G Launching Today: All You Need to Know
  3. OTT Releases of the Week: Squid Game S3, Raid 2, Panchayat S4, and More
  4. Nothing Phone 3 Renders Leaked Ahead of July 1 Launch
  5. Redmi K Pad With 8.8-Inch Display, 7,500mAh Battery Unveiled: See Details
  6. Xiaomi's Pad 7S Pro With Xring O1 Processor Launched: All Details
  7. Nothing Phone 3 to Get 50-Megapixel Periscope Telephoto Camera
  8. Google Brings a Standalone App to Let You Try-On New Outfits Virtually
  9. Lumio Arc Projector Teased Ahead of Possible Amazon Prime Day Launch
  1. Xiaomi Pad 7S Pro With12.5-Inch Display and Xring O1 Processor Launched: Price, Specifications
  2. Xiaomi Watch S4 41mm With AMOLED Screen Launched Alongside Smart Band 10: Price, Specifications
  3. Google Releases Gemma 3n Open-Source AI Model That Can Run Locally on 2GB RAM
  4. Redmi K Pad With 8.8-Inch 3K Display, 7,500mAh Battery Launched: Price, Specifications
  5. Google Pixel Call Screening Feature Could Launch in India Soon With Support for Hindi: Report
  6. BSNL Teases Free Data, Broadband Deals and Discounts With Its Upcoming Flash Sale
  7. Walmart-Backed Flipkart Turns to Videos and Livestream to Woo Indian Online Shoppers
  8. Apple Changes App Store Rules in EU to Comply with Antitrust Order
  9. Capcom Showcases First and Third-Person Resident Evil Requiem Gameplay at Capcom Spotlight Livestream
  10. Telegram Bot Reportedly Spotted Selling Sensitive Personal Data of Indian Users
Gadgets 360 is available in
Download Our Apps
Available in Hindi
© Copyright Red Pixels Ventures Limited 2025. All rights reserved.