Meta Voicebox Unveiled as New Text-to-Speech Generative AI Model: All Details

Meta's Voicebox is claimed to deliver audio clips using a two-second audio sample.

Written by Nithya P Nair, Edited by Siddharth Suvarna | Updated: 20 June 2023 14:30 IST
Highlights
  • Meta Platforms introduced Voicebox
  • Voicebox is a machine-learning model that can generate speech
  • It supports six languages

Voicebox is claimed to generate audio samples 20 times faster than Microsoft's VALL-E

Photo Credit: Meta

Meta last week announced Voicebox, an artificial intelligence (AI) tool that can generate speech from text. The latest tool from the Facebook parent is claimed to produce high-quality audio clips and edit pre-recorded audio while preserving the content and style of the audio. It is said to be multilingual, delivering speech in six languages. The machine learning model can be used for noise removal as well, and can replace misspoken words without the need to re-record an entire speech. The new text-to-speech model joins a wave of recent generative AI tools that includes ChatGPT and DALL-E.

Facebook's parent company Meta unveiled Voicebox via a blog post last week. This new generative AI model can perform speech generation tasks — like editing, sampling, and stylising. It is claimed to deliver audio clips from a two-second audio sample and edit pre-recorded audio while keeping the content and style of the audio.

The text-to-speech model is said to handle tasks like noise removal, content editing, style conversion, and diverse sample generation. According to Meta, it can modify any part of a given sample and recreate a portion of speech that is interrupted by noise such as car horns or barking dogs. The AI model can also be used to replace misspoken words without having to re-record an entire speech.

Voicebox can synthesise speech across six languages — English, French, Spanish, German, Polish, and Portuguese. It can create a reading of the text in any of those languages, even when the sample speech and the text are in different languages.

Voicebox is claimed to outperform Microsoft's VALL-E and generate audio samples 20 times faster. "Our results show that speech recognition models trained on Voicebox-generated synthetic speech perform almost as well as models trained on real speech, with 1 percent error rate degradation as opposed to 45 to 70 percent degradation with synthetic speech from previous text-to-speech models," Meta AI detailed in a research paper. A few audio samples have also been shared to show how Voicebox works.
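
To put the quoted degradation figures in perspective, here is a minimal sketch of how such a percentage could be computed. It assumes the figure refers to the relative increase in a recogniser's error rate when it is trained on synthetic rather than real speech; the quote does not spell this out. The function name and every number below are hypothetical illustrations, not values from Meta's paper.

```python
def relative_degradation(real_error_rate: float, synthetic_error_rate: float) -> float:
    """Relative increase in error rate, in percent.

    Assumption: "degradation" means the relative change versus a model
    trained on real speech. The quoted text does not define the term.
    """
    if real_error_rate <= 0:
        raise ValueError("real_error_rate must be positive")
    return (synthetic_error_rate - real_error_rate) / real_error_rate * 100.0


# Hypothetical error rates (in percent), chosen only to illustrate the scale.
real_speech_error = 5.0

# An error rate of 5.05 is roughly a 1 percent relative degradation.
print(round(relative_degradation(real_speech_error, 5.05), 2))

# An error rate of 7.25 is a 45 percent relative degradation.
print(round(relative_degradation(real_speech_error, 7.25), 2))
```

Under this reading, a 1 percent degradation leaves the error rate almost unchanged, while a 45 to 70 percent degradation makes it roughly one and a half times the baseline or more.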

In the blog, Meta further claims that Voicebox can generate speech that is more representative of how people talk in the real world in the aforementioned six languages. The company believes that this capability could be used to generate synthetic data to help better train a speech assistant model in the near future.

Voicebox is currently under development and is not available to the public. Meta says it recognises that, like other recent AI innovations, this technology carries the potential for misuse and unintended harm. To mitigate these risks, the company is said to be working on an effective classifier that can distinguish between authentic speech and audio generated with Voicebox.

