
Hume Introduces Interpretability-Based Voice Control Feature for AI Voice Customisation

With Hume’s AI tool, developers can choose from 10 voice dimensions to create the desired voice for AI chatbots.

Written by Akash Dutta, Edited by Siddharth Suvarna | Updated: 3 December 2024 18:53 IST
Highlights
  • Developers can control gender, confidence, assertiveness, and more
  • Hume’s Voice Control tool is currently available in beta
  • In the future, Hume plans to increase its range of base voices

Instead of prompts, Hume’s tool uses a slider to control different elements of voices

Photo Credit: Hume

Hume, a New York-based artificial intelligence (AI) firm, unveiled a new tool on Monday that will allow users to customise AI voices. Dubbed Voice Control, the new feature is aimed at helping developers integrate these voices into their chatbots and other AI-based applications. Instead of offering a large range of voices, the company offers granular control over 10 different dimensions of voices. By selecting the desired parameters in each of the dimensions, users can generate unique voices for their apps.

Hume Voice Control Tool

The company detailed the new AI tool in a blog post. Hume stated that it is trying to solve the problem enterprises face in finding the right AI voice to match their brand identity. With this feature, developers can customise different aspects of how a voice is perceived and create a more assertive, relaxed, or buoyant voice for AI-based applications.

Hume's Voice Control is currently available in beta, and it can be accessed by anyone registered on the platform. Gadgets 360 staff members were able to access the tool and test the feature. Developers can adjust 10 different dimensions: gender, assertiveness, buoyancy, confidence, enthusiasm, nasality, relaxedness, smoothness, tepidity, and tightness.

Instead of offering prompt-based customisation, the company has added a slider that goes from -100 to +100 for each of the dimensions. The company stated that this approach was taken to eliminate the vagueness associated with the textual description of a voice and to offer granular control over the voices.
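
To illustrate the slider model described above, here is a minimal Python sketch of how an application might store and validate such settings. It is not Hume's API: the `VoiceSettings` class, its `set` method, and the choice to clamp out-of-range values are assumptions made for illustration. Only the 10 dimension names and the -100 to +100 range come from the article.

```python
from dataclasses import dataclass, field

# The 10 dimension names as listed in the article.
DIMENSIONS = (
    "gender", "assertiveness", "buoyancy", "confidence", "enthusiasm",
    "nasality", "relaxedness", "smoothness", "tepidity", "tightness",
)

# Slider range reported in the article.
SLIDER_MIN, SLIDER_MAX = -100, 100


@dataclass
class VoiceSettings:
    """Hypothetical container for slider values; not Hume's actual API."""

    # Every dimension starts at the neutral midpoint of the slider.
    values: dict = field(
        default_factory=lambda: {name: 0 for name in DIMENSIONS}
    )

    def set(self, dimension: str, value: int) -> None:
        """Store a slider value, clamping it to the allowed range."""
        if dimension not in DIMENSIONS:
            raise KeyError(f"unknown dimension: {dimension}")
        # Clamping is an assumption here; a real tool might reject the value.
        self.values[dimension] = max(SLIDER_MIN, min(SLIDER_MAX, value))


settings = VoiceSettings()
settings.set("assertiveness", 60)
settings.set("relaxedness", 250)  # out of range, stored as 100
```

A fixed set of numeric fields like this is easier to validate and reproduce than a free-text description, which is the advantage the company attributes to sliders over prompts.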


In our testing, we found that changing any of the 10 dimensions makes an audible difference to the AI voice, and that the tool was able to disentangle the different dimensions correctly. The AI firm claimed that this was achieved by developing a new “unsupervised approach” which preserves most characteristics of each base voice when specific parameters are varied. However, Hume did not detail the source of the data it procured.

Notably, after creating an AI voice, developers will have to deploy it to their application by configuring Hume's Empathic Voice Interface (EVI) AI model. While the company did not specify which version, the EVI-2 model was likely used for this experimental feature.


In the future, Hume plans to expand the range of base voices, introduce additional interpretable dimensions, enhance the preservation of voice characteristics under extreme modifications, and develop advanced tools to analyse and visualise voice characteristics.

 

© Copyright Red Pixels Ventures Limited 2025. All rights reserved.