UPDATED 13:00 EDT / OCTOBER 20 2025

WellSaid pushes AI speech forward with faster, more natural voice production

WellSaid Labs Inc., a developer of lifelike artificial intelligence voice technology, today unveiled the next generation of its enterprise platform, introducing major upgrades to its audio studio that make producing natural, customizable speech faster and more intuitive.

WellSaid’s core technology synthesizes natural, human-like voices using a proprietary AI model named Caruso. The model was trained on exclusively licensed audio from professional voice actors, not on public sources. The company uses it to produce natural speech in various styles that can account for accent, timbre, pronunciation and other expressive elements.

“Today, enterprises require an AI voice solution that ships faster, sounds better and enables the ability to scale quickly while meeting compliance standards,” said Chief Technology Officer Chris Johnson.

The company upgraded its Studio product to make speech production easier, with instant previews and fewer clicks needed to deliver speech audio. Users can fine-tune the pitch, pace and loudness of voices at the word level, and can also add multiple voices to one script or dialogue.

To bring high-quality audio to customers, WellSaid made audio up to 96 kilohertz the new standard, which produces natural clarity and more readily captures intonation and stress across synthesized voices.

Users can also design phonetic spellings for acronyms, brand names and borrowed terms. The platform provides smart suggestions for potential phonetic spellings to help users get pronunciation right.
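As a rough illustration of the idea behind phonetic spellings, the sketch below swaps written terms for respellings before the text would reach a speech engine. It is not WellSaid's API or file format: the `LEXICON` entries, the respelling style and the `apply_lexicon` helper are all hypothetical names chosen for this example.

```python
import re

# Hypothetical lexicon mapping a written term to a respelling.
# Entries and respelling style are illustrative assumptions only.
LEXICON = {
    "SQL": "sequel",
    "IEEE": "eye triple E",
}


def apply_lexicon(script: str, lexicon: dict[str, str]) -> str:
    """Replace whole-word matches of each lexicon term with its respelling."""
    if not lexicon:
        return script
    # Try longer terms first so a longer term wins over a shorter one it contains.
    terms = sorted(lexicon, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(t) for t in terms) + r")\b")
    return pattern.sub(lambda m: lexicon[m.group(1)], script)


print(apply_lexicon("The IEEE paper covers SQL tuning.", LEXICON))
```

The word-boundary match keeps a term such as "SQL" from being rewritten inside a longer word like "MySQL", which is one reason a per-term lexicon tends to be safer than plain string replacement.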

The platform covers a wide variety of terms from numerous industries, including more than 9,000 medical terms, 500 legal terms and thousands more used across the healthcare, aviation and industrial sectors. All of them are backed by Oxford Dictionary guidance.

The company also added 36 new voices covering global languages, including Arabic, Turkish, Persian and 18 dialects, to enable localization of content. WellSaid also offers a large library of English-speaking voices that includes accents such as Australian, British, Canadian, Irish and various regional accents from the United States.

AI-powered voice generation has seen rapid growth as lifelike and expressive custom speech becomes increasingly sought after across industries. The market was valued at $3.5 billion in 2023 and is projected to reach $21.8 billion by 2030, according to market analysis firm Grand View Research. Companies such as Eleven Labs Inc. and Hume AI Inc. have raised significant capital this year, $180 million and $50 million respectively, for voice cloning and generation technologies.
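For a sense of scale, the two market figures above imply a compound annual growth rate of roughly 30%. The short sketch below shows the arithmetic. It assumes seven compounding years between 2023 and 2030, and the `implied_cagr` helper is a name introduced here for illustration. The rate the research firm itself reports may differ if it uses a different base year.

```python
def implied_cagr(start_value: float, end_value: float, years: int) -> float:
    """Compound annual growth rate implied by a start value, end value and span."""
    return (end_value / start_value) ** (1 / years) - 1


# Figures from the article, in billions of dollars.
rate = implied_cagr(3.5, 21.8, 2030 - 2023)
print(f"Implied annual growth: {rate:.1%}")
```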

AI-generated voices now power intelligent voice agents that can understand natural speech, respond conversationally and take action on users’ behalf. This trend has fueled a surge in voice-enabled technologies that sound strikingly human across devices, over the phone and in customer interactions.

With human-like AI voices, companies can handle routine inquiries around the clock, schedule appointments and personalize outreach. Voice-capable AI agents can also contact customers directly with updates and tailor interactions to individual preferences and histories.

Looking ahead, WellSaid plans to introduce additional upgrades, including a usage insights dashboard and enhanced pronunciation and performance tools for managing emphasis, cues, variability and breath control.

Image: SiliconANGLE/Microsoft Designer
