UPDATED 09:00 EDT / APRIL 07 2025

PyannoteAI raises $9M for its speech processing AI

French artificial intelligence startup pyannoteAI SAS today announced that it has raised $9 million in funding to enhance its technology.

Crane Venture Partners and Serena led the seed investment. They were joined by Alexis Conneau, chief executive of venture-backed AI startup WaveForms Inc., and Hugging Face Inc. Chief Technology Officer Julien Chaumond.

Founded last year, pyannoteAI offers an open-source AI toolkit of the same name for transcribing speech. The software supports multiple languages and can automatically perform speaker diarization, the process of attributing each section of a transcript to the relevant speaker. It's a task AI models usually struggle to perform reliably.
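To illustrate what attributing transcript sections to speakers involves, here is a minimal Python sketch. It is not pyannoteAI's implementation: the `Turn` and `Segment` types, the `attribute_speakers` function and the sample data are all hypothetical, and the sketch assumes speaker turns and transcript segments are already available with timestamps. Each segment is simply assigned to the speaker whose turns overlap it the most.

```python
from typing import NamedTuple


class Turn(NamedTuple):
    """A span of time during which one speaker talks (hypothetical type)."""
    start: float
    end: float
    speaker: str


class Segment(NamedTuple):
    """A timestamped piece of transcript text (hypothetical type)."""
    start: float
    end: float
    text: str


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Return the number of seconds shared by two time spans."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def attribute_speakers(segments, turns, unknown="UNKNOWN"):
    """Label each segment with the speaker whose turns overlap it longest."""
    labeled = []
    for seg in segments:
        totals = {}
        for turn in turns:
            shared = overlap(seg.start, seg.end, turn.start, turn.end)
            if shared > 0:
                totals[turn.speaker] = totals.get(turn.speaker, 0.0) + shared
        # Fall back to a placeholder label when no turn overlaps the segment.
        speaker = max(totals, key=totals.get) if totals else unknown
        labeled.append((speaker, seg.text))
    return labeled


# Made-up example data for illustration only.
turns = [Turn(0.0, 4.0, "SPEAKER_00"), Turn(4.0, 9.0, "SPEAKER_01")]
segments = [
    Segment(0.5, 3.5, "Thanks for joining."),
    Segment(3.8, 8.0, "Happy to be here."),
]
labeled = attribute_speakers(segments, turns)
```

The hard part in practice is producing accurate speaker turns in the first place, especially when voices overlap. This sketch only covers the final bookkeeping step.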

Under the hood, pyannoteAI’s AI toolkit runs on multiple internally developed neural networks. It also features pipelines, software workflows that help prepare audio data before it’s processed by the models. Companies can fine-tune the toolkit’s individual components on their internal datasets to improve performance.

On the occasion of today’s funding announcement, pyannoteAI disclosed that its open-source software is downloaded more than 45 million times per month. The company’s installed base includes more than 100,000 developers. It generates revenue with a paid version of its open-source AI toolkit that includes more advanced capabilities.

According to pyannoteAI, its commercial offering is twice as fast as the open-source edition. The software also provides a 20% accuracy increase, which allows it to more reliably distinguish speakers in audio recordings. The model can tell voices apart even if several people speak at the same time.

Customers can upload files with up to 24 hours of audio to the commercial version of pyannoteAI’s software. According to the company, its platform automatically identifies recurring speakers across files to reduce the need for manual transcript editing.

To mitigate the impact of potential accuracy issues, pyannoteAI’s software assigns a confidence score to each transcript segment that it generates. The lower the confidence score, the greater the risk that the AI made a mistake. This feature allows customers to quickly spot errors in lengthy transcripts without a time-consuming manual review.
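A short Python sketch shows how a customer might use such scores to narrow a review. The field names, the 0-to-1 score range and the 0.8 threshold are assumptions made for illustration. The article does not describe the actual output format of pyannoteAI’s software.

```python
from typing import TypedDict


class ScoredSegment(TypedDict):
    """Hypothetical shape of a transcript segment with a confidence score."""
    speaker: str
    text: str
    confidence: float  # assumed to range from 0.0 (unsure) to 1.0 (certain)


def flag_for_review(segments, threshold=0.8):
    """Return the indexes of segments whose confidence is below the threshold."""
    return [i for i, seg in enumerate(segments) if seg["confidence"] < threshold]


# Made-up example data for illustration only.
transcript: list[ScoredSegment] = [
    {"speaker": "SPEAKER_00", "text": "Let's begin.", "confidence": 0.97},
    {"speaker": "SPEAKER_01", "text": "Sounds good.", "confidence": 0.42},
    {"speaker": "SPEAKER_00", "text": "First item.", "confidence": 0.80},
]
to_review = flag_for_review(transcript)
```

With this data, only the second segment falls below the threshold, so a reviewer would check one segment instead of three.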

Organizations can access pyannoteAI’s platform through an application programming interface or deploy it on their own infrastructure. According to the company, the software supports the major public clouds and bare-metal servers.

“We’re bringing enterprise-grade speaker intelligence AI to businesses that depend on voice data,” said pyannoteAI co-founder and CEO Vincent Molina. “Our goal is to make speaker-aware AI as seamless and universal as speech itself.”

The company plans to invest its newly raised capital in product development initiatives. It’s building features that will make it possible to split an audio file into multiple files that each feature only a single speaker. Additionally, pyannoteAI will enable customers to run its AI models on a broader range of devices.
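The idea behind splitting a recording by speaker can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the feature pyannoteAI is building: the `split_by_speaker` function is hypothetical, audio is represented as a plain list of samples, and the speaker turns are assumed to be known already.

```python
from collections import defaultdict


def split_by_speaker(samples, sample_rate, turns):
    """Group audio samples into one track per speaker.

    `turns` is a list of (start_seconds, end_seconds, speaker) tuples.
    """
    tracks = defaultdict(list)
    for start, end, speaker in turns:
        first = int(round(start * sample_rate))
        last = int(round(end * sample_rate))
        # Append this turn's samples to the end of the speaker's track.
        tracks[speaker].extend(samples[first:last])
    return dict(tracks)


# Made-up example: four seconds of audio at four samples per second.
samples = list(range(16))
turns = [(0.0, 1.0, "A"), (1.0, 3.0, "B"), (3.0, 4.0, "A")]
tracks = split_by_speaker(samples, sample_rate=4, turns=turns)
```

A slicing approach like this copies overlapping speech into every speaker's track that covers it. Isolating one voice while several people talk at once requires separating the audio signal itself, which this sketch does not attempt.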

Photo: Unsplash

