UPDATED 09:14 EDT / FEBRUARY 08 2017

APPS

Microsoft enhances voice recognition with Custom Speech Service tool

Microsoft Corp. has given developers a new machine learning tool for improving speech recognition, launching the public beta of its Custom Speech Service on Tuesday.

Custom Speech Service is designed to overcome some of the most common problems in speech recognition systems, such as differences in people’s accents and vocabulary, and background noise. The system allows developers to build custom language models that adapt to each user’s unique way of speaking and to the specific vocabulary of each application. It can also adapt its acoustic models to specific environments, or to the number of people using an application, Microsoft said.

“Beneath the hood, the Custom Speech Service leverages an algorithm that shifts Microsoft’s existing speech recognizer to the developer-supplied data,” Microsoft Research’s John Roach said in a blog post. “By starting from models that have been trained on massive troves of data, the amount of application-specific data required is greatly reduced. In cases where the developer’s data is insufficient, the recognizer falls back on the existing models.”
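The idea Roach describes, starting from a general model, shifting it toward developer-supplied data and falling back when that data is too sparse, can be illustrated with a toy sketch. This is not Microsoft's algorithm, which the article does not detail. The function name `adapt_unigram`, the `min_tokens` threshold and the weighting constant of 1,000 are all hypothetical choices made for illustration, and a simple word-frequency model stands in for a real speech recognizer.

```python
from collections import Counter


def adapt_unigram(base_probs, app_text, min_tokens=50):
    """Blend a base word-frequency model with counts from application text.

    Toy illustration only (hypothetical, not Microsoft's method): the weight
    given to the application data grows with the amount of data, and with
    too little data the base model is returned unchanged.
    """
    tokens = app_text.lower().split()
    if len(tokens) < min_tokens:
        # Insufficient application data: fall back on the existing model.
        return dict(base_probs)

    counts = Counter(tokens)
    total = len(tokens)
    # Interpolation weight in (0, 1); more application data means more trust.
    # The constant 1000.0 is an arbitrary assumption for this sketch.
    lam = total / (total + 1000.0)

    vocab = set(base_probs) | set(counts)
    return {
        word: (1 - lam) * base_probs.get(word, 0.0) + lam * counts[word] / total
        for word in vocab
    }


if __name__ == "__main__":
    base = {"the": 0.5, "a": 0.5}
    # Too little data: the base model comes back unchanged.
    print(adapt_unigram(base, "torque wrench"))
    # Enough data: industry jargon such as "torque" gains probability mass.
    print(adapt_unigram(base, "torque " * 100))
```

In this sketch a word that never appeared in the base model, such as factory jargon, receives probability only once enough application text has been supplied, which mirrors the fallback behavior described in the quote.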

The acoustic modeling capabilities, meanwhile, are designed to enable speech recognition in some of the noisiest environments, such as a factory floor. The algorithm picks out users’ speech amid the background noise, while prioritizing jargon that might be associated with a specific industry.

Alongside Custom Speech Service, Microsoft announced two other cognitive tools, the Bing Speech API and Content Moderator, both of which will be available in March.

The Bing Speech API is designed to transcribe live or recorded speech into text and to convert text into speech, paving the way for apps that can talk back to users. In addition, the API can be used to create voice-enabled applications that wake up when users speak a certain command.

As for Content Moderator, it is used to detect profanity in text in more than 100 languages. The service is also able to spot phishing URLs, personally identifiable information and malware. Finally, it can analyze images and videos for offensive or unwanted content, including pornographic material.

Image courtesy of Microsoft

