UPDATED 17:12 EDT / AUGUST 25 2017

EMERGING TECH

Google open-sources speech command dataset for TensorFlow

Alphabet Inc.’s TensorFlow machine learning framework and AIY do-it-yourself artificial intelligence teams have released a dataset of more than 65,000 utterances of 30 different speech commands, giving developers a powerful toolset to implement their own simple voice controls without having to build everything from scratch.

Pete Warden, a software engineer on the Google Brain Team, said in a blog post Thursday that although open-source speech recognition systems such as Kaldi can use neural networks to build powerful voice features, they can also be overly sophisticated for developers who only need basic voice functionality for their programs. According to Warden, the new speech data set offers a quick way to implement simple voice commands.

“The dataset is designed to let you build basic but useful voice interfaces for applications, with common words like ‘Yes,’ ‘No,’ digits, and directions included,” said Warden. “The infrastructure we used to create the data has been open-sourced too, and we hope to see it used by the wider community to create their own versions, especially to cover underserved languages and applications.”

According to Warden, the results of using the new dataset will depend on whether the speech patterns needed for a program are included in the set, but he also said that the dataset will become more versatile “as the community contributes improved models to TensorFlow.” Warden also said that Google hopes that the community will add more accents and dialects to the dataset.

While Google has plenty of its own AI projects, the company has also been looking to put AI capabilities in the hands of more people. For example, Google launched its do-it-yourself AI initiative, AIY Projects, back in May when it shipped a free AIY voice kit with the physical edition of The MagPi, the official magazine for the Raspberry Pi minicomputer. The goal behind AIY Projects is to make AI more accessible to developers and tech hobbyists, and the team plans on releasing other kits in the future.

TensorFlow, Google’s open-source machine learning framework, has also released a series of tutorials to help developers get started on AI, including one tutorial specifically for AI-powered audio recognition. Warden said that with the latest version of TensorFlow, developers can download the speech dataset and train a voice model in just a few hours.

Photo: Google

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About SiliconANGLE Media

SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.

Google open-sources speech command dataset for TensorFlow

Photo: Google

A message from John Furrier, co-founder of SiliconANGLE:

LATEST FROM THECUBE

UPCOMING CUBE EVENTS

RECENT CUBE EVENTS

Oracle Data Deep Dive NYC 2026

HPE World Quantum Day 2026

Qlik Connect 2026

Nutanix .NEXT 2026

KubeCon + CloudNativeCon EU 2026

Google open-sources speech command dataset for TensorFlow

Photo: Google

A message from John Furrier, co-founder of SiliconANGLE:

LATEST STORIES

LATEST STORIES

Oracle Data Deep Dive NYC 2026

HPE World Quantum Day 2026

Qlik Connect 2026

Nutanix .NEXT 2026

KubeCon + CloudNativeCon EU 2026

Cookies