UPDATED 00:06 EDT / FEBRUARY 19 2016

NEWS

Google’s Cloud Vision takes image recognition to the next level

by Mike Wheatley

Google has thrown another new AI tool into its developer’s box in the form of its Cloud Vision API. The beta release of Cloud Vision, which had been available in limited preview since last December, is the latest in a flurry of AI-related announcements from Silicon Valley giants, as Google goes head to head with companies like Microsoft and IBM in a race to dominate this emerging niche.

Google’s Cloud Vision API brings the concept of machine learning to images for the first time. Using the tool, developers can build applications and robots that are capable of recognizing image content for the first time. For example, show it a picture of a banana and the bot will call it what it is. Alternatively, you could tell your robot to single out the smiling faces from those that are frowning, and it’ll give you the answer faster than you can click your fingers.

The software, which is also used by Google to power Google Photos, can detect or identify hundreds of different objects, colors and facial expressions in a given image, for example flowers, food, animals, notable landmarks and so on. There are other potential uses too, such as being able to detect inappropriate content (such as pornography) from crowdsourced images (as Google’s SafeSearch does), analyzing people’s emotions, detecting logos, reading text, and many more.

In a blog post, Ram Ramanathan, Product Manager of Google Cloud Platform, said the Cloud Vision API is available for anyone to submit and analyze their images during its unspecified beta timeframe. Users can submit up to 20 million images per month, with pricing dependent on your image’s volume and content detection requirements

Google claims that already, “thousands of companies” have used the API since it came out in preview last December, generating millions of requests for image annotations. One of its biggest users is the social photo editing app PhotFy, which relies on Cloud Vision to moderate over 150,000 photos on a daily basis, weeding out those that contain inappropriate content like pornography and violence.

The release of Cloud Vision into beta follows a promise from Google CEO Sundar Pichai last October that the company is going to prioritize its machine learning efforts this year. Google has already released its TensorFlow machine learning technology, which powers Google Search, to the open-source community.

One reason why Google is prioritizing its machine learning efforts is that the AI sector seems to be red hot at the moment, and just about every major tech company is trying to carve out a niche for itself. Google faces stiff competition from the likes of IBM and Microsoft in the race to create the best machine learning tools. For example, IBM open-sourced a rival to Google’s TensorFlow software called SystemML last November, while Microsoft recently released a bunch of its own machine learning tools onto GitHub, including software capable of recognizing people’s emotions based on their facial expressions in images.

The scramble to dominate AI is an interesting race in itself, and all the more so for developers and robotics creators, who can make use of a whole host of tools that didn’t exist just a few months ago.

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About SiliconANGLE Media

SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.