UPDATED 05:48 EDT / AUGUST 13 2015

Google adds Cloud Dataflow & Pub/Sub to its Big Data suite

Google has made two of its Big Data tools generally available, adding a data analysis framework and a real-time data stream management service to its cloud portfolio.

These include Google Cloud Dataflow, which offers a framework for merging data from different sources into one data processing pipeline and theoretically does much the same job as Hadoop. Dataflow and Google Cloud Pub/Sub now stand alongside Google BigQuery in the company’s Big Data cloud suite.

We first saw Dataflow at the 2014 Google I/O developer conference in San Francisco. When it launched as an Alpha release, Google said the service was built to analyze data pipelines with especially large datasets and could crunch that data in either batch or streaming mode. Following the Alpha release, Google added an open-source SDK for Java that made it easier to port the service to other programming languages. We last heard of Dataflow in April, when Google released an updated beta version.

Less well known is Google Cloud Pub/Sub, a tool that integrates applications and services so their data streams can be analyzed in real time. Although Google has been quieter about Cloud Pub/Sub’s development, its Cloud product management team said on Wednesday that the service has been made available after 10 years of internal innovation.
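For readers unfamiliar with the publish/subscribe pattern that gives the service its name, here is a minimal in-memory sketch in Python. It is not Google's API: the `ToyBroker` class, its methods and the `orders` topic are all hypothetical names invented for illustration, and a real service would add durability, acknowledgements and network delivery.

```python
from collections import defaultdict
from typing import Callable, DefaultDict, List


class ToyBroker:
    """Hypothetical in-memory broker; illustrates the pattern, not Cloud Pub/Sub."""

    def __init__(self) -> None:
        # Maps each topic name to the handlers subscribed to it.
        self._subscribers: DefaultDict[str, List[Callable[[str], None]]] = defaultdict(list)

    def subscribe(self, topic: str, handler: Callable[[str], None]) -> None:
        """Register a handler to be called for every message on a topic."""
        self._subscribers[topic].append(handler)

    def publish(self, topic: str, message: str) -> int:
        """Deliver a message to every subscriber; return how many received it."""
        handlers = self._subscribers.get(topic, [])
        for handler in handlers:
            handler(message)
        return len(handlers)


# Two independent consumers receive the same message from one publish call.
broker = ToyBroker()
received: List[str] = []
audit_log: List[str] = []
broker.subscribe("orders", received.append)
broker.subscribe("orders", lambda m: audit_log.append(f"logged: {m}"))
delivered = broker.publish("orders", "order-1001 created")
print(delivered, received, audit_log)
```

The publisher does not need to know who is listening, which is what lets one stream feed several applications at once.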

“Traditional approaches require separate queueing, notification, and logging systems, each with their own APIs and tradeoffs between durability, availability, and scalability,” Rohit Khare and Eric Schmidt, Google’s Cloud product managers, wrote in a blog post.

According to them, Cloud Pub/Sub can address multiple scenarios via a single API at a rate of one million message operations per second. Pricing for the service starts at $0.05 per million message operations. Dataflow, meanwhile, will be priced per job, depending on how long the job takes to complete and how much data must be moved around.
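To put the quoted Pub/Sub starting price in perspective, here is a rough back-of-the-envelope sketch in Python. It assumes a flat $0.05 per million operations with no volume tiers or free quota, which the article does not confirm, and it treats the one-million-operations-per-second figure as a sustained workload purely for illustration. The function name is hypothetical.

```python
# Starting price quoted in the article; a flat rate is an assumption.
PRICE_PER_MILLION_OPS_USD = 0.05


def estimate_pubsub_cost(message_operations: int) -> float:
    """Estimate the cost in USD for a number of message operations at a flat rate."""
    if message_operations < 0:
        raise ValueError("message_operations must be non-negative")
    return message_operations / 1_000_000 * PRICE_PER_MILLION_OPS_USD


# Hypothetical workload: one hour sustained at one million operations per second.
ops_per_hour = 1_000_000 * 3600
print(f"${estimate_pubsub_cost(ops_per_hour):.2f} per hour")  # prints "$180.00 per hour"
```

Under those assumptions, an hour at the quoted rate would cost about $180. Actual bills would depend on Google's full pricing schedule.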

In other news, Google has also certified Cloudera Inc.’s Hadoop distribution on its Google Cloud Platform. The Intel-backed Big Data firm released Cloudera Director 1.5, a suite for deploying and managing its Hadoop distro in the cloud. Cloudera Director 1.5 already supported Amazon Web Services’ cloud, and will now work in tandem with Cloudera’s Enterprise and Express versions.

These developments cap a big week for Google. The search engine giant surprised Silicon Valley watchers earlier this week when it announced it was restructuring itself under a new parent company called Alphabet Inc., with Google Search, Cloud and various other divisions being reorganized as subsidiaries.

Photo Credit: Stuck in Customs via Compfight cc
