UPDATED 00:42 EDT / JULY 04 2016

NEWS

Sparkling Water 2.0 enables machine learning with Apache Spark

Enterprises have a tough time gathering insights from the vast oceans of data they accumulate, but a new tool for Apache Spark is hoping to change that, by allowing them to merge machine learning algorithms with the popular data processing engine.

Announced last week, Sparkling Water 2.0 is a newly updated tool created by a startup called H20.ai, formerly known as Oxdata Inc, which offers an open-source algorithm development platform of the same name. The tool is designed to make it simpler for companies to use machine learning algorithms in their data analysis. As such, Sparkling Water 2.0 is kind of like an API that lets users tap into H20’s open-source AI platform, instead of using Spark’s own MLlib machine-learning library.

In a statement, the company explained that Sparkling Water was designed to let users enjoy the best features of Spark alongside its own speed, columnar-compression and fully-featured machine learning algorithms. The tools also provides more flexibility for companies looking to find the best algorithms for specific use cases, simply by bringing more options to the table.

“Apache Spark’s MLlib offers a library of efficient implementations of popular algorithms directly built using Spark,” the company noted. But with Sparking Water, companies can also “use H2O algorithms in conjunction with, or instead of, MLlib algorithms on Apache Spark.”

As such, the tool is likely to appeal to both Spark and H20’s users, explained one analyst.

“Enterprises are looking to take advantage of a variety of machine learning algorithms to address an increasingly complex set of use cases when determining how to best serve their customers,” said Matt Aslett, Research Director, Data Platforms and Analytics at 451 Research. “Sparkling Water is likely to be attractive to H2O and Spark users alike, enabling them to mix and match algorithms as required.”

Sparking Water 2.0’s headline feature is it allows users to run both Spark and Scala through H20’s Flow user interface. In addition, it also brings a new visualization component to Spark’s MLlib, allowing users to see the results of their machine-learning algorithm powered analysis in a format that’s easier to digest.

The software supports the Apache Zeppelin notebook as well as Spark 2.0 and earlier editions, and offers production support for machine-learning pipelines.

H20.ai is also working on a project known as “Steam”, which it describes as a data science hub that allows data scientists and developers to collaboratively build, deploy and refine predictive applications across large scale data sets, eliminating much of the heavy lifting involved in DevOps. With Steam, developers and data scientists will be able to compare models across teams and move them into production without needing to perform any of the engineering work needed on the back end.

Image credit: ColiNOOB via Pixabay.com

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About SiliconANGLE Media

SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.

Sparkling Water 2.0 enables machine learning with Apache Spark

Image credit: ColiNOOB via Pixabay.com

A message from John Furrier, co-founder of SiliconANGLE:

LATEST FROM THECUBE

UPCOMING CUBE EVENTS

RECENT CUBE EVENTS

Pure Accelerate 2026

FinOps X 2026

Snowflake Summit 2026

Freshworks Refresh 2026

IBM Think 2026

Sparkling Water 2.0 enables machine learning with Apache Spark

Image credit: ColiNOOB via Pixabay.com

A message from John Furrier, co-founder of SiliconANGLE:

LATEST STORIES

LATEST STORIES

Pure Accelerate 2026

FinOps X 2026

Snowflake Summit 2026

Freshworks Refresh 2026

IBM Think 2026

Cookies