UPDATED 09:00 EDT / SEPTEMBER 24 2019

BIG DATA

Cloudera debuts all-open-source integrated cloud data platform

Two months after adopting an all-open-source strategy, Cloudera Inc. today is announcing an integrated data platform made up entirely of open-source elements.

Cloudera Data Platform is being positioned as a one-stop-shopping cloud service for organizations that want to perform analytics across hybrid and multicloud environments with enterprise-grade security and governance.

The package combines a cloud-native data warehouse, a machine learning service and a data hub, each running as instances within self-contained operating environments called containers. Queries are managed by Apache Hive or Apache Impala, the latter of which was developed by Cloudera.

“The knock on Hadoop has always been its operational complexity and the fact that it’s difficult to use,” said Arun Murthy (pictured), Cloudera’s co-founder and chief product officer. “What we’ve invented is an experience that attacks both.”

The focus of the Cloudera Data Platform is on reducing the time it takes to install and configure the multiple elements needed to create a data warehouse, analytics workbench or machine learning training suite. By using existing components in the cloud, the platform cuts deployment times from weeks to hours, Murthy said. The software works natively on Amazon Web Services Inc. S3 data and supports the Hadoop Distributed File System.

“To date we’ve been offering a bunch of HDFS clusters and customers had to install their own extensions,” he said. “With Cloudera Data Platform these are all native services. You can set up a secure data lake in a couple of hours.”

The platform also leverages Cloudera’s Shared Data Experience, a unified data framework that includes schema, permissions and governance artifacts. It enables multiple users to work from the same data and catalog using the tools that they prefer and to migrate workloads to the cloud.

“We move not just the bits but the data, the metadata, the tables and the security protocols,” Murthy said. “It’s secure end-to-end and it’s fully open.”

The combination of real-time processing and predictive analytics enables applications such as real-time predictive billing, Murthy said. Such an application can alert customers to excessive charges accruing to their mobile phone accounts, for example as a result of leaving data services on while roaming.

Customers using Cloudera’s on-premises software can get a single view of both their local and cloud workloads. Cloudera Data Platform is currently a cloud-only service for workloads running on Amazon infrastructure.

An on-premises option, called CDP Data Center, will be available later this year, with annual subscriptions starting at $10,000 per node. A preview version for Microsoft Corp.’s Azure cloud is due in a few months, with support for Google LLC’s cloud likely to come early next year. Pricing information is published here.

Photo: SiliconANGLE
