UPDATED 18:45 EDT / JUNE 14 2017


Drowning in the data lake, companies seek out predictive analytics tools

Whether it’s shopping for a pair of shoes or running a large multi-tier data center, everyone is looking for value. The increasing complexity involved in managing the vast store of raw data, the data lake, has information technology executives looking for tools that can perform predictive analytics accurately and rapidly while making a real contribution to the bottom line.

“One of the best ways to get value out of the data is this notion of predicting what is going to happen in your world, with your customers and with the data you already have,” said Arun Murthy (pictured), founder and vice president of engineering at Hortonworks Inc.

Murthy visited theCUBE, SiliconANGLE’s mobile live-streaming studio, and answered questions from hosts Lisa Martin (@Luccazara) and George Gilbert (@ggilbert41) during DataWorks Summit in San Jose, California. They discussed how clients who are using Hortonworks are able to meet data management challenges and add value at the same time. (* Disclosure below.)

Client adds 30 percent throughput to cluster

Murthy described the experience of one large, unnamed financial services client who had hundreds of thousands of machines running on the Hortonworks Data Platform. The company used SmartSense, a proactive Hadoop-based monitoring service, to identify 25 machines with bad configurations. The result: The company added 30 percent throughput back on their cluster.

“At that scale, it’s a lot of money,” said Murthy.

There is also increasing interest in using tools like Apache Atlas to create a scalable, open framework for data governance. Murthy cited an example where Hortonworks has been working with partner IBM Corp. to help clients in Europe meet the requirements of the General Data Protection Regulation, a set of mandatory data protection standards that will go into effect in 2018.

“If you are not compliant by March of next year, you pay a portion of your revenue as fines,” Murthy said. “It’s a really big deal.”

Although Hortonworks was founded in 2011, Murthy has been working with Hadoop for longer.

“People have been looking for folks with 10 years of experience on Hadoop, and I’m here finally,” he said. “It’s been an amazing journey.”

Watch the complete video interview below, and be sure to check out more of SiliconANGLE’s and theCUBE’s independent editorial coverage of DataWorks Summit. (* Disclosure: Hortonworks Inc. sponsored this DataWorks Summit segment on SiliconANGLE Media’s theCUBE. Neither Hortonworks nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Show your support for our mission by joining our Cube Club and Cube Event Community of experts. Join the community that includes Amazon Web Services and Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger and many more luminaries and experts.

Join Our Community 

Click here to join the free and open Startup Showcase event.

“TheCUBE is part of re:Invent, you know, you guys really are a part of the event and we really appreciate your coming here and I know people appreciate the content you create as well” – Andy Jassy

We really want to hear from you, and we’re looking forward to seeing you at the event and in theCUBE Club.

Click here to join the free and open Startup Showcase event.