

For Big Data to add value to a business, it’s crucial to have the systems in place to transform, integrate and deliver appropriate data to the end user. According to Donna Prlich, VP of Products and Solutions Marketing at Pentaho Corp., in an interview with theCUBE at Hadoop Summit, that’s where her company comes in.
“One of the keys of Pentaho data integration is that if you’re thinking about ingesting data, for instance from Hadoop, it’s not just about getting the data in, right?” she explained “Pentaho’s going to allow you to actually cleanse and transform that data and provide a level of governance. And then as you start to think about Hadoop and an environment where maybe you have data in an enterprise data warehouse, the transformations that go on, where you want to be able to blend data together. And the orchestration of all of that is all managed through Pentaho data integration.”
And as Big Data gets bigger, Pentaho is innovating to keep pace. The big news for the company, according to Prlich, involves the deployment Big Data in the Cloud.
“We’re seeing that a lot, so we have support for Amazon Web Services, adding support for SAP HANA. And we’ve also released a scalability study that we did at Pentaho just to… get a sense for our customers of how their environments would scale, and had some really interesting results there,” she explained. “And then we have pre-announced support of IBM Spark. In this release we actually support the orchestration of Spark jobs within Pentaho data integration. So lots of big news around emerging technologies, the Cloud, and this whole world of Big Data.”
How is this transforming businesses in practice? Prlich offered a use-case in the implementation at Landmark Halliburtan.
“They’re collecting sensor data, and they’re bringing that data together with other information about parts and maintenance, etc.,” she said. “They bring that data together and then they deliver it out through applications…. The guy who’s standing out on the oil rig, he’s got an embedded Pentaho dashboard, and he’s looking at this data, no clue about the complex transformations that are going on behind it. But that’s really the kind of the power of having that embedded for that specific user.”
That’s one big way data integration can add value for businesses, she said. “We talk a lot about the ‘embedded at the point of impact’ …. where do you need the data to be to actually be valuable and help that end user do their job?”
Watch the full interview below, and be sure to check out more of SiliconANGLE and theCUBE’s coverage of Hadoop Summit 2015.
THANK YOU