Pentaho Moves Big Data Integration Project to Open Source Apache

Pentaho is open sourcing its Big Data integration engine, known as Pentaho Kettle, moving the entire project to the Apache License Version 2.0. Kettle was previously available under the GNU Lesser General Public License.

Kettle includes Pentaho’s extract, transform and load (ETL) engine, which moves structured and unstructured data between Big Data sources such as Hadoop and HBase without coding. Kettle can perform ETL jobs both inside and outside Hadoop clusters, and includes Spoon, a graphical user interface that developers use to set up Hadoop MapReduce jobs, run Pig scripts, and perform Hive queries.

Apache is, of course, home to a slew of Big Data projects – including the aforementioned Hadoop and HBase, as well as a number of NoSQL-related projects – and, by adding Kettle to the mix, Pentaho is hoping to spur wider adoption by Big Data developers.

The move “will foster success and productivity for developers, analysts and data scientists giving them one tool for data integration and access to discovery and visualization,” said Matt Casters, Founder and Chief Architect of Pentaho’s Kettle Project.

Pentaho already has partnerships with virtually all of the vendors attempting to commercialize Hadoop, including Cloudera, Hortonworks, MapR and EMC. Pentaho has also set up a Kettle resource page.

About Jeffrey Kelly

As Wikibon’s lead Big Data analyst, Jeff Kelly applies a critical eye to trends and developments in the Big Data and business analytics markets, with a strong focus on helping practitioners deliver business value. Jeff’s research includes market analysis, emerging technologies, enterprise Big Data case studies, and more. He also appears frequently on theCUBE to share his insights. Prior to joining Wikibon, Jeff spent seven years as a writer and editor at TechTarget, where he covered a number of business and IT topics including IT services, mobile computing, data management and business intelligence. He holds a BA from Providence College and an MA from Northeastern University.