UPDATED 13:13 EDT / NOVEMBER 01 2011

NEWS

Hadoop, there it is: Here Comes Hortonworks

In what appears to be a change in direction, Hortonworks has released a completely open source Hadoop distribution based on Apache Hadoop that will compete head-on with Cloudera’s CDH3. The new distribution, called Hortonworks Data Platform, includes a new, open source management tool the company developed called Ambari and is currently available as a limited technology preview.

The reason I say this is a change of direction for the company, which was spun-out of Yahoo last summer, is because the message from CEO Eric Baldeschwieler and team until recently was that Hortonworks was going to focus strictly on Hadoop training and technical support, not on producing a distribution of its own.

ServicesANGLE

The idea was that, with its experience deploying and managing Yahoo’s enormous Hadoop cluster, Hortonworks would position itself as the only vendor that could help transition Hadoop early adopters from proof-of-concept deployments to full-on, enterprise-scale deployments. That is still part of Hortonworks’ message. Hortonworks also announced today a public Hadoop training course, as well as a number of other support services.

But Baldeschwieler decided the company also needed to develop a Hadoop distribution of its own. He explains the about-face in a blog post:

As we began to interact with enterprises and ecosystem partners, the one constant was the need for a base distribution of Apache Hadoop that is 100% open source and that contains the essential components used with every Hadoop installation.  A distribution was needed to provide an easy to install, tightly integrated and well tested set of servers and tools.

Hortonworks is also attempting to broaden the Hadoop ecosystem. The new distribution, which is based on Hadoop 0.20.205, includes HCatalog, a metadata management service, and other API’s aimed at making it easier for partners to integrate with Hortonworks Data Platform.

The company also unveiled a new partner program and an initial wave of partners. They include Informatica, the data integration specialist that just released a Hadoop-focused data transformation tool called HParser, and Tresata, a cloud-based Big Data analytics platform for banking that uses Hadoop under the covers to crunch massive data sets.

In a recent interview, Baldeschwieler told me Hortonworks is “completely committed to

Eric Baldeschwieler, CEO, Hortonworks

an open source business model” and “we are always going to ship Hadoop for free.” In other words, Hortonworks is basing a large part of its appeal, in addition to its experience supporting Yahoo, on the fact that its distribution is 100% open source, while Cloudera’s distribution includes some proprietary tools, including its cluster management console, Cloudera’s Services and Configuration Manager.

Hortonworks’ new distribution, partner program, and support/training services are a direct assault on market leader Cloudera. The timing is no coincidence either. Cloudera’s Hadoop World conference takes place next week in New York City, and undoubtedly Hortonworks is looking to steal some of Cloudera’s thunder in the run-up to the event.


A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU