Hadoop, there it is: Here Comes Hortonworks

Hadoop, there it is: Here Comes Hortonworks

In what appears to be a change in direction, Hortonworks has released a completely open source Hadoop distribution based on Apache Hadoop that will compete head-on with Cloudera’s CDH3. The new distribution, called Hortonworks Data Platform, includes a new, open source management tool the company developed called Ambari and is currently available as a limited technology preview.

The reason I say this is a change of direction for the company, which was spun-out of Yahoo last summer, is because the message from CEO Eric Baldeschwieler and team until recently was that Hortonworks was going to focus strictly on Hadoop training and technical support, not on producing a distribution of its own.


The idea was that, with its experience deploying and managing Yahoo’s enormous Hadoop cluster, Hortonworks would position itself as the only vendor that could help transition Hadoop early adopters from proof-of-concept deployments to full-on, enterprise-scale deployments. That is still part of Hortonworks’ message. Hortonworks also announced today a public Hadoop training course, as well as a number of other support services.

But Baldeschwieler decided the company also needed to develop a Hadoop distribution of its own. He explains the about-face in a blog post:

As we began to interact with enterprises and ecosystem partners, the one constant was the need for a base distribution of Apache Hadoop that is 100% open source and that contains the essential components used with every Hadoop installation.  A distribution was needed to provide an easy to install, tightly integrated and well tested set of servers and tools.

Hortonworks is also attempting to broaden the Hadoop ecosystem. The new distribution, which is based on Hadoop 0.20.205, includes HCatalog, a metadata management service, and other API’s aimed at making it easier for partners to integrate with Hortonworks Data Platform.

The company also unveiled a new partner program and an initial wave of partners. They include Informatica, the data integration specialist that just released a Hadoop-focused data transformation tool called HParser, and Tresata, a cloud-based Big Data analytics platform for banking that uses Hadoop under the covers to crunch massive data sets.

In a recent interview, Baldeschwieler told me Hortonworks is “completely committed to

Eric Baldeschwieler, CEO, Hortonworks

an open source business model” and “we are always going to ship Hadoop for free.” In other words, Hortonworks is basing a large part of its appeal, in addition to its experience supporting Yahoo, on the fact that its distribution is 100% open source, while Cloudera’s distribution includes some proprietary tools, including its cluster management console, Cloudera’s Services and Configuration Manager.

Hortonworks’ new distribution, partner program, and support/training services are a direct assault on market leader Cloudera. The timing is no coincidence either. Cloudera’s Hadoop World conference takes place next week in New York City, and undoubtedly Hortonworks is looking to steal some of Cloudera’s thunder in the run-up to the event.

Jeffrey Kelly

As Wikibon’s lead Big Data analyst, Jeff Kelly applies a critical eye to trends and developments in the Big Data and business analytics markets, with a strong focus on helping practitioners deliver business value. Jeff’s research includes market analysis, emerging technologies, enterprise Big Data case studies, and more. He also appears frequently on theCUBE to share his insights. Prior to joining Wikibon, Jeff spent seven years as a writer and editor at TechTarget, where covered a number of business and IT topics including IT services, mobile computing, data management and business intelligence. He holds a BA from Providence College and an MA from Northeastern University.


Join our mailing list to receive the latest news and updates from our team.

Submit a Comment

Your email address will not be published. Required fields are marked *

You may use these HTML tags and attributes: <a href="" title=""> <abbr title=""> <acronym title=""> <b> <blockquote cite=""> <cite> <code> <del datetime=""> <em> <i> <q cite=""> <s> <strike> <strong>

Share This

Share This

Share this post with your friends!