UPDATED 20:38 EDT / NOVEMBER 15 2011

Apache Hadoop Gets its First Update in More Than a Year

Apache Hadoop has a new release, its first in more than a year. According to the Cloudera blog, the new release, Apache Hadoop 0.23.0, adds a number of features and improvements, including HDFS federation and a new MapReduce framework.

But as the Cloudera blog states, before you go too far with this new release, it is important to note that 0.23.0 is not a production release. Cloudera warns that it should not be deployed on a production cluster.

New Features

According to Cloudera:

HDFS federation improves HDFS scalability by allowing multiple independent namenodes, each managing a portion of the namespace. Each datanode in the cluster can provide storage to all the namenodes (which means datanodes do not, for example, belong to a single namenode). Note that HDFS federation is not to be confused with HDFS High Availability, which will be coming in a future 0.23 release.
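To make the federation idea more concrete, here is a minimal conceptual sketch in Python. It is not Hadoop code and does not use any Hadoop API: the `NameNode`, `DataNode` and `FederatedNamespace` classes, the mount table and the `write` helper are all hypothetical names invented for illustration. It only shows the two properties described above: each namenode manages its own portion of the namespace, and a single datanode can hold blocks on behalf of every namenode.

```python
from dataclasses import dataclass, field


@dataclass
class NameNode:
    """Hypothetical stand-in for one independent namenode."""
    name: str
    files: dict = field(default_factory=dict)  # path -> list of block IDs


@dataclass
class DataNode:
    """Hypothetical datanode that stores blocks for any namenode."""
    name: str
    blocks: dict = field(default_factory=dict)  # (namenode name, block ID) -> bytes


class FederatedNamespace:
    """Routes each path to the namenode that manages that part of the namespace."""

    def __init__(self, mounts):
        # mounts: path prefix -> NameNode (an illustrative mount table)
        self.mounts = mounts

    def resolve(self, path):
        # Choose the longest matching prefix so nested prefixes resolve predictably.
        matches = [
            prefix for prefix in self.mounts
            if path == prefix or path.startswith(prefix.rstrip("/") + "/")
        ]
        if not matches:
            raise KeyError(f"no namenode manages {path}")
        return self.mounts[max(matches, key=len)]


def write(namespace, datanode, path, data):
    """Record metadata on the owning namenode and the bytes on the shared datanode."""
    namenode = namespace.resolve(path)
    block_id = len(namenode.files.get(path, []))
    namenode.files.setdefault(path, []).append(block_id)
    datanode.blocks[(namenode.name, block_id)] = data
    return namenode.name


nn_user = NameNode("nn-user")
nn_logs = NameNode("nn-logs")
namespace = FederatedNamespace({"/user": nn_user, "/logs": nn_logs})
shared_datanode = DataNode("dn1")

owner_a = write(namespace, shared_datanode, "/user/alice/a.txt", b"hello")
owner_b = write(namespace, shared_datanode, "/logs/app.log", b"started")
```

In this sketch the two namenodes never see each other's metadata, yet both sets of blocks end up on the same datanode, which is the sense in which datanodes do not belong to a single namenode.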

MapReduce 2 (“next gen”) is a re-write of the MapReduce runtime to overcome scalability bottlenecks in the jobtracker. It is based on a new framework called YARN for cluster resource management, and a MapReduce “application” which runs users’ jobs on YARN. In this design MapReduce becomes a user-space library, and also allows other parallel applications to run on Hadoop clusters, beside MapReduce applications.
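The split between resource management and the MapReduce "application" can be illustrated with a small Python sketch. This is a toy model under stated assumptions, not the YARN API: `ResourceManager`, `run_word_count` and `run_sum_of_squares` are hypothetical names, and a "container" here is just a counter. The point is that the resource manager knows nothing about MapReduce, so a word count job and an unrelated parallel job can share the same cluster.

```python
from collections import Counter


class ResourceManager:
    """Hypothetical cluster-wide manager: hands out containers, unaware of MapReduce."""

    def __init__(self, total_containers):
        self.free = total_containers
        self.allocations = {}  # application name -> containers currently held

    def allocate(self, app_name, wanted):
        # Grant as many containers as are free, up to the requested number.
        granted = min(wanted, self.free)
        self.free -= granted
        self.allocations[app_name] = self.allocations.get(app_name, 0) + granted
        return granted

    def release(self, app_name):
        # Return every container held by the application to the free pool.
        self.free += self.allocations.pop(app_name, 0)


def run_word_count(rm, lines):
    """MapReduce as a user-space application: map each line, then reduce the counts."""
    granted = rm.allocate("wordcount", len(lines))
    if granted == 0:
        raise RuntimeError("no containers available")
    try:
        partials = []
        # Run map tasks in waves no larger than the number of granted containers.
        for start in range(0, len(lines), granted):
            wave = lines[start:start + granted]
            partials.extend(Counter(line.split()) for line in wave)
        total = Counter()
        for partial in partials:  # the reduce step merges per-line counts
            total.update(partial)
        return dict(total)
    finally:
        rm.release("wordcount")


def run_sum_of_squares(rm, numbers):
    """A non-MapReduce application using the same resource manager."""
    if rm.allocate("sum-of-squares", 1) == 0:
        raise RuntimeError("no containers available")
    try:
        return sum(n * n for n in numbers)
    finally:
        rm.release("sum-of-squares")


rm = ResourceManager(total_containers=2)
word_counts = run_word_count(rm, ["a b a", "b c", "a"])
squares = run_sum_of_squares(rm, [1, 2, 3])
```

Both applications request and release containers through the same manager, and neither needs any special support from it, which mirrors the idea of MapReduce becoming one user-space library among other parallel applications.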

There are some additional changes to MapReduce that should be noted. See the Cloudera blog post for more detail.

A common criticism of Hadoop is its slow pace of development. Reaching this new release matters because it gives the community a base to build on for related efforts such as BigTop.

