UPDATED 14:00 EDT / OCTOBER 01 2014

After a year of incubation, Apache welcomes Storm into the Hadoop family

Colorful Sonoran Desert Storm Real-time analysis in Hadoop moved a step closer to enterprise reality on Monday after the Apache Software Foundation promoted Storm to a top-level project. The status upgrade charts a future for the continued development of the platform under the same community umbrella that governs the batch processing framework and its sprawling component ecosystem.

Like most of the other technologies in the extended Hadoop family, Apache Storm started as an internal project at a web company that encountered a challenge no existing solution could adequately address. In this particular case, the firm in question was a little-known social media marketing startup known as BackType Inc., which momentarily appeared on the industry radar after becoming part of Twitter Inc. in August 2011.

The company had already open-sourced much of its homegrown analytics technology by the time of the acquisition, namely the ElephantDB key-value store for exporting information from Hadoop and Cascalog, a library designed to simplify work in the data processing framework. A week after announcing the purchase, Twitter revealed that it would continue BackType’s community-giving tradition and release the source code for the last remaining ace up the startup’s sleeve under a free license. The rest is history.

Storm became an Apache Incubator process last September and garnered the support of several industry heavy hitters over the course of its 12-month induction process, including Cisco Systems Inc., engineering powerhouse PARC and Yahoo Inc., the birthplace of Hadoop. Along the way, the real-time event processor has been enhanced with features such as support for the YARN resource management and scheduling tool, which enables different types of workloads to run in the same deployment so as to make fast-paced stream analysis more practical from a cost standpoint.

Hadoop distributor Hortonworks Inc. – itself a supporter of Storm – only recently started working on integrating YARN with Spark, the other real-time data crunching engine that has been making waves in the ecosystem lately. Yet while there are significant technological overlaps between the projects, they’re built for two fundamentally different tasks.

The product of UC Berkley’s AMPLab, Spark is designed to reduce processing times for the traditional batch workloads Hadoop was built to handle. Storm, in contrast, operationalizes the framework to analyze fast-moving data streams such as tweets and sensory input. The technology, therefore, fills a much bigger functionality gap in the upstream component ecosystem that makes its graduation especially significant for the future development of Hadoop. Expect more and more vendors to start integrating their products with the project and contribute code now that it has reached stable ground.

photo credit: Striking Photography by Bo Insogna via photopin cc

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About SiliconANGLE Media

SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.

After a year of incubation, Apache welcomes Storm into the Hadoop family

photo credit: Striking Photography by Bo Insogna via photopin cc

A message from John Furrier, co-founder of SiliconANGLE:

LATEST FROM THECUBE

UPCOMING CUBE EVENTS

RECENT CUBE EVENTS

Pure Accelerate 2026

FinOps X 2026

Snowflake Summit 2026

Freshworks Refresh 2026

IBM Think 2026

After a year of incubation, Apache welcomes Storm into the Hadoop family

photo credit: Striking Photography by Bo Insogna via photopin cc

A message from John Furrier, co-founder of SiliconANGLE:

LATEST STORIES

LATEST STORIES

Pure Accelerate 2026

FinOps X 2026

Snowflake Summit 2026

Freshworks Refresh 2026

IBM Think 2026