UPDATED 10:03 EDT / FEBRUARY 17 2016

NEWS

Watch #SparkSummitEast on theCUBE for real-time coverage of real-time Hadoop

This week theCUBE goes to Apache Spark Summit East 2016 (#SparkSummitEast) for two days of continuous coverage, today from 10:00 a.m. to 5:00 p.m. ET and tomorrow from 10:00 a.m. to 3:00 p.m. Coming just 44 days after the January 4 release of Apache Spark v. 1.6.0, billed as a stable version, the conference marks the start of a shift in the discussion of this core part of the Apache Hadoop big data stack, away from technological development and business-impact theory and toward production real-time big data business analysis systems running on Hadoop.

The May 31, 2014, release of Apache Spark was a seminal moment in the development of big data and arguably of the IT industry as a whole. Before Spark, Apache Hadoop’s main data analysis program was MapReduce, a disk-based batch-processing platform that limited big data analysis to deep insight applications.

Spark, developed initially by the AMPLab at the University of California, Berkeley, and donated to the Apache Software Foundation, changed that dynamic by opening possibilities for near-real-time analysis of unstructured and semi-structured data. Spark allows users to load data into a Hadoop cluster’s memory and query it repeatedly, making it well-suited to machine learning applications. It supports Hadoop YARN and Apache Mesos cluster management and a variety of distributed storage systems, including the Hadoop Distributed File System (HDFS), Cassandra, OpenStack Swift, Amazon S3 and Kudu.

Last year Spark was the most active project in the Apache Software Foundation and one of the most active in the entire open source big data ecosystem, with more than 1,000 contributors, including IBM, which has made a major commitment to it.

Interviewees on theCUBE are scheduled to include Databricks CTO and creator of Apache Spark Matei Zaharia (@matei_zaharia), Databricks Inc. co-founder Reynold Xin (@rxin), Hortonworks Inc.’s Arun Murthy (@acmurthy), and IBM VP of Engineering Anjul Bhambhri (@AnjulBhambhri). Watch live to see what the key players in Apache Spark are saying, and join the conversation in the #SparkSummit CrowdChat, already live, where you can post questions and comments for Wikibon co-founder David Vellante (@dvellante) and SiliconANGLE founder John Furrier (@furrier) to ask on air.

Image courtesy Spark Summit
