UPDATED 14:10 EDT / FEBRUARY 18 2016

NEWS

Demystifying the complexities of Spark and Hadoop | #SparkSummit

There is a lot of confusion around about Spark and Hadoop, but Matthew Hunt, head of Big Data at Bloomberg LP, thinks this is only due to a lack of understanding.

In an interview at Spark Summit East 2016 at the New York Hilton Midtown in NYC, Hunt talked with Jeff Frick and George Gilbert, cohosts of theCUBE from the SiliconANGLE Media team, to demystify the complexities of working with Spark and Hadoop, as well as provide an analysis of how Spark and Hadoop fit within the market.

Hadoop developed to answer a practical problem

In a conversation that delved deep into the mechanics of Big Data frameworks, Hunt explained the differences between Spark and Hadoop, answering where they come from, where they are going, what they do today and how they fit together.

For example, Hadoop was created to resolve a practical problem — how to download and index the web economically — by engineers rolling up their sleeves to solve real issues. As the platform grew, layers were added on top and the complexity and number of tools grew, yet the instruction set remained basic. Spark was developed with a more complicated instruction set that increased its speed yet simplified the many tools of Hadoop into one.

Spark takes language constructs and makes them performant

Hunt gave a practical example to help viewers create a mental model of Spark: When you compile a program, you write code, hit a button and the computer turns it into machine-level instructions. The same thing happens in Spark — it has an instruction set under the hood where whatever you are writing in is transformed.

Gilbert summarized this as Spark “taking language constructs and making it performant.”

Mental model shift

People assume what will be in a fast computation engine, but they are often not accurate and this causes confusion, said Hunt. “There is a mental model shift, and there are pieces that haven’t come together yet to make that happen.”

Watch the full video interview below, and be sure to check out more of SiliconANGLE and theCUBE’s coverage of Spark Summit East 2016. You can also join in on the conversation by CrowdChatting with theCUBE hosts.

Photo by SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About SiliconANGLE Media

SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.

Demystifying the complexities of Spark and Hadoop | #SparkSummit

Hadoop developed to answer a practical problem

Spark takes language constructs and makes them performant

Mental model shift

Photo by SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

LATEST FROM THECUBE

UPCOMING CUBE EVENTS

RECENT CUBE EVENTS

Pure Accelerate 2026

FinOps X 2026

Snowflake Summit 2026

Freshworks Refresh 2026

IBM Think 2026

Demystifying the complexities of Spark and Hadoop | #SparkSummit

Hadoop developed to answer a practical problem

Spark takes language constructs and makes them performant

Mental model shift

Photo by SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

LATEST STORIES

LATEST STORIES

Pure Accelerate 2026

FinOps X 2026

Snowflake Summit 2026

Freshworks Refresh 2026

IBM Think 2026