UPDATED 08:00 EDT / APRIL 02 2019

CLOUD

MapR separates Kubernetes storage and compute to boost container flexibility

MapR Technologies Inc. is stepping up its efforts to hasten its customers’ moves to software containers with a new set of features in its MapR Data Platform that separates computing from storage.

MapR described the enhancements, announced today, as deep integrations with the core components of Kubernetes, which is the open-source software that orchestrates applications running in containers.

Containers make it simple to encapsulate applications in a form that’s easy to run on any computing environment in companies’ data centers or in public clouds. Kubernetes, which some people have called the operating system for the cloud, is expected to be in use by 90 percent of enterprises by the end of the year.

MapR said the new features make it easier for organizations to manage highly elastic workloads by enabling them to separately scale compute and storage. The platform will initially support Apache Spark and Apache Drill, which are two popular open-source analytics frameworks, “but this is just the beginning,” said Suzy Visvanathan, a senior director at MapR. “We will continue to build this out. The idea is to have a whole ecosystem.”

MapR has been on a campaign to align itself closely with Kubernetes since it announced support for persistent storage and stateful containerized applications a year ago. “There are things Kubernetes doesn’t do well, like provisioning, multitenancy and snapshots,” Visvanathan said. “We’re giving customers the ability to run Kubernetes in a production environment.”

Separating compute and storage enables workloads to be more appropriately provisioned according to the needs of each use case, she said. “Let’s say one of your users suddenly has a peak workload; how do you make sure others aren’t throttled when one user has 90 percent of the CPU?” she said. “You need to separate compute and storage subscriptions.”

The enhancements enable Spark and Drill processing engines to be deployed within compute containers orchestrated by Kubernetes. Each workload in a Kubernetes cluster is independent of where the data is stored or managed. Independent versions of Spark can be deployed in separate pods, which is the Kubernetes term for a group of containers that are deployed together on the same host. This enables multiple stages of development, testing, and quality assurance to co-exist within a cluster.

The company is introducing an approach it calls multitenancy in the compute layer to make such intricacies “agnostic to the end user,” Visvanathan said. The technology makes it possible for users to creates tenant namespaces for compute applications, enabling each container to get the resources it needs without infringing upon other containers in the cluster. Tenants can point to a storage cluster located elsewhere.

“If Sally needs four cores, 256 gigabytes of memory and a 2-terabyte volume, those resources can be provided only for user Sally and only user Sally knows it,” Visvanathan said. “You can scale in and scale out your compute jobs independent of scaling your storage.”

That’s in contrast to the approach Hadoop took the early days of big data by closely aligning storage and compute. The intent was to minimize latency, but the ultimate effect was to create a lot of excess computing capacity to accommodate duplicate data for tasks such as disaster recovery, Visvanathan said.

The enhancements are set to ship in the second quarter. The company hasn’t yet decided whether to make the available as a separate product or as an in-line enhancement to the data platform.

Image: Pixabay

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About SiliconANGLE Media

SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.

MapR separates Kubernetes storage and compute to boost container flexibility

Image: Pixabay

A message from John Furrier, co-founder of SiliconANGLE:

LATEST FROM THECUBE

UPCOMING CUBE EVENTS

RECENT CUBE EVENTS

KubeCon + CloudNativeCon EU 2026

RSAC 2026 Conference

Nvidia GTC 2026

Google Cloud AI Agents in Action Series 2025/2026

MWC Barcelona 2026

MapR separates Kubernetes storage and compute to boost container flexibility

Image: Pixabay

A message from John Furrier, co-founder of SiliconANGLE:

LATEST STORIES

LATEST STORIES

KubeCon + CloudNativeCon EU 2026

RSAC 2026 Conference

Nvidia GTC 2026

Google Cloud AI Agents in Action Series 2025/2026

MWC Barcelona 2026

Cookies