

John Furrier and Jeff Kelly, theCUBE co-hosts, sat down with Jack Norris, CMO of MapR, for a chat revolving around issues such as the adoption of Hadoop, the improvements in clients’ performance and the Enterprise-grade efforts.
What seems to have changed in this year’s event compared to the previous year, is the mentality, notes Norris. It’s no longer “if” we should use Hadoop, but “how”. At least that is his impression, because enterprises have now become very interested in adopting new technologies, once they realized the way it could impact their production scale.
MapR is future-proofing the Hadoop cluster for organizations who look at tech an try to figure out what it means for their investment. “The announcement with Fusion-io was that we’re 25 x faster on reading HBase applications,” Norris says. “As organizations are deploying Hadoop, and looking at technology changes, they can rest assured that they’ll be able to take advantage of those in a much more aggressive fashion with MapR than with other distributions.”
”Enterprise-grade is more the UX than a marketing claim,” Norris explains. “We are talking about the capabilities and features that they’ve grown to expect – the ability to meet full SLA, full HA, recovery from multiple failures, rolling upgrades, data protection with consistent snapshots, business continuity, ability to share a cluster across multiple groups. There’s a host of features that fall under the umbrella of Enterprise-grade. And when you move from no support to any of those features to support to a few of them, that’s not going to HA, it’s more like moving to low availability.”
Picking on the term, Furrier asked Norris to expand a bit on the idea of “low availability”. “If you have an HA solution that can recover from multiple failures, that’s downtime. If you have an HBase application that’s running online, and you have data that goes down and it takes 10 to 30 minutes to have the RegionServers recover from another place in the distribution, that’s downtime,” says Norris.
“If you have snapshots that aren’t consistent across the cluster, that doesn’t provide data protection; there’s no point in time recovery for a cluster. So, it’s basically down to interruptions, downtime and the potential of losing data. Our answer is that you need a series of features that are hardened and proven to deliver that.”
MapR prides itself with extensive competencies in recoverability. “Right now there’s no point in time recovery for HDFS and HBase tables,” clarifies Norris. ”There’s snapshot support, and snapshots are being compared to copy table.” MapR develops, distributes, and supports a distribution of Apache Hadoop that addresses many of the enterprise quality issues currently limiting its implementation in production data centers. MapR replaces the Hadoop Distributed File System (HDFS) with one that eliminates weak points in HDFS, but is fully compatible with MapReduce, HDFS and HBase.
As MapR has announced before, Apache Hadoop could be made more enterprise ready by:
There are three main areas that the Organizations are taking into account when adopting Hadoop:
”What companies will start to realize is that instead of having large data moving across different sylos, they’re going to be faster, more agile, and more competitive by processing data in one place, sending small results sets,” says Norris. That puts the spotlight on what data platform is out there that can support a broad set of applications and have a broad set of functionality. “We are delivering an Enterprise-grade, mission critical support platform that supports map reduce, does high-performance, provides NFS, integrades Enterprise-grade, NoSql applications, so that the customers can do high-speed, consistent performance and real-time operations in addition to batch, streaming, integrated search,” he concludes.
THANK YOU