UPDATED 16:16 EDT / AUGUST 29 2012

Pervasive’s DataRush Unlocks High Utilization for Hadoop

For all its advantages in Big Data research, Hadoop has one major drawback, it struggles to achieve 20% server utilization, writes Jeff Kelly in his latest Wikibon Peer Incite, “Improving Hardware Efficiency Important to Overall Hadoop ROI”.  This makes large, production Hadoop installations running on hundreds of nodes into a drain on IT CapEx. Earlier this summer VMware announced Project Serengeti, including the contribution of code to Apache Hadoop to make HDFS and MapReduce “virtualization aware”, to support Hadoop virtualization to attack this issue. The only problem – each node of a Hadoop cluster needs its own copy of vSphere Enterprise Edition, which adds a major “virtualization tax” to the implementation.

Now Pervasive Software has introduced a potentially less expensive solution based on a Java framework called DataRush that allows multiple Hadoop jobs to run in parallel on commodity hardware in vanilla JVMs, according to Pervasive Senior Director of Business Development and Strategy David Inbar. A second product, RushAnalyzer, speeds up Big Data preprocessing, Inbar says.

Together they abstract away the complexity of parallelizing Hadoop jobs, allow users to monitor IO and CPU use in real time and mitigate memory constraints. Inbar claims server efficiency rates as high as 80%. The products are getting attention from systems integrators and consultancies and, says Kelly, should also be considered along with Project Serengeti by IT groups moving from proof-of-concept trials of Hadoop to large-scale implementations. If Pervasive’s products perform as promised, the savings in hardware costs, power and cooling, and data center floor space can be significant, without the “virtualization tax”.


A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.