UPDATED 08:00 EDT / JANUARY 17 2012

RainStor logo NEWS

RainStor Runs Its Database Natively on Hadoop

RainStor logo Today RainStor announced that it is releasing a version of its database that runs natively on Apache Hadoop called RainStor Big Data Analytics on Hadoop. This will enable users to query data stored in Hadoop using both SQL and MapReduce. Also, thanks to the advanced compression techniques users will be able to store more data in less space, reducing server overhead and reducing the time and complexity of backups, replication and other activities.

RainStor’s main product is a relational database and has focused mostly on providing cloud backups. It has a big client list, including household names like AT&T, Bank of America, Merck, Pfizer and more.

RainStor Big Data Analytics on Hadoop is not a connector – the database runs natively on top of Hadoop. Because it runs on the Hadoop stack, there’s no need to pipe data in and out of the Hadoop Distributed File System in order to run SQL or even MapReduce queries on it.

Here’s an illustration of how it works:

Rainstor illustration

Rainstor claims to be able to compress data by 40 fold, and also claims huge performance boosts in querying large datasets through its query optimization and filtering techniques.

One of the big advantages of this approach, other than the speed and compression, is that it enables users to choose between either SQL or MapReduce. That means that business analysts or data scientists not familiar with MapReduce can analyze data using a language they already understand. But serious Hadoop developers can still use MapReduce if they want.

One thing this doesn’t really do is streaming/complex event processing (CEP). RainStor CEO John Bantleman describes the product as a complement to CEP rather than a replacement.

RainStor Big Data Analytics on Hadoop addresses some of the big problems with Hadoop, namely the complexity of interacting with it, the time it takes to run MapReduce jobs on large datasets and the complexity of running large clusters. In many ways it reminds me of HPCC, which is an open source alternative to Hadoop that uses its own SQL-like language for . RainStor shows the strength of the Hadoop ecosystem. Companies ranging from HStreaming to Tresata are building on Hadoop rather than trying to replace it with something else.


A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.