The O’Reilly Strata Conference in Santa Clara is abuzz with the promise of making analytics power more accessible than ever before through new applications of Big Data. Tresata, the brainchild of former Bank of America data science chief and theCUBE regular Abhishek Mehta, is leading the way with a new software solution that runs on Apache Spark.
NET 1.0 is one of the first commercial apps to utilize the open source clustering system, which executes queries up to 100 faster than MapReduce using a sophisticated in-memory processing engine. Tresata says that its solution allows business users to harness the technology via a “compelling and easy to use” interface that visualizes patterns in large datasets. The offering advances Mehta’s plan to gain a head start in the emerging market for prepackaged Hadoop applications, a segment he predicts will take off in the next four years.
“The value in the ecosystem sits really high up the stack, in what we call advanced analytics applications, and the market has taken time to get there,” the executive remarked in an interview on theCUBE last year. “With this massive race to zero in large parts of the historical data analytics stack where billions are currently being made, that is now going to transition over to the highest point of the stack to what we call analytics applications, predictive analytics software that works in Hadoop.”
Revolution Analytics is also working make analytics more accessible, but to a different audience. The firm is targeting data scientists with a new cloud-based version of its enterprise R distribution that allows users to start crunching large volumes of data without a major upfront capital investment in infrastructure. The software is now available now through the AWS Marketplace for $1.25 per core per hour.
Bringing self-service to the database
Following the example of Amazon, Hewlett-Packard on Monday introduced a marketplace of its own that gives Vertica customers access to a catalog of Big Data connectors, extensions and a community portal for engaging others in the install base. The store also features a number of homegrown add-ons, including the Distributed R development platform, a sentiment analysis tool called Pulse and the Place geospatial analytics pack.
The update underlines the company’s continued efforts to push back against fast-growing rivals such as Sqrrl, which is making big gains with its ultra-secure Accumulo database. Just this week, the startup announced that Medyear, the operator of the world’s first personal health exchange, deployed the key-value store in a bid to more easily meet privacy standards for managing medical records.
Sqrrl vice president of business development Ely Kahn commented that “shifts in healthcare are dramatically increasing the need for a person to possess their own health care records and to share their records as they see fit. This introduces a complex array of new privacy norms. Through our partnership with Medyear, we are building the modern tools necessary to do so confidently and securely.”