UPDATED 10:28 EST / APRIL 26 2018

BIG DATA

Dremio adds machine learning to self-service data analytics platform

Dremio Corp. is folding machine learning into its self-service data analytics platform, boosting performance and integrating with Looker Data Science Inc.’s namesake business intelligence query engine.

Dremio claims to eliminate the need for time-consuming data transformation tasks needed to load information into data warehouses, multidimensional cubes and aggregation tables for business intelligence uses. Its Data Reflections technology, which is based upon the open-source Apache Arrow columnar in-memory query engine, physically optimizes representations of source data for rapid query processing. Dremio separates compute and storage capabilities and uses in-memory processing to optimize performance and cost.

Enhancements in this release can accelerate processing by a factor of up to 1,000, the company said. The query planner now automatically selects the best reflections to accelerate queries for ad hoc requests, business intelligence and data science workloads.

Dremio can also now automatically detect the star and snowflake schemas, which are logical arrangements of data in multidimensional tables commonly used in data warehousing scenarios. “If there is a notional sense of a star or snowflake schema, we can detect it and optimize queries so they run at interactive speed without requiring you to load data into a cube or warehouse,” said Chief Executive Tomer Shiran.

The new release includes a management engine that automatically optimizes the priority, ordering and queuing of reflection refreshes along with error recovery. With this release, users can also now access cloud object stores such as Amazon Web Services Inc.’s S3 and Microsoft Corp.’s Azure Data Lake Store. Enhancements to Apache Arrow in this release provide up to a 60 percent reduction in query latency.

Learning by doing

The new Dremio Learning Engine makes recommendations based upon patterns it observes in user queries over time. “For example, if I’m working with a particular data set, Dremio can automatically recommend to me another data set I didn’t know about that is suitable to be combined with the data I’m working with,” Shiran said.

Machine learning is also used to observe data during query execution to detect schema changes in source systems and adapt the data catalog automatically. This is particularly useful when querying data from unstructured sources like unstructured text and NoSQL databases, where schemas can vary from record to record. Dremio can also intelligently cache and index metadata into its catalog, taking into account user access patterns.

With this release, Dremio is courting users of the rapidly growing Looker platform (pictured), which simplifies data extraction processes by pulling in data directly from multiple sources for visualizations. “Looker is designed primarily to work with one relational database at a time,” Shiran said. “Now Dremio can make things like MongoDB and ElasticSearch appear like a relational database. You get all the benefits of Looker without the need for a relational source, and you can perform joins across multiple data sources.”

Dremio recently raised $25 million in venture capital, bringing its total funding to $40 million. It provides both open-source and enterprise versions of its analytics engine. Licensing is per-node on an annual subscription basis, but the company declined to provide specifics.

Image: Dremio

A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU