UPDATED 17:22 EDT / OCTOBER 23 2014

ebay CEO John Donahoe NEWS

eBay joins open-source community with ultra-fast OLAP engine for Hadoop

ebay CEO John Donahoe

ebay CEO John Donahoe

Like arch-rival Amazon.com, the soon-to-split eBay Inc. is something of an oddity in that it hasn’t historically been a big contributor to the open-source community. But the e-commerce pioneer hopes to change that with the release of the source-code for a homegrown online analytics processing (OLAP) engine that promises to speed up Hadoop while also making it more accessible to everyday enterprise users.

Dubbed Kylin, the platform was developed after eBay failed to find a solution to help it effectively address the rapid growth in the volume and diversity of data generated by its customers, a story that is familiar to other contributors to the Hadoop community. Kylin optimizes the storage of information by leveraging existing technologies whenever possible from the upstream component ecosystem.

By default, data is stored in Apache Hive, which layers a familiar SQL interface on top of Hadoop that allows business workers to harness the distributed analytics capabilities of the system without having to learn the nuances of the native MapReduce execution paradigm. When Kylin comes across certain repetitions in the rows and columns inside the sub-project – such as a particular product appearing multiple times with different prices – it maps that data into key-value pairs which are then whisked off to Apache Hive, which is another component designed with that specific type of workload in mind.

Specifically, Hive provides random access to information that Kylin exploits to avoid having to sequentially scan tens or hundreds of billions of rows in Hive whenever an eBay employee looks up a certain business detail. That has helped to significantly improve response times at the company, with eBay claiming that the technology handles certain queries in less than a second, allowing truly interactive analytics.

Topping off that performance advantage are a number of complementary features such as integration with popular business intelligence tools like Tableau Inc.’s wildly popular data visualization platform, storage compression and monitoring. Future versions of Kylin will also add better support for more processing paradigm, eBay promises, including multidimensional and hybrid OLAP.

Kylin is not as groundbreaking as some of the other emerging Hadoop add-ons that have been making headlines recently, but it does address an important pain point currently holding back traditional enterprises from taking advantage of the batch processing framework. It’s these kinds of relatively mundane but vital knots that the upstream community must smooth out as more and more corporate deployments of the project move from pilot to production, a mission that eBay’s first major contribution brings an important step forward.


A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU