UPDATED 15:14 EDT / JUNE 22 2017

BIG DATA

If streaming data’s breaking up databases, what can put them together?

As pretty as it sounds, “streaming data” in an actual database is not the elegant, unbroken chain that data scientists and developers wish it were, according to Itamar Ankorion (pictured, right), chief marketing officer at Attunity Inc.

“You’re inherently building an architecture that takes what was originally a database, but you’re kind of, in a sense, breaking it apart into partitions as you’re loading it over time,” he explained in an interview at DataWorks Summit in San Jose, California. 

Ankoria and Arvind Rajagopalan (pictured, left), director of global technology services at Verizon Communications Inc., spoke with Lisa Martin (@Luccazara) and George Gilbert (@ggilbert41), co-hosts of theCUBE, SiliconANGLE Media’s mobile livestreaming studio. (* Disclosure below.)

The partitions that result from ingesting continuous or incremental data streams can be awkward to pull together for cohesive analytics, Ankoria explained. Attunity is attempting to solve this problem with the announcement of Compose for Hive (Hive is Apache data warehouse software), which automates data lake processes.

“It reassembles these partitions, and it then creates analytic-ready datasets back in Hive. It can create operational data stores; it can create historical data stores, so then the data becomes formatted in a manner that’s more easily accessible for users who want to use analytic tools, BI [business intelligence] tools,” Ankorian said.

Knocking down ERP walls

Verizon struggled for some time to bring its three separate Enterprise Resource Planning systems’ data together, Rajagopalan said.

Standard reporting in each of the ERP systems was mature and adequate, however, “When you want to look at combining all of the data, it’s very hard,” he said.

The company gave both Oracle Corp. and SAP SE Hana in-memory data platform a shot; results in both cases were disappointing due to high cost, manual tasks and minimal self-service options, according to Rajagopalan.

Verizon ultimately decided on Attunity’s data integration and management software. “It has the intelligence to look at and understand the proprietary data structures of the ERPs, and it’s able to bring all the data from the ERP source systems directly into [Apache] Hadoop without any stops or staging databases along the way,” he said. (Hadoop is an open-source-based software used for storing, processing and analyzing big data.)

Rajagopalan added that its Replicate CDC (Change-Data-Capture) tool helps developers perform ad-hoc analytics.

Watch the complete video interview below, and be sure to check out more of SiliconANGLE’s and theCUBE’s independent editorial coverage of DataWorks Summit. (* Disclosure: Attunity Inc. sponsored this DataWorks Summit segment on SiliconANGLE Media’s theCUBE. Neither Attunity nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU