UPDATED 20:30 EDT / AUGUST 01 2019

BIG DATA

Cataloging is the data platform of the future, says IDC analyst

Algorithms that read customers’ minds and predict a market’s future are the sirens luring today’s companies to big data. But before any of that is possible, there’s some unglamorous work to be done excavating the necessary raw resources. Finding out where data is and preparing it for prime time is a critical first step. Pre-existing silos and multicloud can leave companies with a lot of disparate spaces to scavenge through.

The most sensible place to start may be with the available data about all that data — or metadata, according to Stewart Bond (pictured), research director at IDC Research Inc. “That’s why we’ve seen such a jump in the number of vendors that are providing data cataloging solutions,” he said.

Bond spoke with Dave Vellante (@dvellante) and Paul Gillin (@pgillin), co-hosts of theCUBE, SiliconANGLE Media’s mobile livestreaming studio, during the MIT CDOIQ Symposium in Cambridge, Massachusetts. They discussed data cataloging as the best hope for handling big data in multicloud (see the full interview with transcript here).

Spider legs go farthest in multicloud

All types of data initiatives — from monetization to artificial intelligence to governance — require some way to find, label and organize massive data sets. Companies are realizing that poorly cleansed or inaccurately labelled data are resulting in inaccurate insights. And vendors are rushing to the rescue. The number of vendors offering cataloging solutions has increased about 240% in the last year and a half, according to Bond’s research.

Selecting data to train models to make accurate predictions is challenging even when it’s all in one place. Vendors are trying a number of ways to manage all of the dispersed data that enterprises want to analyze. There is software for data integration, data intelligence, data profiling and more. It is the “spidering” of data cataloging that has the most promise, Bond explained.

Multicloud has flung data all over the place. Effective software must have spider legs that can reach out and quickly gather intelligence about it. Data cataloging may do this with machine learning, human annotation, Google-like search features, etc.

“I think that’s going to be the data platform of the future,” Bond stated.
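
To make the spidering idea concrete, here is a minimal sketch in Python. It is an illustration only, not any vendor's product or Bond's method: the source names, the `CatalogEntry` type and the `crawl`, `annotate` and `search` helpers are all hypothetical, and in-memory dictionaries stand in for real cloud and on-premises stores. The crawler records metadata about each data set (where it lives, what it is called, which fields it has), a person can add tags, and a simple keyword search runs over that metadata rather than over the data itself.

```python
from dataclasses import dataclass, field


@dataclass
class CatalogEntry:
    """Metadata about one data set; the data itself is not stored here."""
    source: str
    name: str
    columns: list
    tags: set = field(default_factory=set)


def crawl(sources):
    """Visit every source and record the name and field names of each data set."""
    catalog = []
    for source_name, datasets in sources.items():
        for dataset_name, rows in datasets.items():
            columns = sorted({key for row in rows for key in row})
            catalog.append(CatalogEntry(source_name, dataset_name, columns))
    return catalog


def annotate(catalog, name, tag):
    """Human annotation: attach a free-text tag to every entry with this name."""
    for entry in catalog:
        if entry.name == name:
            entry.tags.add(tag)


def search(catalog, keyword):
    """Keyword search over names, field names and tags (case-insensitive)."""
    keyword = keyword.lower()
    hits = []
    for entry in catalog:
        haystack = [entry.name, *entry.columns, *entry.tags]
        if any(keyword in text.lower() for text in haystack):
            hits.append(f"{entry.source}/{entry.name}")
    return sorted(hits)


# Hypothetical stand-ins for data scattered across clouds and silos.
SOURCES = {
    "cloud_a": {"orders": [{"order_id": 1, "customer_email": "a@example.com"}]},
    "cloud_b": {"tickets": [{"ticket_id": 7, "email": "b@example.com"}]},
    "on_prem": {"inventory": [{"sku": "X1", "quantity": 3}]},
}

catalog = crawl(SOURCES)
annotate(catalog, "inventory", "warehouse")
print(search(catalog, "email"))      # ['cloud_a/orders', 'cloud_b/tickets']
print(search(catalog, "warehouse"))  # ['on_prem/inventory']
```

A real catalog would swap the dictionaries for connectors to each store and might use machine learning to suggest tags, but the shape is the same: gather metadata from everywhere, enrich it, and make it searchable.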

Informatica Corp. currently leads in this market, according to Bond. Hyperscaler clouds Amazon Web Services Inc. and Google Cloud Platform have recently brought out data-intelligence offerings, he added.

Photo: SiliconANGLE
