UPDATED 19:11 EDT / MARCH 14 2018

BIG DATA

Hot data cataloging collapses the thousand steps to value

Data cataloging is emerging as one of the surest routes through the big data muddle in enterprises. Its powers lay in combining and collapsing steps that would otherwise run end-users ragged.

“There are a lot of data catalog products out there, but our huge differentiator — one of them — is the fact that we have integrated data preparation,” said Chris Selland (pictured), vice president of strategic alliances and corporate development at Unifi Software Inc. This removes a huge burden from users’ hands, he explained. A single monolithic receptacle for every data point in the universe is not what’s taking shape in most enterprises. Instead, we are seeing networks of data spring up.

“In reality, some of the data’s in the warehouse; some of the data’s in the lake; some of the data’s in SaaS applications; some of the data’s in block storage. And where is all of that data, what is it, and what can I do with it?” Selland asked. Those are the questions that built-in data preparation answers, he stated.

Selland spoke with George Gilbert (@ggilbert41) and Lisa Martin (@LuccaZara), co-host of theCUBE, SiliconANGLE Media’s mobile livestreaming studio, at the BigData SV event in San Jose, California. They discussed the folly of data dumping, preparation versus governance, and GDPR compliance.

Data dumps put off until tomorrow …

Dispersed data networks seem more complex than a single data lake — that was the thinking behind early Apache Hadoop projects, Selland pointed out. “It was just like, let’s just dump it all into Hadoop, because it’s cheaper than the warehouse. So let’s just put it all in there, and we’ll figure out what to do with it later,” he said. But once it was time to extract and analyze the data, many found their big data lakes unnavigable swamps.

A more thoughtful approach to storing and preparing data results is less grunt work for analysts, according to Selland. Unifi actually leverages Hadoop in appropriate cases. “If you’re using it in a structured way, it can be extremely useful,” he said.

While Unifi is not a governance platform, data preparation is ultimately a crucial component of governance. “GDPR is coming in May, and you’re starting to really see a groundswell of, ‘OK … that’s not about speeds and feeds; that’s ultimately about making sure that I’m compliant with this huge regulatory environment,'” he said.

Unifi is committed to complying with the law, as well as the court of public opinion. “We want to make sure that we’re doing the right thing with data,” Selland concluded.

Watch the complete video interview below, and be sure to check out more of SiliconANGLE’s and theCUBE’s coverage of the BigData SV event.

Photo: SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Support our open free content by sharing and engaging with our content and community.

Join theCUBE Alumni Trust Network

Where Technology Leaders Connect, Share Intelligence & Create Opportunities

11.4k+  
CUBE Alumni Network
C-level and Technical
Domain Experts
15M+ 
theCUBE
Viewers
Connect with 11,413+ industry leaders from our network of tech and business leaders forming a unique trusted network effect.

SiliconANGLE Media is a recognized leader in digital media innovation serving innovative audiences and brands, bringing together cutting-edge technology, influential content, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — such as those established in Silicon Valley and the New York Stock Exchange (NYSE) — SiliconANGLE Media operates at the intersection of media, technology, and AI. .

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a powerful ecosystem of industry-leading digital media brands, with a reach of 15+ million elite tech professionals. The company’s new, proprietary theCUBE AI Video cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.