UPDATED 07:00 EDT / MARCH 19 2018

BIG DATA

Io-Tahoe brings machine learning to data discovery and cataloging

Another company has joined the smart data catalog party.

Io-Tahoe LLC, a unit of British utility giant Centrica PLC, today is introducing a machine learning-driven data discovery product that it says can find and classify data across a wide range of platforms ranging from traditional databases to semistructured data lakes.

At the center of the software is a data catalog that uses a set of 14 machine learning algorithms to create, maintain and search business rules, define policies and support governance-based workflows. The software automatically enriches metadata and enables business users to create and manage policies and rules.

Data catalogs are similar to card catalogs in a library. They tell where data elements can be found and may also include other information, such as ownership, intended use and governance policies.

Io-Tahoe said its machine learning technology can look beyond metadata to the underlying source data for deep visibility into complex data sets, the company said. Io-Tahoe has filed for two patents on its relationship discovery technology, which examines the primary foreign key relationships in relational tables and plots them on a map.

“We look at data only, so if you have a field like a transaction ID that goes across multiple databases, we’re able to find it,” said Chief Executive Oksana Sokolovsky. The software can also be used for impact analysis to help organizations detect changes in data. “A week or a year later, we can look at the databases and see how the landscape has changed over time, as well as if data elements have been introduced without the company’s knowledge,” Sokolovsky said. 

The technology grew out of Sokolovsky’s experience as a top information technology executive at Wall Street investment and health care firms. “I spent 20 years dealing with large enterprises and relied a lot on data discovery,” she said. “Much of it was on spreadsheets, which resulted in inaccuracies.”

She founded Rokitt Inc. in 2014 to sell Rokitt Astra, a tool for finding hidden relationships within relational databases. Rokitt was acquired by Centrica last year and renamed Io-Tahoe. Rokitt Astra was primarily used by technical organizations for tasks like migrating between relational databases or inferring structure from messy data lakes.

With the addition of the data catalog, Io-Tahoe is now targeting nontechnical business users. “Data catalogs allow us to work with the data owners, who can create business rules, search for rules that exist and ultimately enhance the description of data elements so others can get the benefits of those rules,” she said. The technology currently works only on structured and semistructured data, but support for unstructured data is in the works.

The market for data discovery and cataloging tools has been hot of late, in part due to the impending imposition of the General Data Protection Regulation in Europe. Research firm MarketsandMarkets Research Private Ltd. estimates that the data discovery market will grow from $4.33 billion in 2016 to $10.66 billion in 2021. Waterline Data Inc. recently introduced a data discovery platform targeted specifically at GDPR compliance. One month earlier, Podium Data Inc. migrated its data catalog to the cloud.

Pricing wasn’t disclosed.

Image: Io-Tahoe

A message from John Furrier, co-founder of SiliconANGLE:

Support our open free content by sharing and engaging with our content and community.

Join theCUBE Alumni Trust Network

Where Technology Leaders Connect, Share Intelligence & Create Opportunities

11.4k+  
CUBE Alumni Network
C-level and Technical
Domain Experts
15M+ 
theCUBE
Viewers
Connect with 11,413+ industry leaders from our network of tech and business leaders forming a unique trusted network effect.

SiliconANGLE Media is a recognized leader in digital media innovation serving innovative audiences and brands, bringing together cutting-edge technology, influential content, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — such as those established in Silicon Valley and the New York Stock Exchange (NYSE) — SiliconANGLE Media operates at the intersection of media, technology, and AI. .

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a powerful ecosystem of industry-leading digital media brands, with a reach of 15+ million elite tech professionals. The company’s new, proprietary theCUBE AI Video cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.