UPDATED 16:30 EST / MARCH 24 2021

BIG DATA

Knowledge graphs provide smart database management across hybrid computing environment

The next challenge in data management is accessing data resources that are dispersed across a hybrid computing environment. Companies have invested in master data management solutions, breaking down silos and centralizing resources to simplify access. But moving data is costly, and silos serve a purpose by isolating secure data and allowing local control and governance.

Gaining a comprehensive view of data across locations from on-premises out to the edge requires the merger of human and machine intelligence in solutions that connect data and leverage it to provide context in situ.

“What we do differently than everyone else is by allowing you to keep the data in its existing data silos, … we allow you to connect to that data where it is, cross-zone, whether it’s on-prem or on the cloud,” said Rob Harris (pictured), vice president of solutions consulting at Stardog Union Inc.

Harris spoke with John Furrier, host of theCUBE, SiliconANGLE Media’s livestreaming studio, during the AWS Startup Showcase Event: Innovators in Cloud Data. They discussed how data fabric and knowledge graph technology is connecting data in a hybrid world. (* Disclosure below.)

Unifying data regardless of location

The answer to unifying data across silos is to stitch it together into a data fabric, which unites the disparate data sources at the compute layer. Forming that fabric is the job of the knowledge graph.

By tying together customer data, the Stardog Enterprise Knowledge Graph unifies data access in real time across multiple applications. This enables companies to find, search and understand the context and relationship of all the data within their organization. Pivotal is the ability to manage not only the data itself, but the relationships between bits of data.

“What we look for is the horizontal-type solution where you have a lot of systems that you want to tie together or you want to have that understanding of your data all within context throughout your organization,” Harris said.

Knowledge graphs present a major business opportunity. The global data analytics market has a current compound annual growth rate of 28.9%, according to market research, and is expected to reach a market value of $133 billion by 2026. Graphing database management solutions have consistently outpaced all other database models in popularity, according to ranking statistics on db-engines.com, and were named by Gartner as a top data and analytics trend for 2021. According to the same Gartner report, 50% of the organization’s client inquiries on artificial intelligence involve a discussion on graph technology.

“A lot of people have gone down this path of trying to create these large repositories, data lakes, data warehouses. We try to provide the additional value on top of them by not forcing you to continue to invest in moving and centralizing all your data together, but connecting it and providing context while leaving and leveraging where it is,” Harris stated.

Semantic modeling and automated management infrastructure

As well as unifying data regardless of location, the Stardog Knowledge Graph platform has the benefit of representing the data with a flexible, reusable semantic model. This is key to efficient, successful data strategy, according to Stardog’s founder and Chief executive officer Kendall Clark.

“Relational data was never designed to support complex business processes with changing requirements, particularly with the incredible data variety we see today,” Clark stated in a blog post in January.  “Semantic graph … is the natural way to represent data that is natively stored in other structures.”

Semantic technology also solves the constant need to spin up new repositories for each question, which is a common failing in analytics, according to Harris. “By starting with the semantic graph, we allow you to incrementally invest in building out your model and then being able to reuse that model as you go through your implementations,” he said. “I think it’s time for that to hit mainstream.” 

Enriching the data through machine learning gives predictive abilities, and automation removes monotonous tasks and disrupts the traditional extract, transform, load pipeline.

“Machine learning is how we use things like linear regression algorithms or other types of community detection algorithms in order to build more connections in the data so that you can get really unlock that value that you are looking for when you’re leveraging graph technology,” Harris said, adding that “people need help” with the complexity of managing data in a hybrid environment, especially as edge devices proliferate.

“It can’t be that you’re going to define all those edges and connections by hand yourself through some systems or keys,” Harris said. ”It’s not sufficient in order to really get the value out of that graph that you expect.”

Ingest, connect, and see value

Although still a startup, Stardog lists an impressive customer lineup that includes the National Aeronautics and Space Administration. Andrew Schain, program data integration manager at NASA’s Exploration Systems Division, claims the agency reduced costs by up to 90%.

As well as supporting missions to Mars, Stardog’s knowledge graphs reduce costs and decrease time-to-market for products and solutions in the energy sector, pharmaceutical industry and financial services.

“One of our largest customer base is really in pharmaceuticals, and they’re using this technology in order to find more connections in the data so that they can really decrease the amount of time for getting a drug to market on the research and development time,” Harris said.

Handling data without the additional costs related to moving it around is Stardog’s key value proposition, according to Harris.

“The bottom line is you want to try to get more value out of your data at a lower cost and make it easier and faster to do,” Harris stated. “As people try to figure out how [to] handle this large amount of data without having to generate all the additional costs about moving it around, we look at how do I push that compute down to the storage layers where the data already exists.”

Watch the complete video interview below, and be sure to check out more of SiliconANGLE’s and theCUBE’s coverage of the AWS Startup Showcase Event: Innovators in Cloud Data. (* Disclosure: Stardog Union sponsored this CUBE Conversation. Neither Stardog nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU