UPDATED 15:00 EDT / MAY 25 2018

INFRA

Analysis by paralysis: Why data should stay put in AI infrastructure

Artificial intelligence and machine learning software geared for data analysis is flooding the market, and companies are buying. But a close examination reveals that the vast majority of big-data projects implode and most data scientists spend little time actually doing analytics. What are all of these people doing wrong?

“The bigger movement here is that recent advances in technology have really rehighlighted a focus on organizations getting more out of their data of all forms,” said Rob Lee (pictured), vice president and chief architect of Pure Storage Inc.

Lee spoke with Dave Vellante (@dvellante) and Lisa Martin (@LuccaZara), co-hosts of theCUBE, SiliconANGLE Media’s mobile livestreaming studio, during the Pure Storage Accelerate event in San Francisco. They discussed the holdups in big data and AI and how companies can bust through them(* Disclosure below.)

The wide availability of advanced algorithms is democratizing AI for all businesses, on the one hand, Lee pointed out. Conversely, there are two factors that will separate the sprinters from the hobblers in the race to big data and success. One is the sheer wealth of data in their possession, he said, pointing to Google Inc. as the obvious proof point.

“The takeaway point there is having a lot of data trumps having the best algorithm, and we expect that to continue as AI research and algorithms continue to evolve,” Lee stated.

The second is infrastructure that ties AI algorithms and applications together to deliver insight or action some time before next Christmas.

What type of infrastructure should companies serious about AI be looking at? “It’s all about simplicity; it’s all about removing friction and bottlenecks,” Lee said. A commonly cited statistic is that data scientists spend 80 percent of their time wrangling, funneling and transporting data in various forms — rather than real analytics. “And the other 20 percent is spent complaining about the first 80 percent,” Lee joked.

“If you take a look at an AI pipeline to do something like training an object detection system for self driving cars, that pipeline — that simple sentence — may encapsulate 30 or 40 different applications,” he said.

Humans have to be removed and replaced by automation as much as possible in that scenario. “Without an infrastructure to make it easy to centralize the data-management portion of that, you’ve also potentially got 30 or 40 different data silos,” Lee said. This requires new data-centric architectures (as well as practice and processes) built around the idea that data is very difficult to move.

“You want to move it as few times as possible, manage it as little as possible,” Lee said.

Here’s the complete video interview, and there’s more coverage on SiliconANGLE and theCUBE. (* Disclosure: TheCUBE is a paid media partner for the Pure Storage Accelerate event. Neither Pure Storage Inc., the event sponsor, nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.