UPDATED 18:21 EDT / JULY 22 2015

NEWS

The data integration debacle; beyond ‘nirvana’ solutions | #MITCDOIQ

“Data integration is the 800-pound gorilla in the corner, and everyone’s got it in spades,” according to Mike Stonebraker, MIT professor and data scientist. The recent recipient of the Turing Award, which is presented for major contributions of lasting importance to computing, Stonebraker sat down with theCUBE, SiliconANGLE’s Media team, during MIT CDOIQ Symposium.

Stonebraker concluded that many members of the industry believe that introducing and adhering to standards will solve current data integration issues. However, the professor remarked that such expectations were a kind of “nirvana.” For the time being, Stonebraker said he was “in favor of after-the-fact data integration to capture value in the short run.”

Stonebraker gave the example of the Beth Israel Deaconess Medical Center, which has data for “26k intensive care unit patients creating real-time data” from the monitoring equipment, as well as information on prescriptions and notes from doctors and nurses. This hospital is currently working to incorporate imaging as well. The goal of such a system, according to Stonebraker, is to be able to access all of that data at once, even if that information is generated at a different hospital.

Privacy considerations more challenging than technologies

Eventually, Stonebraker envisions, a patient with chest pains would be x-rayed, and his or her physician would be able to run a query on every x-ray worldwide that resembles that patient’s images. Nevertheless, Stonebraker declared, “Privacy considerations are more challenging than technologies.” He expressed concerns not only with the regulatory body overseeing patient data HIPPA, but also stated that a national record system would be blocked by politics between hospitals.

Stonebraker discussed his view of Big Data as a “marketing buzz word” for three reasons: too much, denoting an issue with “volume”; too fast, marking a problem with “velocity”; or too many, indicative of what Stonebraker calls “variety.” Tamr, a BI and analytics tool that Stonebraker helped create, helps “scale in variety.” Data integration cannot be done using standard techniques, according to Stonebraker. Instead, what Tamr does is isolate source number one and “de-duplicate” using statistical techniques. Additionally, the program sets a “threshold for accuracy,” which ultimately comes down to an “accuracy versus cost” choice.

Stonebraker said Tamr “organizes their human labor differently” so that a duplication would be analyzed by a domain expert in the event a company chooses not to utilize automatic processes. Tamr uses “crowd-sourcing for domain experts” for questions as well. Tamr gets smarter over time and begins to use the parameters previously used for duplications automatically,” he concluded.

Watch the full interview below, and be sure to check out more of SiliconANGLE and theCUBE’s coverage of MITCDOIQ Symposium 2015.

Photo by SiliconANGLE

A message from John Furrier, co-founder of SiliconANGLE:

Support our open free content by sharing and engaging with our content and community.

Join theCUBE Alumni Trust Network

Where Technology Leaders Connect, Share Intelligence & Create Opportunities

11.4k+  
CUBE Alumni Network
C-level and Technical
Domain Experts
15M+ 
theCUBE
Viewers
Connect with 11,413+ industry leaders from our network of tech and business leaders forming a unique trusted network effect.

SiliconANGLE Media is a recognized leader in digital media innovation serving innovative audiences and brands, bringing together cutting-edge technology, influential content, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — such as those established in Silicon Valley and the New York Stock Exchange (NYSE) — SiliconANGLE Media operates at the intersection of media, technology, and AI. .

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a powerful ecosystem of industry-leading digital media brands, with a reach of 15+ million elite tech professionals. The company’s new, proprietary theCUBE AI Video cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.