UPDATED 17:12 EDT / OCTOBER 10 2017

BIG DATA

IBM wants to give data scientists a lot more free time

When it comes to managing data in the enterprise, words like “cleansing,” “wrangling” or “preparation” are frequently used to describe the work necessary to place information in the right shape and form so it can be effectively used. If this sounds like a lot of work, it is. So IBM has introduced its Integrated Analytics System, based on an SQL engine, to let data scientists work across multiple data stores and save significant time.

“Most enterprises struggle with complexity. That’s the number one problem when it comes to analytics. We are trying to make data really simple to use,” said Rob Thomas (pictured), general manager of IBM Analytics at IBM Corp.

Thomas stopped by theCUBE, SiliconANGLE’s mobile livestreaming studio, and spoke with host John Furrier (@furrier) during the recent BigData NYC event in New York City. They discussed the technology behind the new analytics offering, how users can obtain and run it, and the future direction of the multicloud world. (* Disclosure below.)

The Integrated Analytics System is designed for deployment across private, public or hybrid clouds, with machine learning via Apache Spark (an open-source in-memory data processing engine) embedded in the enterprise offering. The concept is to integrate time-consuming functions like combining and cleaning the data, building a warehouse and selecting data science tools into a single system.

“If you move to this model, suddenly what was a bunch of disparate tools are now microservices against a common architecture,” Thomas explained. “So it totally changes the nature of a data platform in the enterprise.”

Eliminates the data wrangling

IBM has also simplified access to the analytics solution. Users can bring the Spark-loaded box into the data center, download a containerized version available on the Web or run it directly on the IBM cloud. “We’ve eliminated that need for all of that data movement, for all of the data wrangling,” Thomas said. “We’ve made it really simple.”

The release of IBM’s analytics tool, which can be used across a variety of cloud environments, follows its announcement of the Hortonworks Inc. DataPlane Service, a cloud offering designed to collect data in multiple locations. These recent announcements from IBM appear to be geared toward meeting the increasing demands in a multicloud world, although this evolution remains a work in progress.

“I don’t think any enterprise will go ‘all in’ on one cloud; it’s delusional for people to think that,” said Thomas, though he also cautioned that it remains to be seen what a multicloud world will actually look like. “Let’s be honest, the multicloud world is still pretty early,” he added.

Watch the complete video interview below, and be sure to check out more of SiliconANGLE’s and theCUBE’s coverage of BigData NYC 2017. (* Disclosure: IBM Corp. sponsored this segment of theCUBE. Neither IBM nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)

Photo: SiliconANGLE
