UPDATED 16:38 EDT / DECEMBER 18 2019

Inside Dataiku’s data-science, future-proof platform

Facing the risk of becoming irrelevant, many legacy companies are making greater efforts to harness cloud-based technology. Dataiku is a data-science platform that provides a collaborative environment in which data scientists and business analysts design and launch data reports and predictive machine-learning models. Whether companies were “born in the cloud” or were around well before the cloud became popular, they can take advantage of what Dataiku refers to as a future-proof platform.

“[It’s] maybe not sexy, but having good reporting and analytics is something that both 200-year-old enterprise organizations and data-native organizations, startups, need,” said Will Nowak (pictured), solutions architect at Dataiku.

Nowak spoke with Lisa Martin (@LisaMartinTV), host of theCUBE, SiliconANGLE Media’s mobile livestreaming studio, and guest host Justin Warren (@jpwarren) during the AWS re:Invent conference in Las Vegas. They discussed how Dataiku serves both native and non-native organizations, as well as what it means to have future-proof data solutions. (* Disclosure below.)

A platform for native and non-native organizations alike

Like a great restaurant, Dataiku offers customers a variety of high-quality options in one central, cohesive setting. The enterprise data-science platform provides a collaborative environment for data scientists and business analysts. Organizations use Dataiku to build and deploy reports, as well as predictive machine-learning models.

Data-native organizations that were born or reborn in the cloud, as well as legacy enterprises, can take advantage of the Dataiku platform, according to Nowak. Both kinds of organizations can benefit from simple charts and graphs that don’t require advanced data analytics. However, “[building] predictive machine-learning models and deploying those as REST API endpoints … to provide a data-driven product for your consumers” is a more advanced use case, which Dataiku also supports, Nowak explained.

Whether or not an organization has developers who build advanced models and analytics, it can still use Dataiku’s platform to reach important end results, Nowak added. “[Maybe] you don’t have developers who are very fluent in turning out fast applications. We can give you a place to build a predictive model and deploy that predictive model, saving you time to write all that code on the back end,” he said.
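To make “all that code on the back end” concrete, here is a minimal sketch of what hand-writing a prediction endpoint can involve. It uses only the Python standard library and is not Dataiku’s API. The toy linear model, the feature names, the `/predict` path and the port are all assumptions made up for illustration.

```python
import json
from http.server import BaseHTTPRequestHandler, HTTPServer

# Hypothetical "model": fixed linear weights standing in for a trained predictor.
WEIGHTS = {"visits": 0.5, "purchases": 2.0}
BIAS = -1.0


def predict(features: dict) -> float:
    """Score a record with the toy linear model; missing features count as 0."""
    return BIAS + sum(w * float(features.get(name, 0.0)) for name, w in WEIGHTS.items())


def handle_predict(raw_body: bytes) -> tuple[int, dict]:
    """Map a JSON request body to an (HTTP status, JSON-serializable reply) pair."""
    try:
        features = json.loads(raw_body)
        if not isinstance(features, dict):
            raise ValueError("expected a JSON object of features")
        return 200, {"score": predict(features)}
    except (ValueError, TypeError) as exc:
        # Malformed JSON or non-numeric feature values end up here.
        return 400, {"error": str(exc)}


class PredictHandler(BaseHTTPRequestHandler):
    """Answers POST /predict (an assumed path) with the model's score as JSON."""

    def do_POST(self):
        if self.path != "/predict":
            self.send_error(404)
            return
        length = int(self.headers.get("Content-Length", 0))
        status, reply = handle_predict(self.rfile.read(length))
        payload = json.dumps(reply).encode("utf-8")
        self.send_response(status)
        self.send_header("Content-Type", "application/json")
        self.send_header("Content-Length", str(len(payload)))
        self.end_headers()
        self.wfile.write(payload)


def serve(host: str = "127.0.0.1", port: int = 8000) -> None:
    """Run the endpoint until interrupted (host and port are assumptions)."""
    HTTPServer((host, port), PredictHandler).serve_forever()
```

Calling `serve()` starts the endpoint locally. Even this small version needs request parsing, error handling and response formatting around the model itself, which is the kind of work Nowak describes the platform taking on.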

Data quality is also an important concern for organizations, whether they are natively cloud-based or legacy. Dataiku provides simple visual indicators of data quality, so analysts and data scientists can easily discern whether data conforms to the standards their organization has established. The platform also offers additional data-quality functionality, including checks that can be configured.

“So, does this column have the appropriate schema? Does it have the appropriate cardinality? These are things that an individual might decide to use,” Nowak stated.
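The two checks Nowak mentions can be sketched in a few lines of plain Python. This is a generic illustration, not Dataiku’s implementation. The function names, the sample rows and the thresholds are hypothetical.

```python
def check_schema(rows, expected_types):
    """Return a list of problems: missing columns or values of the wrong type."""
    problems = []
    for i, row in enumerate(rows):
        for column, expected in expected_types.items():
            if column not in row:
                problems.append(f"row {i}: missing column '{column}'")
            elif not isinstance(row[column], expected):
                actual = type(row[column]).__name__
                problems.append(
                    f"row {i}: '{column}' is {actual}, expected {expected.__name__}"
                )
    return problems


def check_cardinality(rows, column, max_distinct):
    """Return (passed, distinct_count) for the number of distinct values in a column."""
    distinct = {row[column] for row in rows if column in row}
    return len(distinct) <= max_distinct, len(distinct)


# Hypothetical sample data: the third row stores its id as text.
rows = [
    {"id": 1, "country": "US"},
    {"id": 2, "country": "FR"},
    {"id": "3", "country": "US"},
]

schema_problems = check_schema(rows, {"id": int, "country": str})
country_ok, country_count = check_cardinality(rows, "country", max_distinct=2)
```

Here `schema_problems` flags the third row, whose `id` is a string rather than an integer, and the `country` column passes because it holds only two distinct values.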

Future-proof data solutions

Artificial intelligence has been a very popular and lucrative trend, which some jokingly refer to as “the hype cycle of AI.” But investing in one particular technology can be a costly risk, potentially locking organizations into technology that could become obsolete. Because the platform builds on open-source technology, Dataiku allows many languages and their versions to be adapted and applied for various uses. For example, SQL is the go-to language for data transfer, and the platform is designed to make SQL coding simple, Nowak explained. Businesses can code just as easily on the platform in Python, a common language for building machine-learning models.

“[By] leveraging open source, we figured we’re making our clients more future proof. As long as they’re [using] Dataiku to leverage the best-in-breed in open source, they’ll always be where they want to be in the technological landscape,” Nowak stated. 

Users can integrate with Dataiku regardless of the organization’s underlying security mechanisms. For example, “If you’re using AWS and you have IAM roles to manage your security, Dataiku can port those and apply those to the Dataiku environment,” Nowak noted.

Organizations that use on-premises processing, such as Hadoop, can leverage Kerberos to manage data access. Essentially, Dataiku’s aim is to leverage the best technology the organization already has on hand and has invested in. “We’re not trying to compete with them, but rather we’re enabling organizations to use these technologies efficiently,” Nowak concluded.

(*Disclosure: Dataiku Inc. sponsored this segment of theCUBE. Neither Dataiku nor other sponsors have editorial control over content on theCUBE or SiliconANGLE.)
