UPDATED 12:00 EDT / OCTOBER 25 2017

THOUGHT LEADERSHIP

As deep learning frameworks converge, automation possibilities unfold

From a developer’s standpoint, deep learning is usually a hands-on exercise conducted within a particular modeling framework. Typically, a developer has needed to adapt their own manual coding style to interfaces provided by a specific framework, such as TensorFlow, Apache MXNet, Microsoft Cognitive Toolkit (CNTK), Caffe, Caffe2, Torch and Keras.

Getting productive on a new DL project may require that DL developers be cross-trained on a different modeling framework. However, this requirement is becoming more cumbersome as the range of open-source and commercial DL frameworks grows. One way in which developers are attempting to mitigate these complexities is by standardizing the most popular DL modeling frameworks. A recent article provides quantitative metrics on adoption of the leading DL frameworks.

Another important trend is that DL modeling frameworks are converging around higher-level development abstractions, intermediate representation languages and optimized cross-platform model compilers. Building DL models is becoming just another branch of application development, so it was significant when Amazon Web Services Inc. and Microsoft Corp. recently announced their jointly developed Gluon reference specification.

As detailed in a recent SiliconANGLE article, Gluon provides a development abstraction for prototyping, building, training and optimization of DL models. The Gluon application programming interface, defined in Python, is agnostic to underlying DL frameworks and runtime engines. The specification allows other DL engines to be plugged into the Gluon API without hurting the training speed enabled by those engines.

Currently available in the AWS-developed MXNet 0.11 and to be rolled out soon in CNTK, Gluon enables DL developers to:

Prototype, build, train and deploy DL models in any framework and deploy in an efficient format to any target platform;
Program models using a concise Python API, which reduces the amount of coding associated with any given DL project;
Model DL models flexibly like any other data structure;
Create DL models on the fly, with any structure, and change them rapidly using Python’s native control flow;
Reuse pre-built, optimized DL building blocks, including predefined layers, optimizers and initializers;
Allow developers to use standard programming loops and conditionals to prototype, build, revise and debug DL models;
Free developers from having to know about specific DL compilation or execution details; and
Optimize DL training algorithms automatically in alignment with model revisions.

From what I can see, the most important innovation in Gluon is the last feature: automatic optimization of DL training workflows to align with changes in the models. Considering that training is one of the most time-consuming aspects of any DL initiative and that DL developers rarely have expertise in how to accelerate this process, this automation feature promises a significant boost in developer productivity — that is, if Gluon becomes widely adopted.

Though its developers haven’t announced any future roadmap for Gluon, it’s highly likely that more automation features will be introduced. This is almost inevitable, considering how rapidly automation is coming to every aspect of the DL and machine learning development cycle. Across the industry, we’re seeing mainstream DL development tools starting to incorporate these features:

Automatically generate customized REST APIs and Docker images around DL models during the promotion and deployment stages;
Automatically deploy DL models for execution into private, public or hybrid multi-cloud platforms;
Automatically scale DL models’ runtime resource consumption up or down based on changing application requirements;
Automatically retrain DL models using fresh data prior to redeploying them;
Automatically keep track of which DL model version is currently deployed; and
Automatically ensure that a sufficiently predictive DL model in always in live production status.

It wouldn’t be surprising if Microsoft were to weave more of its ML-accelerated auto-programming technology into Gluon to speed writing of Python code for DL deployment across runtime engines, platforms and hardware platforms.

What do you think? If you’re a DL, data science, DevOps or IT operations professional, you almost certainly have practical insights for automating data-driven business processes. I would love to hear your thoughts. Please join me on Wednesday, Nov. 1, from 2 to 3 p.m. EDT for the Wikibon CrowdChat “Automating Data Analytics Management to the Max.” You can participate simply by clicking here, logging in with your Twitter handle and posting your thoughts in an interactive, moderated Q&A format.

Image: geralt/Pixabay

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.

About SiliconANGLE Media

SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.

As deep learning frameworks converge, automation possibilities unfold

Image: geralt/Pixabay

A message from John Furrier, co-founder of SiliconANGLE:

LATEST FROM THECUBE

UPCOMING CUBE EVENTS

RECENT CUBE EVENTS

Google Cloud Next 2026

Phi Moments @ Next 2026

SUSECON 2026

Oracle Data Deep Dive NYC 2026

HPE World Quantum Day 2026

As deep learning frameworks converge, automation possibilities unfold

Image: geralt/Pixabay

A message from John Furrier, co-founder of SiliconANGLE:

LATEST STORIES

LATEST STORIES

Google Cloud Next 2026

Phi Moments @ Next 2026

SUSECON 2026

Oracle Data Deep Dive NYC 2026

HPE World Quantum Day 2026

Cookies