UPDATED 19:34 EST / JUNE 25 2020

AI

Databricks hands its MLflow machine learning platform to the Linux Foundation

Databricks Inc., the big-data and machine learning company that leads the commercial development of Apache Spark, today put its MLflow project into the hands of the Linux Foundation.

MLflow is a machine learning operations or MLOps platform that the company first open-sourced two years ago. The software gives developers a programmatic way to handle all of the pieces of a machine learning project, from construction, to training, fine-tuning, deployment, management and revision.

MLflow is used to track all of the datasets, model instances, model parameter and algorithms developers use in their machine learning projects. That enables them to be versioned, stored in a central repository and then repackaged so they can be used in other projects as required.

The MLflow project has proven to be a big hit in the machine learning community, with more than 2 million downloads per month. It also counts more than 200 contributors who regularly update the software.

Databricks said it made the decision to hand over stewardship of the project to the Linux Foundation in order to give it a “vendor-neutral home with an open governance model.”

The main motivation for doing so is to try to get more people and more enterprises to work on and contribute to the project, Constellation Research Inc. analyst Holger Mueller told SiliconANGLE.

“But the track record of that is mixed, not only because there are so many open source projects for developers to work on, but because other companies that compete with the gifting vendor are generally suspicious in the first phases of incubation,” Mueller said. “It’s too early to tell what this means for MLflow at this point, so the key thing to watch is if more resources join the project in the coming months.”

MLflow isn’t the only open source project for managing machine learning pipelines that’s already seen widespread adoption. One of its rival projects is kubeflow, which relies on the Kubernetes container orchestration software to manage machine learning pipelines.

Although MLflow and kubeflow are rival projects, they also differ in quite a few ways. For instance, MLflow doesn’t rely on Kubernetes as a component. Instead, it runs on local machines using Python scripts or Databricks as a hosted environment. And MLflow is far more versatile, able to work with multiple machine learning frameworks, whereas kubeflow can only be used with TensorFlow and PyTorch.

Image: Databricks

A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU