UPDATED 17:15 EDT / AUGUST 25 2016

NEWS

Facebook open sources its computer vision tools

Facebook has announced that it will be open sourcing several of the computer vision tools it has developed, including DeepMask, SharpMask, and MultiPathNet.

All three tools were developed by the Facebook Artificial Intelligence Research (FAIR) team, with the goal of teaching computers to intelligently break down images and recognize objects, locations, and people. According to Facebook, the company hopes that opening up its tools will help the AI research community as a whole move forward with new advancements in computer vision.

“We’re making the code for DeepMask+SharpMask as well as MultiPathNet — along with our research papers and demos related to them — open and accessible to all, with the hope that they’ll help rapidly advance the field of machine vision,” Piotr Dollar, a researcher with FAIR, wrote in a blog post. “As we continue improving these core technologies we’ll continue publishing our latest results and updating the open source tools we make available to the community.”

What do they do?

In his post, Dollar went into detail about how exactly Facebook’s tools work, beginning with DeepMask. According to Dollar, DeepMask breaks images down into a grid-like series of patches. The program then looks at each patch to see if it contains any objects, and if so, how many. This process allows DeepMask to determine how many objects an image contains and what their general shapes are, but it does not capture any details about those objects. That is where SharpMask comes in.
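To make the patch idea concrete, here is a minimal Python sketch. It is not Facebook’s code: the `objectness` function is a hypothetical stand-in for DeepMask’s learned scoring, and the patch size, stride, and threshold are illustrative assumptions. It only shows how scoring patches on a grid yields a rough, blocky mask.

```python
def iter_patches(image, size, stride):
    """Yield (row, col, patch) for every size x size window in the image."""
    height, width = len(image), len(image[0])
    for r in range(0, height - size + 1, stride):
        for c in range(0, width - size + 1, stride):
            patch = [row[c:c + size] for row in image[r:r + size]]
            yield r, c, patch


def objectness(patch):
    """Hypothetical score (not a trained model): fraction of non-zero pixels."""
    pixels = [p for row in patch for p in row]
    return sum(1 for p in pixels if p) / len(pixels)


def coarse_mask(image, size=2, stride=2, threshold=0.5):
    """Mark an entire patch as 'object' when its score reaches the threshold."""
    mask = [[0] * len(image[0]) for _ in image]
    for r, c, patch in iter_patches(image, size, stride):
        if objectness(patch) >= threshold:
            for rr in range(r, r + size):
                for cc in range(c, c + size):
                    mask[rr][cc] = 1
    return mask


# Toy 4x4 "image": non-zero values belong to an object in the lower right.
image = [
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 9, 9, 9],
    [0, 9, 9, 9],
]

for row in coarse_mask(image):
    print(row)
```

In this toy case the bottom two rows are marked in full, including the empty left-hand column, which mirrors the article’s point: a patch-level pass finds the general shape but not the exact outline.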

True to its name, SharpMask takes the vague data from DeepMask and sharpens it, bringing objects into focus and perfecting their shapes. Dollar said that SharpMask analyzes images pixel by pixel based on information already gathered from DeepMask.

“To capture general object shape, you have to have a high-level understanding of what you are looking at (DeepMask), but to accurately place the boundaries you need to look back at lower-level features all the way down to the pixels (SharpMask),” Dollar said. “In essence, we aim to make use of information from all layers of a network, with minimal additional overhead.”
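The coarse-to-fine idea Dollar describes can be illustrated with a toy Python sketch. This is an assumption-laden simplification, not SharpMask itself: the real system refines masks using features from the layers of a neural network, whereas the hypothetical `refine_mask` below simply checks each pixel’s own value against an illustrative threshold.

```python
def refine_mask(image, coarse, pixel_threshold=1):
    """Keep a coarse-mask pixel only if the pixel itself looks like foreground."""
    return [
        [1 if keep and value >= pixel_threshold else 0
         for value, keep in zip(image_row, mask_row)]
        for image_row, mask_row in zip(image, coarse)
    ]


# Toy 4x4 "image" and a blocky, patch-level mask that overshoots on the left.
image = [
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [0, 9, 9, 9],
    [0, 9, 9, 9],
]
coarse = [
    [0, 0, 0, 0],
    [0, 0, 0, 0],
    [1, 1, 1, 1],
    [1, 1, 1, 1],
]

for row in refine_mask(image, coarse):
    print(row)
```

The refined mask drops the empty left-hand column that the coarse mask had included, so the boundary now follows the pixels rather than the patch grid.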

Of course, simply figuring out an object’s shape is only the first step in computer vision, and the real challenge is recognizing what that shape represents. If DeepMask and SharpMask are the eyes, then MultiPathNet is the brain. MultiPathNet uses a deep learning neural network to examine the shapes created by DeepMask and SharpMask and assign meaning to them.

So for example, DeepMask might look at an image and determine that it contains six misshapen blobs of some sort. SharpMask would then come in and determine that those six blobs actually have legs, feet, ears, and so on. Finally, MultiPathNet would analyze those shapes and recognize that they are actually six sheep standing in a field.
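The final step, turning outlined shapes into named objects, can be sketched in Python as follows. Everything here is illustrative: `find_regions` is an ordinary flood fill rather than anything from Facebook’s tools, and `classify` is a hypothetical rule based on pixel count, standing in for the trained neural network that MultiPathNet actually uses.

```python
def find_regions(mask):
    """Group touching foreground pixels (4-connectivity) into separate regions."""
    height, width = len(mask), len(mask[0])
    seen = set()
    regions = []
    for r in range(height):
        for c in range(width):
            if mask[r][c] and (r, c) not in seen:
                stack, region = [(r, c)], []
                seen.add((r, c))
                while stack:
                    y, x = stack.pop()
                    region.append((y, x))
                    for ny, nx in ((y + 1, x), (y - 1, x), (y, x + 1), (y, x - 1)):
                        inside = 0 <= ny < height and 0 <= nx < width
                        if inside and mask[ny][nx] and (ny, nx) not in seen:
                            seen.add((ny, nx))
                            stack.append((ny, nx))
                regions.append(region)
    return regions


def classify(region):
    """Hypothetical rule (not a trained model): label a region by its pixel count."""
    return "sheep" if len(region) >= 3 else "unknown"


# Toy refined mask containing three separate shapes.
mask = [
    [1, 1, 0, 0, 0],
    [1, 1, 0, 1, 0],
    [0, 0, 0, 0, 0],
    [0, 1, 1, 1, 0],
]

labels = [classify(region) for region in find_regions(mask)]
print(labels)  # ['sheep', 'unknown', 'sheep']
```

The two larger shapes receive a label while the single stray pixel does not, which is the same division of labor the article describes: segmentation finds the shapes, and a separate recognition step decides what they are.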

Image courtesy of Facebook
