UPDATED 20:27 EST / MARCH 04 2021

AI

Facebook’s new AI model SEER can teach itself to recognize images

Researchers at Facebook Inc. have created an artificial intelligence-based image recognition model called SEER that’s able to describe what it’s seeing without being trained first on a labeled dataset.

Facebook said that SEER, which is an acronym for “Self-SupeERvised,” is a breakthrough that could lead to a “revolution” in computer vision.

The SEER model, outlined in a paper released March 2, was fed 1 billion publicly available images without annotations or labels from Instagram. It then worked through the dataset, learning as it progressed, and was eventually able to achieve extremely high accuracy in tasks such as object detection.

Self-supervised AI learning is already established in the AI field. It refers to AI systems that can learn directly from whatever information they are given without being trained on carefully labeled datasets that can teach them how to perform a given task, such as recognizing an object in a photo or translating a piece of text.

The advantage of self-supervised learning is that these models can be trained much faster. Labeling data is a painstaking and time-consuming task that takes hours for humans to perform. In addition, self-supervised learning models are also able to work with bigger and more diverse datasets too.

Facebook’s AI team said in a blog post that SEER eventually outperformed existing AI models in the ImageNet object recognition test, which is an industry standard for gauging the effectiveness of such systems. SEER achieved a classification accuracy score of 84.2% in the test, which asks the model to identify what it’s seeing in thousands of different photos.

SEER, Facebook explained, takes advantage of an older algorithm called SwAV that relies on online clustering to group images that have similar visual concepts rapidly, to leverage their similarities. “With SwAV, we were able to improve over the previous state of the art in self-supervised learning, and did so with 6x less training time,” Facebook said.

SEER also incorporates a new AI model architecture called RegNets, which are convolutional neural networks capable of scaling to trillions of parameters that can be optimized to fit various runtime and memory limitations. The third and final component of SEER is an all-purpose library for self-supervised learning called VISSL.

“The future of AI is in creating systems that can learn directly from whatever information they’re given — whether it’s text, images, or another type of data — without relying on carefully curated and labeled data sets to teach them how to recognize objects in a photo, interpret a block of text, or perform any of the countless other tasks that we ask it to,” Facebook’s researchers said. “This is a breakthrough that ultimately clears the path for more flexible, accurate, and adaptable computer vision models in the future.”

Facebook listed a number of potential use cases for SEER, such as automatically generating text to describe images to people with visual impairments, better categorization of items sold on the Facebook Marketplace and better systems for censoring “harmful” images on Facebook’s platform.

Facebook looks intent on keeping the secrets of SEER to itself, at least for now. The company didn’t mention any plans to open-source SEER, though it will release the VISSL library to the research community so that it can be used to train other self-supervised learning models.

Image: Facebook

A message from John Furrier, co-founder of SiliconANGLE:

Your vote of support is important to us and it helps us keep the content FREE.

One click below supports our mission to provide free, deep, and relevant content.  

Join our community on YouTube

Join the community that includes more than 15,000 #CubeAlumni experts, including Amazon.com CEO Andy Jassy, Dell Technologies founder and CEO Michael Dell, Intel CEO Pat Gelsinger, and many more luminaries and experts.

“TheCUBE is an important partner to the industry. You guys really are a part of our events and we really appreciate you coming and I know people appreciate the content you create as well” – Andy Jassy

THANK YOU