UPDATED 23:34 EDT / JUNE 06 2019

AI

Microsoft pulls massive facial recognition database from internet

Microsoft Corp. this week discreetly pulled a facial recognition database from the internet, according to a report published Thursday.

Built back in 2016, the database known as MS Celeb at one point contained 10 million images featuring about 100,000 people, said to be the largest database of its kind. It was only supposed to contain celebrities, but according to one researcher photos of “journalists, artists, musicians, activists, policy makers, writers, and academics” were also in the database.

Such images can be pulled from the internet under the Creative Commons license, meaning people could have ended up in the database without knowing about it. It seems China was the country that made the most use of those images.

Chinese tech firms SenseTime and Megvii both used the database. Both those companies, according to The New York Times have supplied facial recognition technology to the Chinese government. That report states that the government has used the software to keep tabs on ethnic minorities.

Suffice to say, it’s not a good look for Microsoft, a company that has been outspoken concerning how facial recognition could be misused in matters of surveillance. In April, the company said it turned down an opportunity to share its facial recognition technology with U.S. law enforcement amid fears it could breach human rights.

“The site was intended for academic purposes,” Microsoft said in a statement to The Financial Times. “It was run by an employee that is no longer with Microsoft and has since been removed.”

One journalist in that report said he was unhappy about his inclusion in the dataset, saying, “Microsoft’s inability to hold their own researchers to integrity and probity that this was not torpedoed before it left the building.”

Microsoft added that the dataset was taken down once “the challenge was over,” although it’s not completely expunged.

“Despite the recent termination of the msceleb.org website, the dataset still exists in several repositories on GitHub, the hard drives of countless researchers, and will likely continue to be used in research projects around the world,” Berlin-based researcher Adam Harvey wrote as part of his project, MegaPixels. The project clearly outlines who used the project and also gives names of those whose images were used.

Image: Microsoft

A message from John Furrier, co-founder of SiliconANGLE:

Support our mission to keep content open and free by engaging with theCUBE community. Join theCUBE’s Alumni Trust Network, where technology leaders connect, share intelligence and create opportunities.

  • 15M+ viewers of theCUBE videos, powering conversations across AI, cloud, cybersecurity and more
  • 11.4k+ theCUBE alumni — Connect with more than 11,400 tech and business leaders shaping the future through a unique trusted-based network.
About SiliconANGLE Media
SiliconANGLE Media is a recognized leader in digital media innovation, uniting breakthrough technology, strategic insights and real-time audience engagement. As the parent company of SiliconANGLE, theCUBE Network, theCUBE Research, CUBE365, theCUBE AI and theCUBE SuperStudios — with flagship locations in Silicon Valley and the New York Stock Exchange — SiliconANGLE Media operates at the intersection of media, technology and AI.

Founded by tech visionaries John Furrier and Dave Vellante, SiliconANGLE Media has built a dynamic ecosystem of industry-leading digital media brands that reach 15+ million elite tech professionals. Our new proprietary theCUBE AI Video Cloud is breaking ground in audience interaction, leveraging theCUBEai.com neural network to help technology companies make data-driven decisions and stay at the forefront of industry conversations.