Microsoft quietly deleted a data set of more than 10 million images, intended as a test and training data set for facial recognition algorithms, according to a report (paywall) by the Financial Times. The database, dubbed MS Celeb, was the largest public facial recognition data set in the world, and contained more than 10 million images of more than 100,000 people — largely scrapped from publicly available online sources. Uncovered by Berlin-based researcher Adam Harvey, it was reportedly being used by companies to test their facial recognition software. The takedown came after an Financial Times investigation (paywall) found that many of the people in the database were not aware of they were on it, and had not consented to having their pictures used. In a statement to FT, Microsoft tried to downplay the controversy, saying the database was only for “academic purposes” and was run by an employee who “no longer works for the company”. The FT report noted that the MS Celeb database is still available to any academic institution or company that had previously downloaded it, and is still being shared on GitHub, Dropbox, and Baidu Cloud. Harvey also discovered very similar databases hosted by Duke and Stanford Universities' researchers during his investigation, which have since been taken down. Microsoft is not the only company to have assembled a large data set by scraping photos from open Internet. In January, IBM announced it was sharing a collection of 1 million publicly available faces to "study the fairness and accuracy in facial recognition technology". It…
- How much money was seized by the Indian state while investigating illegal loan apps? February 8, 2023
- Publishers Aren’t ‘Intermediaries’, IT Ministry Says: What Does This Mean for India’s Online Gaming Laws? February 8, 2023
- You can now use PhonePe UPI to pay in UAE, Singapore, Nepal and Bhutan February 8, 2023
- Wikipedia ‘Unblocked’ in Pakistan, PM Forms A Committee To Regulate ‘Objectionable’ Content February 8, 2023
- Explained: What is the feud over ownership between Binance and WazirX? February 8, 2023
MediaNama’s mission is to help build a digital ecosystem which is open, fair, global and competitive.
India's smartphone operating system BharOS has received much buzz in the media lately, but does it really merit this attention?
After using the Mapples app as his default navigation app for a week, Sarvesh draws a comparison between Google Maps and Mapples
The regulatory ambivalence around an instrument so essential to facilitate data exchange – the CM framework – is disconcerting for several reasons.
The provisions around grievance redressal in the Data Protection Bill "stands to be dangerously sparse and nugatory on various counts."
Please subscribe to MediaNama. Don't share prints and PDFs.
You May Also Like
Google has released a Google Travel Trends Report which states that branded budget hotel search queries grew 179% year over year (YOY) in India, in...
135 job openings in over 60 companies are listed at our free Digital and Mobile Job Board: If you’re looking for a job, or...
Twitter takes down tweets from MP, MLA, editor criticising handling of pandemic upon government request
By Aroon Deep and Aditya Chunduru You’re reading it here first: Twitter has complied with government requests to censor 52 tweets that mostly criticised...