X

It's not that hard to unmask real people in anonymous data, researchers warn

How easily can your data be unmasked? Here’s how to check your score.

Rae Hodge Former senior editor
Rae Hodge was a senior editor at CNET. She led CNET's coverage of privacy and cybersecurity tools from July 2019 to January 2023. As a data-driven investigative journalist on the software and services team, she reviewed VPNs, password managers, antivirus software, anti-surveillance methods and ethics in tech. Prior to joining CNET in 2019, Rae spent nearly a decade covering politics and protests for the AP, NPR, the BBC and other local and international outlets.
Rae Hodge
keyboard-security-privacy-laptop-hacking-7923

Researchers say their work shows that re-identification is a real risk. 

James Martin/CNET

Health care information, tax records, credit scores and browsing history -- every day data brokers traffic in datasets about you that are supposed to be anonymized. But unmasking personally identifying information in those datasets is easier than you might think, according to a study released Tuesday in Nature Communications

By using just 15 demographic attributes and a bit of machine learning, researchers from Imperial College London and the University of Louvain said "99.98% of Americans would be correctly re-identified in any dataset." The researchers said their work shows that re-identification is a real risk and question whether current practices satisfy modern data protection laws such as Europe's General Data Protection Regulation and the California Consumer Privacy Act.    

Researchers fed the new machine learning tool public information on more than 11 million individuals, obtained from 210 different data sets from five sources, including the US Census Bureau. How easily could you be spotted in anonymized data? The Computational Privacy Group at Imperial College London has also created a tool to check how likely you'd be correctly re-identified in anonymous data sets. (And in case you're wondering, a note on the site says the demo runs only in your browser and they don't collect your info.)

The researchers didn't immediately respond to a request for additional comment. 

Watch this: Loads of Android apps are skirting privacy controls