TY - GEN
T1 - Hiding distinguished ones into crowd
T2 - 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09
AU - Wang, Hui
AU - Liu, Ruilin
PY - 2009
Y1 - 2009
N2 - Publishing microdata raises concerns of individual privacy. When there exist outlier records in the microdata, the dis-tinguishability of the outliers enables their privacy to be easier to be compromised than that of regular ones. However, none of the existing anonymization techniques can provide sufficient protection to the privacy of the outliers. In this paper, we study the problem of anonymizing the micro-data that contains outliers. We define the distinguishability-based attack by which the adversary can infer the existence of outliers as well as their private information from the anonymized microdata. To defend against the distinguishabilitj based attack, we define the plain k-anonymity as the privacy principle. Based on the definition, we categorize the outliers into two types, the ones that cannot be hidden by any plain k-anonymous group (called global outliers) and the ones that can (called local outliers). We propose the algorithm to efficiently anonymize local outliers with low information loss. Our experiments demonstrate the efficiency and effectiveness of our approach.
AB - Publishing microdata raises concerns of individual privacy. When there exist outlier records in the microdata, the dis-tinguishability of the outliers enables their privacy to be easier to be compromised than that of regular ones. However, none of the existing anonymization techniques can provide sufficient protection to the privacy of the outliers. In this paper, we study the problem of anonymizing the micro-data that contains outliers. We define the distinguishability-based attack by which the adversary can infer the existence of outliers as well as their private information from the anonymized microdata. To defend against the distinguishabilitj based attack, we define the plain k-anonymity as the privacy principle. Based on the definition, we categorize the outliers into two types, the ones that cannot be hidden by any plain k-anonymous group (called global outliers) and the ones that can (called local outliers). We propose the algorithm to efficiently anonymize local outliers with low information loss. Our experiments demonstrate the efficiency and effectiveness of our approach.
UR - http://www.scopus.com/inward/record.url?scp=70349108684&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=70349108684&partnerID=8YFLogxK
U2 - 10.1145/1516360.1516433
DO - 10.1145/1516360.1516433
M3 - Conference contribution
AN - SCOPUS:70349108684
SN - 9781605584225
T3 - Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology, EDBT'09
SP - 624
EP - 635
BT - Proceedings of the 12th International Conference on Extending Database Technology
Y2 - 24 March 2009 through 26 March 2009
ER -