TY - JOUR
T1 - Privacy-preserving publishing microdata with full functional dependencies
AU - Wang, Hui
AU - Liu, Ruilin
PY - 2011/3
Y1 - 2011/3
N2 - Data publishing has generated much concern on individual privacy. Recent work has shown that different background knowledge can bring various threats to the privacy of published data. In this paper, we study the privacy threat from the full functional dependency (FFD) that is used as part of adversary knowledge. We show that the cross-attribute correlations by FFDs (e.g., Phone → Zipcode) can bring potential vulnerability. Unfortunately, none of the existing anonymization principles (e.g., k-anonymity, ℓ-diversity, etc.) can effectively prevent against an FFD-based privacy attack. We formalize the FFD-based privacy attack and define the privacy model, (d,ℓ)-inference, to combat the FD-based attack. We distinguish the safe FFDs that will not jeopardize privacy from the unsafe ones. We design robust algorithms that can efficiently anonymize the microdata with low information loss when the unsafe FFDs are present. The efficiency and effectiveness of our approach are demonstrated by the empirical study.
AB - Data publishing has generated much concern on individual privacy. Recent work has shown that different background knowledge can bring various threats to the privacy of published data. In this paper, we study the privacy threat from the full functional dependency (FFD) that is used as part of adversary knowledge. We show that the cross-attribute correlations by FFDs (e.g., Phone → Zipcode) can bring potential vulnerability. Unfortunately, none of the existing anonymization principles (e.g., k-anonymity, ℓ-diversity, etc.) can effectively prevent against an FFD-based privacy attack. We formalize the FFD-based privacy attack and define the privacy model, (d,ℓ)-inference, to combat the FD-based attack. We distinguish the safe FFDs that will not jeopardize privacy from the unsafe ones. We design robust algorithms that can efficiently anonymize the microdata with low information loss when the unsafe FFDs are present. The efficiency and effectiveness of our approach are demonstrated by the empirical study.
KW - Functional dependency
KW - Privacy-preserving data publishing
KW - Utility
UR - http://www.scopus.com/inward/record.url?scp=79951680959&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=79951680959&partnerID=8YFLogxK
U2 - 10.1016/j.datak.2010.11.002
DO - 10.1016/j.datak.2010.11.002
M3 - Article
AN - SCOPUS:79951680959
SN - 0169-023X
VL - 70
SP - 249
EP - 268
JO - Data and Knowledge Engineering
JF - Data and Knowledge Engineering
IS - 3
ER -