Privacy-preserving publishing microdata with full functional dependencies

Hui Wang, Ruilin Liu

Research output: Contribution to journalArticlepeer-review

30 Scopus citations

Abstract

Data publishing has generated much concern on individual privacy. Recent work has shown that different background knowledge can bring various threats to the privacy of published data. In this paper, we study the privacy threat from the full functional dependency (FFD) that is used as part of adversary knowledge. We show that the cross-attribute correlations by FFDs (e.g., Phone → Zipcode) can bring potential vulnerability. Unfortunately, none of the existing anonymization principles (e.g., k-anonymity, ℓ-diversity, etc.) can effectively prevent against an FFD-based privacy attack. We formalize the FFD-based privacy attack and define the privacy model, (d,ℓ)-inference, to combat the FD-based attack. We distinguish the safe FFDs that will not jeopardize privacy from the unsafe ones. We design robust algorithms that can efficiently anonymize the microdata with low information loss when the unsafe FFDs are present. The efficiency and effectiveness of our approach are demonstrated by the empirical study.

Original languageEnglish
Pages (from-to)249-268
Number of pages20
JournalData and Knowledge Engineering
Volume70
Issue number3
DOIs
StatePublished - Mar 2011

Keywords

  • Functional dependency
  • Privacy-preserving data publishing
  • Utility

Fingerprint

Dive into the research topics of 'Privacy-preserving publishing microdata with full functional dependencies'. Together they form a unique fingerprint.

Cite this