Differentially private multidimensional data publishing
Knowledge and Information Systems
  • Khalil Al-Hussaeni, Concordia University
  • Benjamin C.M. Fung, McGill University
  • Farkhund Iqbal, Zayed University
  • Junqiang Liu, Zhejiang Gongshang University
  • Patrick C.K. Hung, Faculty of Business and Information Technology
ORCID Identifiers

0000-0001-8423-2906

Document Type
Article
Publication Date
9-1-2018
Abstract

© 2017, Springer-Verlag London Ltd., part of Springer Nature. Organizations collect data about individuals for various reasons, such as service improvement. To mine the collected data for useful information, data publishing has become a common practice among those organizations and data analysts, research institutes, or simply the general public. The quality of published data significantly affects the accuracy of the data analysis and thus affects decision making at the corporate level. In this study, we explore the research area of privacy-preserving data publishing, i.e., publishing high-quality data without compromising the privacy of the individuals whose data are being published. Syntactic privacy models, such as k-anonymity, impose syntactic privacy requirements and make certain assumptions about an adversary’s background knowledge. To address this shortcoming, we adopt differential privacy, a rigorous privacy model that is independent of any adversary’s knowledge and insensitive to the underlying data. The published data should preserve individuals’ privacy, yet remain useful for analysis. To maintain data utility, we propose DiffMulti, a workload-aware and differentially private algorithm that employs multidimensional generalization. We devise an efficient implementation of the proposed algorithm and use a real-life data set for experimental analysis. We evaluate the performance of our method in terms of data utility, efficiency, and scalability. When compared to closely related existing methods, DiffMulti significantly improved data utility, in some cases by orders of magnitude.

Publisher
Springer London
Keywords
  • Data sharing
  • Differential privacy
  • Multidimensional generalization
  • Privacy protection
Scopus ID
85035141875
Indexed in Scopus
Yes
Open Access
No
https://doi.org/10.1007/s10115-017-1132-3
Citation Information
Khalil Al-Hussaeni, Benjamin C.M. Fung, Farkhund Iqbal, Junqiang Liu, et al. "Differentially private multidimensional data publishing" Knowledge and Information Systems Vol. 56 Iss. 3 (2018) p. 717-752 ISSN: 0219-3116
Available at: http://works.bepress.com/farkhund-iqbal/161/