Satisfying Privacy Requirements Before Data Anonymization

Authors:
Xiaoxun Sun;Hua Wang;Jiuyong Li;Yanchun Zhang
Affiliations:
-;-;-;-
Venue:
The Computer Journal
Year:
2012

Citing 0
Cited 3

Mapping semantic knowledge for unsupervised text categorisation

ADC '13 Proceedings of the Twenty-Fourth Australasian Database Conference - Volume 137
A self-stabilizing algorithm for finding a minimal positive influence dominating set in social networks

ADC '13 Proceedings of the Twenty-Fourth Australasian Database Conference - Volume 137
Role-based access control to outsourced data in cloud computing

ADC '13 Proceedings of the Twenty-Fourth Australasian Database Conference - Volume 137

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we study a problem of protecting privacy of individuals in large public survey rating data. We propose a novel (k,ε, l)-anonymity model to protect privacy in large survey rating data, in which each survey record is required to be similar to at least k-1 other records based on the non-sensitive ratings, where the similarity is controlled by ε, and the standard deviation of sensitive ratings is at least l. We study an interesting yet non-trivial satisfaction problem of the proposed model, which is to decide whether a survey rating data set satisfies the privacy requirements given by the user. For this problem, we investigate its inherent properties theoretically, and devise a novel slicing technique to solve it. We analyze the computation complexity of the proposed slicing technique and conduct extensive experiments on two real-life data sets, and the results show that the slicing technique is fast and scalable with data size and much more efficient in terms of execution time and space overhead than the heuristic pairwise method.