Incorporating privacy concerns in data mining on distributed data

  • Authors:
  • Hui-zhang Shen;Ji-di Zhao;Ruipu Yao

  • Affiliations:
  • Aetna School of Management, Shanghai Jiao Tong University, Shanghai, P.R. China;Aetna School of Management, Shanghai Jiao Tong University, Shanghai, P.R. China;School of Information Engineering, Tianjin University of Commerce, Tianjin, P.R. China

  • Venue:
  • AIMSA'06 Proceedings of the 12th international conference on Artificial Intelligence: methodology, Systems, and Applications
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Data mining, with its objective to efficiently discover valuable and inherent information from large databases, is particularly sensitive to misuse. Therefore an interesting new direction for data mining research is the development of techniques that incorporate privacy concerns and to develop accurate models without access to precise information in individual data records. The difficulty lies in the fact that the two metrics for evaluating privacy preserving data mining methods: privacy and accuracy are typically contradictory in nature. We address privacy preserving mining on distributed data in this paper and present an algorithm, based on the combination of probabilistic approach and cryptographic approach, to protect high privacy of individual information and at the same time acquire a high level of accuracy in the mining result.