Improving Microaggregation for Complex Record Anonymization

  • Authors:
  • Jordi Pont-Tuset;Jordi Nin;Pau Medrano-Gracia;Josep Ll. Larriba-Pey;Victor Muntés-Mulero

  • Affiliations:
  • DAMA-UPC, Computer Architecture Dept., Universitat Politècnica de Catalunya, Barcelona, Spain 08034;IIIA, Artificial Intelligence Research Institute CSIC, Spanish National Research Council, Bellaterra, Spain 08193;DAMA-UPC, Computer Architecture Dept., Universitat Politècnica de Catalunya, Barcelona, Spain 08034;DAMA-UPC, Computer Architecture Dept., Universitat Politècnica de Catalunya, Barcelona, Spain 08034;DAMA-UPC, Computer Architecture Dept., Universitat Politècnica de Catalunya, Barcelona, Spain 08034

  • Venue:
  • MDAI '08 Sabadell Proceedings of the 5th International Conference on Modeling Decisions for Artificial Intelligence
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

Microaggregation is one of the most commonly employed microdata protection methods. This method builds clusters of at least koriginal records and replaces the records in each cluster with the centroid of the cluster. Usually, when records are complex, i.e., the number of attributes of the data set is large, this data set is split into smaller blocks of attributes and microaggregation is applied to each block, successively and independently. In this way, the information loss when collapsing several values to the centroid of their group is reduced, at the cost of losing the k-anonymity property when at least two attributes of different blocks are known by the intruder.In this work, we present a new microaggregation method called One dimension microaggregation(Mic1D茂戮驴 茂戮驴). This method gathers all the values of the data set into a single sorted vector, independently of the attribute they belong to. Then, it microaggregates all the mixed values together. Our experiments show that, using real data, our proposal obtains lower disclosure risk than previous approaches whereas the information loss is preserved.