Discovering private trajectories using background information

  • Authors:
  • Emre Kaplan;Thomas B. Pedersen;Erkay Savaş;Yücel Saygın

  • Affiliations:
  • Faculty of Engineering and Natural Sciences, Sabancı University, Istanbul, Turkey;Faculty of Engineering and Natural Sciences, Sabancı University, Istanbul, Turkey;Faculty of Engineering and Natural Sciences, Sabancı University, Istanbul, Turkey;Faculty of Engineering and Natural Sciences, Sabancı University, Istanbul, Turkey

  • Venue:
  • Data & Knowledge Engineering
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

Trajectories are spatio-temporal traces of moving objects which contain valuable information to be harvested by spatio-temporal data mining techniques. Applications like city traffic planning, identification of evacuation routes, trend detection, and many more can benefit from trajectory mining. However, the trajectories of individuals often contain private and sensitive information, so anyone who possess trajectory data must take special care when disclosing this data. Removing identifiers from trajectories before the release is not effective against linkage type attacks, and rich sources of background information make it even worse. An alternative is to apply transformation techniques to map the given set of trajectories into another set where the distances are preserved. This way, the actual trajectories are not released, but the distance information can still be used for data mining techniques such as clustering. In this paper, we show that an unknown private trajectory can be reconstructed using the available background information together with the mutual distances released for data mining purposes. The background knowledge is in the form of known trajectories and extra information such as the speed limit. We provide analytical results which bound the number of the known trajectories needed to reconstruct private trajectories. Experiments performed on real trajectory data sets show that the number of known samples is surprisingly smaller than the actual theoretical bounds.