Enabling access-privacy for random walk based data analysis applications

Authors:
Ping Lin;K. Selçuk Candan
Affiliations:
Department of Computer Science and Engineering, Arizona State University, Tempe, AZ 85287-5406, USA;Department of Computer Science and Engineering, Arizona State University, Tempe, AZ 85287-5406, USA
Venue:
Data & Knowledge Engineering
Year:
2007

Abstract

Random walk graph and Markov chain based models are used heavily in many data and system analysis domains, including web analysis, bioinformatics, and queuing. These models enable the description and analysis of various behaviors of stochastic systems. If the system being modeled has certain properties, such as irreducibility and aperiodicity, closed-form formulations of its stationary behavior can be used for the analysis. However, if the system does not have these properties, or if the user is not interested in the stationary behavior, an iterative approach is needed to determine potential outcomes from the initial probability distribution supplied as input to the model. In this paper, we focus on access-privacy enabled outsourced Markov chain based data analysis applications, where a non-trusted service provider takes (hidden) user queries that are described in terms of initial state distributions and evaluates them iteratively in an oblivious manner. We show that this iterative process can leak information about the possible values of the hidden input if the server has a priori knowledge about the underlying Markovian process. Hence, rather than relying on simple obfuscation mechanisms, we develop an algorithm based on the methodical addition of extra states, which guarantees unbounded feasible regions for the inputs and thus prevents a malicious host from making an informed guess about them. In particular, we show that if the underlying transition matrix is diagonalizable, we can compute the exact number of states needed for access-privacy, whereas if the matrix is non-diagonalizable, only a lower bound can be computed.
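
To make the iterative evaluation mentioned in the abstract concrete, the following is a minimal plaintext sketch in Python of how a query given as an initial state distribution could be advanced step by step through a transition matrix. It is an illustration only and not the paper's access-private algorithm: the 2-state matrix `P`, the starting distribution `start`, and the helper names `step` and `iterate` are all assumptions introduced here, and no hiding of the input or addition of extra states is attempted.

```python
# Illustrative sketch only: plain (non-private) iterative evaluation of a
# Markov chain. The matrix, the start vector, and the function names are
# hypothetical and are not taken from the paper.

def step(dist, P):
    """Advance a row-vector distribution by one transition: dist' = dist * P."""
    n = len(P)
    return [sum(dist[i] * P[i][j] for i in range(n)) for j in range(n)]


def iterate(dist, P, steps):
    """Apply `steps` transitions to an initial distribution."""
    for _ in range(steps):
        dist = step(dist, P)
    return dist


# Hypothetical 2-state chain; each row sums to 1. It is irreducible and
# aperiodic, so repeated iteration approaches its stationary distribution.
P = [[0.9, 0.1],
     [0.5, 0.5]]

start = [1.0, 0.0]                    # the user's query: an initial distribution
after_one = iterate(start, P, 1)      # outcome after a single step
after_many = iterate(start, P, 200)   # close to the stationary distribution

print(after_one)
print(after_many)
```

For this particular chain, the result after many steps no longer depends on the starting distribution, which is the stationary case the abstract says can be handled in closed form. The outcome after a small number of steps does depend on the input, which is the case where iterative evaluation is needed.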