Privacy-preserving indexing of documents on the network

Authors:
Mayank Bawa;Roberto J. Bayardo, Jr.;Rakesh Agrawal
Affiliations:
Stanford University, Stanford, CA;IBM Almaden Research Center, San Jose, CA;IBM Almaden Research Center, San Jose, CA
Venue:
VLDB '03 Proceedings of the 29th international conference on Very large data bases - Volume 29
Year:
2003

Citing 18
Cited 17

Security-control methods for statistical databases: a comparative study

ACM Computing Surveys (CSUR)
Multi party computations: past and present

PODC '97 Proceedings of the sixteenth annual ACM symposium on Principles of distributed computing
Protecting data privacy in private information retrieval schemes

STOC '98 Proceedings of the thirtieth annual ACM symposium on Theory of computing
Crowds: anonymity for Web transactions

ACM Transactions on Information and System Security (TISSEC)
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
GlOSS: text-source discovery over the Internet

ACM Transactions on Database Systems (TODS)
Formal Models for Computer Security

ACM Computing Surveys (CSUR)
Space/time trade-offs in hash coding with allowable errors

Communications of the ACM
Chord: A scalable peer-to-peer lookup service for internet applications

Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
YouServ: a web-hosting and content sharing tool for the masses

Proceedings of the 11th international conference on World Wide Web
Executing SQL over encrypted data in the database-service-provider model

Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Tarzan: a peer-to-peer anonymizing network layer

Proceedings of the 9th ACM conference on Computer and communications security
Building the IBM 4758 Secure Coprocessor

Computer
Make it fresh, make it quick: searching a network of personal webservers

WWW '03 Proceedings of the 12th international conference on World Wide Web
Private information retrieval

FOCS '95 Proceedings of the 36th Annual Symposium on Foundations of Computer Science
Anonymous Connections and Onion Routing

SP '97 Proceedings of the 1997 IEEE Symposium on Security and Privacy
Practical Techniques for Searches on Encrypted Data

SP '00 Proceedings of the 2000 IEEE Symposium on Security and Privacy
Symphony: distributed hashing in a small world

USITS'03 Proceedings of the 4th conference on USENIX Symposium on Internet Technologies and Systems - Volume 4

Privacy and Ownership Preserving of Outsourced Medical Data

ICDE '05 Proceedings of the 21st International Conference on Data Engineering
Privacy-preserving payload-based correlation for accurate malicious traffic detection

Proceedings of the 2006 SIGCOMM workshop on Large-scale attack defense
Preserving data privacy in outsourcing data aggregation services

ACM Transactions on Internet Technology (TOIT) - Special Issue on the Internet and Outsourcing
Privacy preserving decision tree learning over multiple parties

Data & Knowledge Engineering
Vision paper: enabling privacy for the paranoids

VLDB '04 Proceedings of the Thirtieth international conference on Very large data bases - Volume 30
Fast nGram-based string search over data encoded using algebraic signatures

VLDB '07 Proceedings of the 33rd international conference on Very large data bases
An agent-based approach for privacy-preserving recommender systems

Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems
Precomputation of privacy policy parameters for auditing SQL queries

Proceedings of the 2nd international conference on Ubiquitous information management and communication
Zerber: r-confidential indexing for distributed documents

EDBT '08 Proceedings of the 11th international conference on Extending database technology: Advances in database technology
Privacy preserving document indexing infrastructure for a distributed environment

Proceedings of the VLDB Endowment
Zerber+R: top-k retrieval from a confidential index

Proceedings of the 12th International Conference on Extending Database Technology: Advances in Database Technology
Search-as-a-service: Outsourced search over outsourced storage

ACM Transactions on the Web (TWEB)
Privacy-preserving similarity-based text retrieval

ACM Transactions on Internet Technology (TOIT)
Towards access control aware P2P data management systems

Proceedings of the 2009 EDBT/ICDT Workshops
Constructing distributed hippocratic video databases for privacy-preserving online patient training and counseling

IEEE Transactions on Information Technology in Biomedicine
Robust Record Linkage Blocking Using Suffix Arrays and Bloom Filters

ACM Transactions on Knowledge Discovery from Data (TKDD)
Privacy-Preserving graph algorithms in the semi-honest model

ASIACRYPT'05 Proceedings of the 11th international conference on Theory and Application of Cryptology and Information Security

Quantified Score

Hi-index	0.00

Visualization

Abstract

We address the problem of providing privacy-preserving search over distributed access-controlled content. Indexed documents can be easily reconstructed from conventional (inverted) indexes used in search. The need to avoid breaches of access-control through the index requires the index hosting site to be fully secured and trusted by by all participating content providers. This level of trust is impractical in the increasingly common case where multiple competing organizations or individuals wish to selectively share content. We propose a solution that eliminates the need of such a trusted authority. The solution builds a centralized privacy-preserving index in conjunction with a distributed access-control enforcing search protocol. The new index provides strong and quantifiable privacy guarantees that hold even if the entire index is made public. Experiments on a real-life dataset validate performance of the scheme. The appeal of our solution is two-fold: (a) Content providers maintain complete control in defining access groups and ensuring its compliance, and (b) System implementors retain tunable knobs to balance privacy and efficiency concerns for their particular domains.