Private client-side profiling with random forests and hidden Markov models

Authors:
George Danezis; Markulf Kohlweiss; Benjamin Livshits; Alfredo Rial
Affiliations:
Microsoft Research; Microsoft Research; Microsoft Research; IBBT and KU Leuven, ESAT-COSIC, Belgium
Venue:
PETS'12 Proceedings of the 12th international conference on Privacy Enhancing Technologies
Year:
2012

Abstract

Service providers now gather fine-grained data about users in order to deliver personalized services, for example through third-party cookies or social network profiles. This threatens both privacy, since the amount of information obtained is excessive for the purpose of customization, and authenticity, since the methods used to gather the data can be blocked or fooled. In this paper we propose privacy-preserving profiling techniques in which users perform the profiling task locally, reveal the result to service providers, and prove its correctness. We address how our approach applies to both classification and pattern recognition tasks. For the former, we describe client-side profiling based on random forests: users resolve a random forest on certified input data representing their activity and reveal the classification result to service providers. For the latter, we show how to match a stream of user activity against a regular expression, or how to assign it a probability using a hidden Markov model. Our techniques are based on zero-knowledge proofs and can be composed with other protocols as part of the certification of a larger computation.
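
To make the two profiling tasks concrete, the following is a minimal Python sketch of the plain computations a user might run locally: resolving a random forest on a feature vector, and scoring an activity stream with a hidden Markov model via the standard forward algorithm. It deliberately omits the certification of inputs and the zero-knowledge proofs of correctness, which are the actual contribution of the paper. The tree encoding, the majority-vote aggregation, the function names, and all example values are assumptions made for illustration and are not taken from the paper.

```python
from collections import Counter

# Hypothetical tree encoding (an assumption, not the paper's format):
#   ("leaf", label)
#   ("branch", feature_index, threshold, left_subtree, right_subtree)

def resolve_tree(node, features):
    """Walk one decision tree down to a leaf and return its label."""
    while node[0] == "branch":
        _, index, threshold, left, right = node
        node = left if features[index] <= threshold else right
    return node[1]

def resolve_forest(forest, features):
    """Classify by majority vote over the trees (assumed aggregation rule)."""
    votes = Counter(resolve_tree(tree, features) for tree in forest)
    return votes.most_common(1)[0][0]

def hmm_sequence_probability(initial, transition, emission, observations):
    """Forward algorithm: probability that the HMM emits `observations`."""
    if not observations:
        return 1.0  # the empty sequence is emitted with certainty
    states = range(len(initial))
    # alpha[s] = P(observations seen so far, current hidden state = s)
    alpha = [initial[s] * emission[s][observations[0]] for s in states]
    for symbol in observations[1:]:
        alpha = [
            sum(alpha[p] * transition[p][s] for p in states) * emission[s][symbol]
            for s in states
        ]
    return sum(alpha)

# Illustrative inputs: three one-level trees over two activity features.
forest = [
    ("branch", 0, 5, ("leaf", "casual"), ("leaf", "frequent")),
    ("branch", 1, 2, ("leaf", "casual"), ("leaf", "frequent")),
    ("branch", 0, 10, ("leaf", "casual"), ("leaf", "frequent")),
]
features = [7, 3]
profile = resolve_forest(forest, features)

# Illustrative two-state HMM over a two-symbol activity alphabet.
initial = [0.6, 0.4]
transition = [[0.7, 0.3], [0.4, 0.6]]
emission = [[0.5, 0.5], [0.1, 0.9]]
probability = hmm_sequence_probability(initial, transition, emission, [0, 1])
```

In the setting the abstract describes, the user would reveal a value such as `profile` or `probability` to the service provider together with a proof that it was computed correctly from certified inputs; this sketch shows only the underlying computation being proved.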