Privacy-preserving genomic computation through program specialization

Authors:
Rui Wang; XiaoFeng Wang; Zhou Li; Haixu Tang; Michael K. Reiter; Zheng Dong
Affiliations:
Indiana University Bloomington, Bloomington, IN, USA;Indiana University Bloomington, Bloomington, IN, USA;Indiana University Bloomington, Bloomington, IN, USA;Indiana University Bloomington, Bloomington, IN, USA;University of North Carolina at Chapel Hill, Chapel Hill, NC, USA;Indiana University Bloomington, Bloomington, IN, USA
Venue:
Proceedings of the 16th ACM conference on Computer and communications security
Year:
2009



Abstract

In this paper, we present a new approach to performing important classes of genomic computations (e.g., searching for homologous genes) that takes a significant step towards privacy protection in this domain. Our approach leverages a key property of the human genome: the vast majority of it is shared across humans (and hence public), so relatively little of it is sensitive. Based on this observation, we propose a privacy-protection framework that partitions a genomic computation, assigning the part that operates on sensitive data to the data provider and the part that operates on public data to the user of the data. The partition is achieved through program specialization, which lets a biocomputing program execute concretely on public data and symbolically on sensitive data. As a result, the program is simplified into an efficient query program that takes only sensitive genetic data as input. We prove the effectiveness of our techniques on a set of dynamic programming algorithms common in genomic computing. We develop a program transformation tool that automatically instruments a legacy program for specialization operations. We also demonstrate that our techniques can greatly facilitate secure multi-party computation on large biocomputing problems.
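
The idea of specializing a program on public data can be illustrated with a toy example. The sketch below is not the paper's tool or one of its dynamic programming algorithms; it is a deliberately simplified stand-in (a position-wise mismatch count) meant only to show the split between concrete and symbolic evaluation. The function names `specialize_mismatch_count` and `run_query`, and the convention of marking sensitive positions with `None`, are assumptions made for illustration.

```python
from typing import Dict, List, Optional, Tuple

# Residual term: (position, reference nucleotide) whose comparison is deferred.
Residual = List[Tuple[int, str]]


def specialize_mismatch_count(
    reference: str,
    public_genome: List[Optional[str]],
) -> Tuple[int, Residual]:
    """Partially evaluate a mismatch count on the data user's side.

    public_genome[i] holds a nucleotide where the genome is public and
    None where it is sensitive (hypothetical convention for this sketch).
    Public positions are evaluated concretely and folded into a constant;
    sensitive positions stay symbolic and are collected as residual terms.
    """
    if len(reference) != len(public_genome):
        raise ValueError("reference and genome must have the same length")

    constant = 0
    residual: Residual = []
    for pos, (ref_base, base) in enumerate(zip(reference, public_genome)):
        if base is None:
            residual.append((pos, ref_base))  # symbolic: defer to the provider
        elif base != ref_base:
            constant += 1  # concrete: resolved now
    return constant, residual


def run_query(constant: int, residual: Residual, sensitive: Dict[int, str]) -> int:
    """Evaluate the residual query on the data provider's side.

    Only the sensitive nucleotides are needed as input here.
    """
    return constant + sum(
        1 for pos, ref_base in residual if sensitive[pos] != ref_base
    )


if __name__ == "__main__":
    reference = "ACGTACGT"
    # Positions 2 and 5 are sensitive and hidden from the data user.
    genome_public_view = ["A", "C", None, "T", "A", None, "G", "A"]
    constant, residual = specialize_mismatch_count(reference, genome_public_view)
    # The provider supplies the sensitive nucleotides to the small query.
    print(run_query(constant, residual, {2: "C", 5: "G"}))
```

In this sketch the data user does all the work on public positions once, and the provider evaluates a query whose size depends only on the number of sensitive positions. Real dynamic programming algorithms have dependencies between cells, so their residual programs are more involved than this flat sum.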