Tainted flow analysis on e-SSA-form programs

Authors:
Andrei Rimsa;Marcelo d'Amorim;Fernando Magno Quintão Pereira
Affiliations:
UFMG-Belo Horizonte, Brazil;UFPE - Recife, Brazil;UFPE - Recife, Brazil
Venue:
CC'11/ETAPS'11 Proceedings of the 20th international conference on Compiler construction: part of the joint European conferences on theory and practice of software
Year:
2011

Citing 23
Cited 2

Automatic construction of sparse data flow evaluation graphs

POPL '91 Proceedings of the 18th ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Efficiently computing static single assignment form and the control dependence graph

ACM Transactions on Programming Languages and Systems (TOPLAS)
Precise interprocedural dataflow analysis via graph reachability

POPL '95 Proceedings of the 22nd ACM SIGPLAN-SIGACT symposium on Principles of programming languages
Efficient inference of object types

Information and Computation
ABCD: eliminating array bounds checks on demand

PLDI '00 Proceedings of the ACM SIGPLAN 2000 conference on Programming language design and implementation
A fast algorithm for finding dominators in a flowgraph

ACM Transactions on Programming Languages and Systems (TOPLAS)
Flow-sensitive type qualifiers

PLDI '02 Proceedings of the ACM SIGPLAN 2002 Conference on Programming language design and implementation
Modern Compiler Implementation in Java

Effective Representation of Aliases and Indirect Memory Operations in SSA Form

CC '96 Proceedings of the 6th International Conference on Compiler Construction
Program slicing

ICSE '81 Proceedings of the 5th international conference on Software engineering
Specifying and Enforcing Application-Level Web Security Policies

IEEE Transactions on Knowledge and Data Engineering
Trust in the λ-calculus

Journal of Functional Programming
Pixy: A Static Analysis Tool for Detecting Web Application Vulnerabilities (Short Paper)

SP '06 Proceedings of the 2006 IEEE Symposium on Security and Privacy
Precise alias analysis for static detection of web application vulnerabilities

Proceedings of the 2006 workshop on Programming languages and analysis for security
Sound and precise analysis of web applications for injection vulnerabilities

Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation
Thin slicing

Proceedings of the 2007 ACM SIGPLAN conference on Programming language design and implementation
Static detection of security vulnerabilities in scripting languages

USENIX-SS'06 Proceedings of the 15th conference on USENIX Security Symposium - Volume 15
A practical solution for scripting language compilers

Proceedings of the 2009 ACM symposium on Applied Computing
Staged information flow for JavaScript

Proceedings of the 2009 ACM SIGPLAN conference on Programming language design and implementation
TAJ: effective taint analysis of web applications

Proceedings of the 2009 ACM SIGPLAN conference on Programming language design and implementation
User-input dependence analysis via graph reachability

Precise analysis of string expressions

SAS'03 Proceedings of the 10th international conference on Static analysis
Interprocedural analysis for privileged code placement and tainted variable detection

ECOOP'05 Proceedings of the 19th European conference on Object-Oriented Programming

Program analysis scenarios in Rascal

WRLA'12 Proceedings of the 9th international conference on Rewriting Logic and Its Applications
Speed and precision in range analysis

SBLP'12 Proceedings of the 16th Brazilian conference on Programming Languages

Abstract

Tainted flow attacks originate from program inputs maliciously crafted to exploit software vulnerabilities. These attacks are common in server-side scripting languages, such as PHP. In 1997, Ørbæk and Palsberg formalized the problem of detecting these exploits as an instance of type-checking, and gave an O(V^3) algorithm to solve it, where V is the number of program variables. A similar algorithm was, ten years later, implemented in the Pixy tool. In this paper we give an O(V^2) solution to the same problem. Our solution uses Bodik et al.'s extended Static Single Assignment (e-SSA) program representation. The e-SSA form can be efficiently computed, and it enables us to solve the problem via a sparse data-flow analysis. Using the same infrastructure, we compared a state-of-the-art data-flow solution with our technique. Both approaches detected 36 vulnerabilities in well-known PHP programs. Our results show that our approach tends to outperform the data-flow algorithm for bigger inputs. We have reported the bugs that we found, and an implementation of our algorithm is now publicly available.
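
To make the idea of a sparse taint analysis concrete, the following is a minimal sketch in Python. It is not the authors' algorithm: it assumes the program has already been converted to an SSA-like form in which every value has a unique name, and it models the program as a hypothetical list of def-use edges. The function name `tainted_sinks`, the variable names, and the example program are all illustrative assumptions. Taint is propagated from sources along def-use edges, names produced by a sanitizer block the flow, and any sink that is still reached is reported.

```python
from collections import deque


def tainted_sinks(edges, sources, sanitizers, sinks):
    """Return the sinks reachable from a source without crossing a sanitizer.

    edges      -- iterable of (definition, use) pairs between SSA-like names
    sources    -- names that hold untrusted input
    sanitizers -- names whose value is the output of a sanitizing function
    sinks      -- names that represent sensitive operations
    """
    # Build the successor map once; the traversal only follows def-use edges,
    # which is what makes the analysis sparse.
    succ = {}
    for definition, use in edges:
        succ.setdefault(definition, []).append(use)

    tainted = set()
    work = deque(s for s in sources if s not in sanitizers)
    while work:
        name = work.popleft()
        if name in tainted:
            continue
        tainted.add(name)
        for nxt in succ.get(name, ()):
            # A sanitized name never becomes tainted, so flow stops there.
            if nxt not in sanitizers and nxt not in tainted:
                work.append(nxt)
    return tainted & set(sinks)


# Hypothetical program in SSA-like form (names are assumptions):
#   x0 = $_GET['q']               source
#   x1 = htmlspecialchars(x0)     sanitizer output
#   x2 = phi(x0, x1)              merge of both paths
#   echo x1                       sink_a
#   echo x2                       sink_b
edges = [
    ("x0", "x1"),
    ("x0", "x2"),
    ("x1", "x2"),
    ("x1", "sink_a"),
    ("x2", "sink_b"),
]
result = tainted_sinks(edges, {"x0"}, {"x1"}, {"sink_a", "sink_b"})
print(sorted(result))  # ['sink_b']
```

In this sketch each name is marked tainted at most once and each edge is followed at most once, so the cost is linear in the size of the def-use graph. Only `sink_b` is reported, because the unsanitized value `x0` reaches it through the merge `x2`, while `sink_a` only sees the sanitized value `x1`.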