Structural detection of android malware using embedded call graphs

Authors:
Hugo Gascon;Fabian Yamaguchi;Daniel Arp;Konrad Rieck
Affiliations:
University of Göttingen, Göttingen, Germany;University of Göttingen, Göttingen, Germany;University of Göttingen, Göttingen, Germany;University of Göttingen, Göttingen, Germany
Venue:
Proceedings of the 2013 ACM workshop on Artificial intelligence and security
Year:
2013

Citing 32
Cited 0

Diffusion Kernels on Graphs and Other Discrete Input Spaces

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
Protein function prediction via graph kernels

Bioinformatics
2005 Speical Issue: Graph kernels for chemical informatics

Neural Networks - Special issue on neural networks and kernel methods for structured domains
GPLAG: detection of software plagiarism by program dependence graph analysis

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
LIBLINEAR: A Library for Large Linear Classification

The Journal of Machine Learning Research
On lightweight mobile phone application certification

Proceedings of the 16th ACM conference on Computer and communications security
Large-scale malware indexing using function-call graphs

Proceedings of the 16th ACM conference on Computer and communications security
A Linear-Time Graph Kernel

ICDM '09 Proceedings of the 2009 Ninth IEEE International Conference on Data Mining
Effective and efficient malware detection at the end host

SSYM'09 Proceedings of the 18th conference on USENIX security symposium
Classification of malware using structured control flow

AusPDC '10 Proceedings of the Eighth Australasian Symposium on Parallel and Distributed Computing - Volume 107
Paranoid Android: versatile protection for smartphones

Proceedings of the 26th Annual Computer Security Applications Conference
TaintDroid: an information-flow tracking system for realtime privacy monitoring on smartphones

OSDI'10 Proceedings of the 9th USENIX conference on Operating systems design and implementation
A study of android application security

SEC'11 Proceedings of the 20th USENIX conference on Security
A survey of mobile malware in the wild

Proceedings of the 1st ACM workshop on Security and privacy in smartphones and mobile devices
Android permissions demystified

Proceedings of the 18th ACM conference on Computer and communications security
Graph-based malware detection using dynamic analysis

Journal in Computer Virology
Malware classification based on call graph clustering

Journal in Computer Virology
Malware Variant Detection Using Similarity Search over Sets of Control Flow Graphs

TRUSTCOM '11 Proceedings of the 2011IEEE 10th International Conference on Trust, Security and Privacy in Computing and Communications
Detecting repackaged smartphone applications in third-party android marketplaces

Proceedings of the second ACM conference on Data and Application Security and Privacy
Polymorphic worm detection using structural information of executables

RAID'05 Proceedings of the 8th international conference on Recent Advances in Intrusion Detection
Android permissions: a perspective combining risks and benefits

Proceedings of the 17th ACM symposium on Access Control Models and Technologies
RiskRanker: scalable and accurate zero-day android malware detection

Proceedings of the 10th international conference on Mobile systems, applications, and services
Dissecting Android Malware: Characterization and Evolution

SP '12 Proceedings of the 2012 IEEE Symposium on Security and Privacy
DroidScope: seamlessly reconstructing the OS and Dalvik semantic views for dynamic Android malware analysis

Security'12 Proceedings of the 21st USENIX conference on Security symposium
DroidMat: Android Malware Detection through Manifest and API Calls Tracing

ASIAJCIS '12 Proceedings of the 2012 Seventh Asia Joint Conference on Information Security
CHEX: statically vetting Android apps for component hijacking vulnerabilities

Proceedings of the 2012 ACM conference on Computer and communications security
Using probabilistic generative models for ranking risks of Android apps

Proceedings of the 2012 ACM conference on Computer and communications security
Generalized vulnerability extrapolation using abstract syntax trees

Proceedings of the 28th Annual Computer Security Applications Conference
AppsPlayground: automatic security analysis of smartphone applications

Proceedings of the third ACM conference on Data and application security and privacy
Juxtapp: a scalable system for detecting code reuse among android applications

DIMVA'12 Proceedings of the 9th international conference on Detection of Intrusions and Malware, and Vulnerability Assessment
iBinHunt: binary hunting with inter-procedural control flow

ICISC'12 Proceedings of the 15th international conference on Information Security and Cryptology
On the effectiveness of API-level access control using bytecode rewriting in Android

Proceedings of the 8th ACM SIGSAC symposium on Information, computer and communications security

Quantified Score

Hi-index	0.00

Visualization

Abstract

The number of malicious applications targeting the Android system has literally exploded in recent years. While the security community, well aware of this fact, has proposed several methods for detection of Android malware, most of these are based on permission and API usage or the identification of expert features. Unfortunately, many of these approaches are susceptible to instruction level obfuscation techniques. Previous research on classic desktop malware has shown that some high level characteristics of the code, such as function call graphs, can be used to find similarities between samples while being more robust against certain obfuscation strategies. However, the identification of similarities in graphs is a non-trivial problem whose complexity hinders the use of these features for malware detection. In this paper, we explore how recent developments in machine learning classification of graphs can be efficiently applied to this problem. We propose a method for malware detection based on efficient embeddings of function call graphs with an explicit feature map inspired by a linear-time graph kernel. In an evaluation with 12,158 malware samples our method, purely based on structural features, outperforms several related approaches and detects 89% of the malware with few false alarms, while also allowing to pin-point malicious code structures within Android applications.