In-execution dynamic malware analysis and detection by mining information in process control blocks of Linux OS

Authors:
Farrukh Shahzad; M. Shahzad; Muddassar Farooq
Affiliations:
Next Generation Intelligent Networks Research Center (nexGIN RC), FAST National University of Computer and Emerging Sciences (FAST-NU), Islamabad, Pakistan (all three authors)
Venue:
Information Sciences: an International Journal
Year:
2013



Abstract

The run-time behavior of processes running on an end host is actively used to detect malware dynamically. Most such detection schemes build a model of a process's run-time behavior from its data flow and/or its sequence of system calls. These techniques have shown promising results, but an efficient and effective technique must meet four requirements: (1) high detection accuracy, (2) a low false alarm rate, (3) a short detection time, and (4) resilience to run-time evasion attempts. To meet these challenges, the novel concept of a genetic footprint is proposed: information mined from the kernel process control block (PCB) of a process that can be used to detect malicious processes at run time. The genetic footprint consists of selected parameters, maintained by the kernel inside the PCB of each running process, that define the semantics and behavior of an executing process. A systematic forensic study of the execution traces of benign and malware processes is performed to identify the discriminatory parameters of a PCB (in Linux, the PCB is task_struct). As a result, 16 of the 118 task structure parameters are shortlisted using time series analysis. A statistical analysis is then done to corroborate the features of the genetic footprint and to select suitable machine learning classifiers for detecting malware. The scheme has been evaluated on a dataset of 105 benign processes and 114 recently collected malware processes for Linux. The experimental results show that the presented scheme achieves a detection accuracy of 96% with a 0% false alarm rate in less than 100 ms after the start of a malicious activity. Moreover, the technique uses only the partial knowledge available at a given time while the process is still executing; as a result, the OS kernel can devise mitigation strategies. It is also shown that the presented technique is robust to well-known run-time evasion attempts.
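
The shortlisting step described in the abstract (keeping the PCB parameters whose time series best separate benign from malware processes) can be pictured with a small sketch. This is an illustration under stated assumptions, not the authors' method: the abstract does not say which time series or statistical technique was used, so a Fisher-style separation score is swapped in here. The parameter names `param_a` and `param_b` and all trace values are hypothetical toy data, not real task_struct fields or measurements.

```python
from statistics import mean, pstdev


def separation_score(benign_traces, malware_traces):
    """Fisher-style score for one parameter (an assumed stand-in metric).

    Each trace is a list of samples of that parameter over time. Every trace
    is collapsed to its mean, then the squared gap between the two class
    means is divided by the sum of the two class variances.
    """
    benign_means = [mean(trace) for trace in benign_traces]
    malware_means = [mean(trace) for trace in malware_traces]
    gap = (mean(benign_means) - mean(malware_means)) ** 2
    spread = pstdev(benign_means) ** 2 + pstdev(malware_means) ** 2
    if spread > 0:
        return gap / spread
    # No within-class variation: perfectly separated, or indistinguishable.
    return float("inf") if gap > 0 else 0.0


def rank_parameters(benign, malware, keep):
    """Return the `keep` parameter names with the highest separation score.

    `benign` and `malware` map a parameter name to a list of traces.
    """
    scores = {
        name: separation_score(benign[name], malware[name]) for name in benign
    }
    return sorted(scores, key=scores.get, reverse=True)[:keep]


# Hypothetical toy traces: two processes per class, three samples each.
benign = {
    "param_a": [[1, 1, 2], [2, 1, 1]],
    "param_b": [[5, 6, 5], [6, 5, 6]],
}
malware = {
    "param_a": [[9, 10, 9], [10, 9, 10]],
    "param_b": [[5, 5, 6], [6, 6, 5]],
}

shortlist = rank_parameters(benign, malware, keep=1)
print(shortlist)
```

In this toy data `param_a` behaves very differently across the two classes while `param_b` does not, so only `param_a` survives the shortlist. In the paper's setting the same idea would be applied to the 118 task structure parameters to arrive at the 16 that form the genetic footprint.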