A supervised topic transition model for detecting malicious system call sequences

  • Authors:
  • Han Xiao;Thomas Stibor

  • Affiliations:
  • Technische Universität München, Garching, Germany;Technische Universität München, Garching, Germany

  • Venue:
  • Proceedings of the 2011 workshop on Knowledge discovery, modeling and simulation
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

We propose a probabilistic model for behavior-based malware detection that jointly models sequential data and class labels. Given labeled sequences (harmless/malicious), our goal is to reveal behavior patterns and exploit them to predict class labels of unknown sequences. The proposed model is a novel extension of supervised latent Dirichlet allocation with an estimation algorithm that alternates between Gibbs sampling and gradient descent. Experiments on real-world data set show that our model can learn meaningful patterns, and provides competitive performance on the malware detection task. Moreover, we parallelize the training algorithm and demonstrate scalability with varying numbers of processors.