New Malicious Code Detection Based on N-Gram Analysis and Rough Set Theory

  • Authors:
  • Boyun Zhang;Jianping Yin;Jingbo Hao;Shulin Wang;Dingxing Zhang

  • Affiliations:
  • School of Computer Science, National University of Defense Technolgy, Changsha 410073, China and Department of Computer Science, Hunan Public Security College, Changsha 410138, China;School of Computer Science, National University of Defense Technolgy, Changsha 410073, China;School of Computer Science, National University of Defense Technolgy, Changsha 410073, China;School of Computer Science, National University of Defense Technolgy, Changsha 410073, China;School of Computer Science, National University of Defense Technolgy, Changsha 410073, China

  • Venue:
  • Computational Intelligence and Security
  • Year:
  • 2007

Quantified Score

Hi-index 0.01

Visualization

Abstract

Motivated by the standard signature-based technique for detecting viruses, we explore the idea of automatically detecting malicious code using the N-gram analysis. The method is based on statistical learning and not strictly dependent on certain viruses. We propose the use of rough set theory to reduce the feature dimension. An efficient implementation to calculate relative core, based on positive region definition is presented also. The k nearest neighbor and support vector machine classifiers are used to categorize a program as either normal or abnormal. The experimental results are promising and show that the proposed scheme results in low rate of false positive.