Automatic classification of security messages based on text categorization

  • Authors:
  • Fatiha Benali;Stéphane Ubéda;Véronique Legrand

  • Affiliations:
  • ARES INRIA/CITI, INSA, Lyon, France;ARES INRIA/CITI, INSA, Lyon, France;ARES INRIA/CITI, INSA, Lyon, France

  • Venue:
  • NOTERE '08 Proceedings of the 8th international conference on New technologies in distributed systems
  • Year:
  • 2008

Quantified Score

Hi-index 0.00

Visualization

Abstract

The generated messages by the security devices are the necessary data for the detection of the malicious activities in an information system. The heterogeneity of the devices and the lack of a standard for the security messages make the automatic processing of the messages difficult. The messages are short, use a very wide vocabulary and have different formats. We propose in this article the application of the text categorization technics for the automatic classification of security log files messages, in categories defined by an ontology. We develop an extraction module for the message attributes to reduce the vocabulary size. Then we apply two training algorithms: the k-nearest neighbour algorithm and the naive bayes, on two corpus of security log messages.