Automatic feature selection for anomaly detection

  • Authors:
  • Marius Kloft; Ulf Brefeld; Patrick Düessel; Christian Gehl; Pavel Laskov

  • Affiliations:
  • TU Berlin, Berlin, Germany; TU Berlin, Berlin, Germany; Fraunhofer Institute FIRST, Berlin, Germany; Fraunhofer Institute FIRST, Berlin, Germany; Fraunhofer Institute FIRST, Berlin, Germany

  • Venue:
  • Proceedings of the 1st ACM Workshop on AISec
  • Year:
  • 2008

Abstract

A frequent problem in anomaly detection is to decide among different feature sets to be used. For example, various features are known in network intrusion detection based on packet headers, content byte streams or application level protocol parsing. A method for automatic feature selection in anomaly detection is proposed which determines optimal mixture coefficients for various sets of features. The method generalizes the support vector data description (SVDD) and can be expressed as a semi-infinite linear program that can be solved with standard techniques. The case of a single feature set can be handled as a particular case of the proposed method. The experimental evaluation of the new method on unsanitized HTTP data demonstrates that detectors using automatically selected features attain competitive performance, while sparing practitioners from a priori decisions on feature sets to be used.
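
For orientation, the setting described in the abstract can be sketched in formulas. The first problem is the standard SVDD, which encloses the data in a ball of radius R around a center c in feature space. The mixture kernel and its simplex constraint further down are an assumption about how "mixture coefficients for various sets of features" might be written; the symbols (\beta_j, k_j, M, C, \xi_i) are illustrative notation and are not taken from the paper.

```latex
% Standard SVDD: smallest enclosing ball with slack variables
\begin{aligned}
\min_{R,\, c,\, \xi} \quad & R^2 + C \sum_{i=1}^{n} \xi_i \\
\text{s.t.} \quad & \lVert \phi(x_i) - c \rVert^2 \le R^2 + \xi_i, \qquad \xi_i \ge 0, \quad i = 1, \dots, n
\end{aligned}

% Assumed form of the feature mixture: one kernel k_j per feature set,
% combined with nonnegative coefficients beta_j
k_{\beta}(x, x') = \sum_{j=1}^{M} \beta_j \, k_j(x, x'),
\qquad \beta_j \ge 0, \qquad \sum_{j=1}^{M} \beta_j = 1

% Anomaly score of a new point x: positive values fall outside the ball
f(x) = \lVert \phi_{\beta}(x) - c \rVert^2 - R^2
```

Under this reading, M = 1 with \beta_1 = 1 recovers the ordinary SVDD, which matches the abstract's remark that a single feature set is a particular case of the method.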