A Labeled Data Set for Flow-Based Intrusion Detection

  • Authors:
  • Anna Sperotto;Ramin Sadre;Frank Vliet;Aiko Pras

  • Affiliations:
  • Centre for Telematics and Information Technology Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, Enschede, The Netherlands 7500 AE;Centre for Telematics and Information Technology Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, Enschede, The Netherlands 7500 AE;Centre for Telematics and Information Technology Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, Enschede, The Netherlands 7500 AE;Centre for Telematics and Information Technology Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, Enschede, The Netherlands 7500 AE

  • Venue:
  • IPOM '09 Proceedings of the 9th IEEE International Workshop on IP Operations and Management
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Flow-based intrusion detection has recently become a promising security mechanism in high speed networks (1-10 Gbps). Despite the richness in contributions in this field, benchmarking of flow-based IDS is still an open issue. In this paper, we propose the first publicly available, labeled data set for flow-based intrusion detection. The data set aims to be realistic , i.e., representative of real traffic and complete from a labeling perspective. Our goal is to provide such enriched data set for tuning, training and evaluating ID systems. Our setup is based on a honeypot running widely deployed services and directly connected to the Internet, ensuring attack-exposure. The final data set consists of 14.2M flows and more than 98% of them has been labeled.