Automated Traffic Classification and Application Identification using Machine Learning

  • Authors:
  • Sebastian Zander;Thuy Nguyen;Grenville Armitage

  • Affiliations:
  • Swinburne University of Technology, Melbourne;Swinburne University of Technology, Melbourne;Swinburne University of Technology, Melbourne

  • Venue:
  • LCN '05 Proceedings of the The IEEE Conference on Local Computer Networks 30th Anniversary
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

The dynamic classification and identification of network applications responsible for network traffic flows offers substantial benefits to a number of key areas in IP network engineering, management and surveillance. Currently such classifications rely on selected packet header fields (e.g. port numbers) or application layer protocol decoding. These methods have a number of shortfalls e.g. many applications can use unpredictable port numbers and protocol decoding requires a high amount of computing resources or is simply infeasible in case protocols are unknown or encrypted. We propose a novel method for traffic classification and application identification using an unsupervised machine learning technique. Flows are automatically classified based on statistical flow characteristics. We evaluate the efficiency of our approach using data from several traffic traces collected at different locations of the Internet. We use feature selection to find an optimal feature set and determine the influence of different features.