K-dimensional trees for continuous traffic classification

Authors:
Valentín Carela-Español;Pere Barlet-Ros;Marc Solé-Simó;Alberto Dainotti;Walter de Donato;Antonio Pescapé
Affiliations:
Department of Computer Architecture, Universitat Politècnica de Catalunya (UPC);Department of Computer Architecture, Universitat Politècnica de Catalunya (UPC);Department of Computer Architecture, Universitat Politècnica de Catalunya (UPC);Department of Computer Engineering and Systems, Universitá di Napoli Federico II;Department of Computer Engineering and Systems, Universitá di Napoli Federico II;Department of Computer Engineering and Systems, Universitá di Napoli Federico II
Venue:
TMA'10 Proceedings of the Second international conference on Traffic Monitoring and Analysis
Year:
2010

Citing 13
Cited 2

K-d trees for semidynamic point sets

SCG '90 Proceedings of the sixth annual symposium on Computational geometry
An Algorithm for Finding Best Matches in Logarithmic Expected Time

ACM Transactions on Mathematical Software (TOMS)
Class-of-service mapping for QoS: a statistical signature-based approach to IP traffic classification

Proceedings of the 4th ACM SIGCOMM conference on Internet measurement
Internet traffic classification using bayesian analysis techniques

SIGMETRICS '05 Proceedings of the 2005 ACM SIGMETRICS international conference on Measurement and modeling of computer systems
BLINC: multilevel traffic classification in the dark

Proceedings of the 2005 conference on Applications, technologies, architectures, and protocols for computer communications
Automated Traffic Classification and Application Identification using Machine Learning

LCN '05 Proceedings of the The IEEE Conference on Local Computer Networks 30th Anniversary
Traffic classification on the fly

ACM SIGCOMM Computer Communication Review
Identifying and discriminating between web and peer-to-peer traffic in the network core

Proceedings of the 16th international conference on World Wide Web
Byte me: a case for byte accuracy in traffic classification

Proceedings of the 3rd annual ACM workshop on Mining network data
Early application identification

CoNEXT '06 Proceedings of the 2006 ACM CoNEXT conference
Internet traffic classification demystified: myths, caveats, and the best practices

CoNEXT '08 Proceedings of the 2008 ACM CoNEXT Conference
TIE: A Community-Oriented Traffic Classification Platform

TMA '09 Proceedings of the First International Workshop on Traffic Monitoring and Analysis
A survey of techniques for internet traffic classification using machine learning

IEEE Communications Surveys & Tutorials

Early classification of network traffic through multi-classification

TMA'11 Proceedings of the Third international conference on Traffic monitoring and analysis
Reviewing traffic classification

DataTraffic Monitoring and Analysis

Quantified Score

Hi-index	0.00

Visualization

Abstract

The network measurement community has proposed multiple machine learning (ML) methods for traffic classification during the last years. Although several research works have reported accuracies over 90%, most network operators still use either obsolete (e.g., port-based) or extremely expensive (e.g., pattern matching) methods for traffic classification. We argue that one of the barriers to the real deployment of ML-based methods is their time-consuming training phase. In this paper, we revisit the viability of using the Nearest Neighbor technique for traffic classification. We present an efficient implementation of this well-known technique based on multiple K-dimensional trees, which is characterized by short training times and high classification speed.This allows us not only to run the classifier online but also to continuously retrain it, without requiring human intervention, as the training data become obsolete. The proposed solution achieves very promising accuracy (95%) while looking just at the size of the very first packets of a flow. We present an implementation of this method based on the TIE classification engine as a feasible and simple solution for network operators.