Ranking flows from sampled traffic

Authors:
Chadi Barakat;Gianluca Iannaccone;Christophe Diot
Affiliations:
INRIA - France;Intel Research, Cambridge, UK;Intel Research, Cambridge, UK
Venue:
CoNEXT '05 Proceedings of the 2005 ACM conference on Emerging network experiment and technology
Year:
2005

Citing 11
Cited 13

Self-similarity in World Wide Web traffic: evidence and possible causes

IEEE/ACM Transactions on Networking (TON)
Load-sensitive routing of long-lived IP flows

Proceedings of the conference on Applications, technologies, architectures, and protocols for computer communication
New directions in traffic measurement and accounting

Proceedings of the 2002 conference on Applications, technologies, architectures, and protocols for computer communications
Properties and prediction of flow statistics from sampled packet streams

Proceedings of the 2nd ACM SIGCOMM Workshop on Internet measurment
Finding Frequent Items in Data Streams

ICALP '02 Proceedings of the 29th International Colloquium on Automata, Languages and Programming
Frequency Estimation of Internet Packet Streams with Limited Space

ESA '02 Proceedings of the 10th Annual European Symposium on Algorithms
What's hot and what's not: tracking most frequent items dynamically

Proceedings of the twenty-second ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems
Estimating flow distributions from sampled flow statistics

Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications
Inverting sampled traffic

Proceedings of the 3rd ACM SIGCOMM conference on Internet measurement
Identifying elephant flows through periodically sampled packets

Proceedings of the 4th ACM SIGCOMM conference on Internet measurement
Modeling Internet backbone traffic at the flow level

IEEE Transactions on Signal Processing

Probabilistic lossy counting: an efficient algorithm for finding heavy hitters

ACM SIGCOMM Computer Communication Review
Load shedding in network monitoring applications

ATC'07 2007 USENIX Annual Technical Conference on Proceedings of the USENIX Annual Technical Conference
Processing top k queries from samples

CoNEXT '06 Proceedings of the 2006 ACM CoNEXT conference
Processing top-k queries from samples

Computer Networks: The International Journal of Computer and Telecommunications Networking
A Space-Efficient Fair Packet Sampling Algorithm

APNOMS '08 Proceedings of the 11th Asia-Pacific Symposium on Network Operations and Management: Challenges for Next Generation Network Operations and Service Management
Robust network monitoring in the presence of non-cooperative traffic queries

Computer Networks: The International Journal of Computer and Telecommunications Networking
Maximum likelihood estimation of the flow size distribution tail index from sampled packet data

Proceedings of the eleventh international joint conference on Measurement and modeling of computer systems
An analysis of packet sampling in the frequency domain

Proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference
Detectability of traffic anomalies in two adjacent networks

PAM'07 Proceedings of the 8th international conference on Passive and active network measurement
On-line predictive load shedding for network monitoring

NETWORKING'07 Proceedings of the 6th international IFIP-TC6 conference on Ad Hoc and sensor networks, wireless networks, next generation internet
Quick detection of top-k personalized pagerank lists

WAW'11 Proceedings of the 8th international conference on Algorithms and models for the web graph
Mining approximate frequent closed flows over packet streams

DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Predictive resource management of multiple monitoring applications

IEEE/ACM Transactions on Networking (TON)

Quantified Score

Hi-index	0.00

Visualization

Abstract

Most of the theoretical work on sampling has addressed the inversion of general traffic properties such as flow size distribution, average flow size, or total number of flows. In this paper, we make a step towards understanding the impact of packet sampling on individual flow properties. We study how to detect and rank the largest flows on a link. To this end, we develop an analytical model that we validate on real traces from two networks. First we study a blind ranking method where only the number of sampled packets from each flow is known. Then, we propose a new method, protocol-aware ranking, where we make use of the packet sequence number (when available in transport header) to infer the number of non-sampled packets from a flow, and hence to improve the ranking. Surprisingly, our analytical and experimental results indicate that a high sampling rate (10% and even more depending on the number of top flows to be ranked) is required for a correct blind ranking of the largest flows. The sampling rate can be reduced by an order of magnitude if one just aims at detecting these flows or by using the protocol-aware method.