CueT: human-guided fast and accurate network alarm triage

  • Authors:
  • Saleema Amershi;Bongshin Lee;Ashish Kapoor;Ratul Mahajan;Blaine Christian

  • Affiliations:
  • University of Washington, Seattle, Washington, USA;Microsoft Research, Redmond, Washington, USA;Microsoft Research, Redmond, Washington, USA;Microsoft Research, Redmond, Washington, USA;Microsoft Corporation, Redmond, Washington, USA

  • Venue:
  • Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
  • Year:
  • 2011

Quantified Score

Hi-index 0.01

Visualization

Abstract

Network alarm triage refers to grouping and prioritizing a stream of low-level device health information to help operators find and fix problems. Today, this process tends to be largely manual because existing tools cannot easily evolve with the network. We present CueT, a system that uses interactive machine learning to learn from the triaging decisions of operators. It then uses that learning in novel visualizations to help them quickly and accurately triage alarms. Unlike prior interactive machine learning systems, CueT handles a highly dynamic environment where the groups of interest are not known a-priori and evolve constantly. A user study with real operators and data from a large network shows that CueT significantly improves the speed and accuracy of alarm triage compared to the network's current practice.