Evaluation of an integrated multi-task machine learning system with humans in the loop

  • Authors:
  • Aaron Steinfeld;S. Rachael Bennett;Kyle Cunningham;Matt Lahut;Pablo-Alejandro Quinones;Django Wexler;Dan Siewiorek;Jordan Hayes;Paul Cohen;Julie Fitzgerald;Othar Hansson;Mike Pool;Mark Drummond

  • Affiliations:
  • Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Carnegie Mellon University, Pittsburgh, PA;Bitway, Inc.;U. of Southern California;JSF Consulting;Bitway, Inc.;IET;SRI International

  • Venue:
  • PerMIS '07 Proceedings of the 2007 Workshop on Performance Metrics for Intelligent Systems
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

Performance of a cognitive personal assistant, RADAR, consisting of multiple machine learning components, natural language processing, and optimization was examined with a test explicitly developed to measure the impact of integrated machine learning when used by a human user in a real world setting. Three conditions (conventional tools, Radar without learning, and Radar with learning) were evaluated in a large-scale, between-subjects study. The study revealed that integrated machine learning does produce a positive impact on overall performance. This paper also discusses how specific machine learning components contributed to human-system performance.