Using behavioral data to identify interviewer fabrication in surveys

  • Authors:
  • Benjamin Birnbaum;Gaetano Borriello;Abraham D. Flaxman;Brian DeRenzi;Anna R. Karlin

  • Affiliations:
  • University of Washington, Seattle, WA, USA;University of Washington, Seattle, Washington, USA;University of Washington, Seattle, Washington, USA;University of Washington, Seattle, Washington, USA;University of Washington, Seattle, Washington, USA

  • Venue:
  • Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
  • Year:
  • 2013

Quantified Score

Hi-index 0.01

Visualization

Abstract

Surveys conducted by human interviewers are one of the principal means of gathering data from all over the world, but the quality of this data can be threatened by interviewer fabrication. In this paper, we investigate a new approach to detecting interviewer fabrication automatically. We instrument electronic data collection software to record logs of low-level behavioral data and show that supervised classification, when applied to features extracted from these logs, can identify interviewer fabrication with an accuracy of up to 96%. We show that even when interviewers know that our approach is being used, have some knowledge of how it works, and are incentivized to avoid detection, it can still achieve an accuracy of 86%. We also demonstrate the robustness of our approach to a moderate amount of label noise and provide practical recommendations, based on empirical evidence, on how much data is needed for our approach to be effective.