Proactive wrangling: mixed-initiative end-user programming of data transformation scripts

  • Authors:
  • Philip J. Guo;Sean Kandel;Joseph M. Hellerstein;Jeffrey Heer

  • Affiliations:
  • Stanford University, Stanford, CA, USA;Stanford University, Stanford, CA, USA;University of California, Berkeley, Berkeley, CA, USA;Stanford University, Stanford, CA, USA

  • Venue:
  • Proceedings of the 24th annual ACM symposium on User interface software and technology
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Analysts regularly wrangle data into a form suitable for computational tools through a tedious process that delays more substantive analysis. While interactive tools can assist data transformation, analysts must still conceptualize the desired output state, formulate a transformation strategy, and specify complex transforms. We present a model to proactively suggest data transforms which map input data to a relational format expected by analysis tools. To guide search through the space of transforms, we propose a metric that scores tables according to type homogeneity, sparsity and the presence of delimiters. When compared to "ideal" hand-crafted transformations, our model suggests over half of the needed steps; in these cases the top-ranked suggestion is preferred 77% of the time. User study results indicate that suggestions produced by our model can assist analysts' transformation tasks, but that users do not always value proactive assistance, instead preferring to maintain the initiative. We discuss some implications of these results for mixed-initiative interfaces.