Principles of mixed-initiative user interfaces
Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Learning Information Extraction Rules for Semi-Structured and Free Text
Machine Learning - Special issue on natural language learning
AJAX: an extensible data cleaning tool
SIGMOD '00 Proceedings of the 2000 ACM SIGMOD international conference on Management of data
SWYN: a visual representation for regular expressions
Your wish is my command
SchemaSQL: An extension to SQL for multidatabase interoperability
ACM Transactions on Database Systems (TODS)
Mining database structure; or, how to build a data quality browser
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
Potter's Wheel: An Interactive Data Cleaning System
Proceedings of the 27th International Conference on Very Large Data Bases
Interactive Simultaneous Editing of Multiple Text Regions
Proceedings of the General Track: 2002 USENIX Annual Technical Conference
A survey of approaches to automatic schema matching
The VLDB Journal — The International Journal on Very Large Data Bases
Exploratory Data Mining and Data Cleaning
Exploratory Data Mining and Data Cleaning
Extracting structured data from Web pages
Proceedings of the 2003 ACM SIGMOD international conference on Management of data
A Survey of Outlier Detection Methodologies
Artificial Intelligence Review
Visualization of mappings between schemas
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
PADS: a domain-specific language for processing ad hoc data
Proceedings of the 2005 ACM SIGPLAN conference on Programming language design and implementation
Clio grows up: from research prototype to industrial tool
Proceedings of the 2005 ACM SIGMOD international conference on Management of data
Duplicate Record Detection: A Survey
IEEE Transactions on Knowledge and Data Engineering
Interactive generation of integrated schemas
Proceedings of the 2008 ACM SIGMOD international conference on Management of data
Proceedings of the 13th international conference on Intelligent user interfaces
Interactive Entity Resolution in Relational Data: A Visual Analytic Tool and Its Evaluation
IEEE Transactions on Visualization and Computer Graphics
End-user programming of mashups with vegemite
Proceedings of the 14th international conference on Intelligent user interfaces
Intelligently creating and recommending reusable reformatting rules
Proceedings of the 14th international conference on Intelligent user interfaces
Potluck: semi-ontology alignment for casual users
ISWC'07/ASWC'07 Proceedings of the 6th international The semantic web and 2nd Asian conference on Asian semantic web conference
The Design of Everyday Things
Spreadsheet table transformations from examples
Proceedings of the 32nd ACM SIGPLAN conference on Programming language design and implementation
Mixer: mixed-initiative data retrieval and integration by example
INTERACT'11 Proceedings of the 13th IFIP TC 13 international conference on Human-computer interaction - Volume Part I
Proactive wrangling: mixed-initiative end-user programming of data transformation scripts
Proceedings of the 24th annual ACM symposium on User interface software and technology
CHI '12 Extended Abstracts on Human Factors in Computing Systems
Spreadsheet data manipulation using examples
Communications of the ACM
Profiler: integrated statistical analysis and visualization for data quality assessment
Proceedings of the International Working Conference on Advanced Visual Interfaces
Redeeming pedigree data with an interactive error cleaning visualisation
Proceedings of the International Working Conference on Advanced Visual Interfaces
Interactive analysis of big data
XRDS: Crossroads, The ACM Magazine for Students - Big Data
Learning data transformation rules through examples: preliminary results
Proceedings of the Ninth International Workshop on Information Integration on the Web
Information Visualization - Special issue on State of the Field and New Research Directions
Synthesizing number transformations from input-output examples
CAV'12 Proceedings of the 24th international conference on Computer Aided Verification
A demonstration of DBWipes: clean as you query
Proceedings of the VLDB Endowment
DataPlay: interactive tweaking and example-driven correction of graphical database queries
Proceedings of the 25th annual ACM symposium on User interface software and technology
I can do text analytics!: designing development tools for novice developers
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Building blocks for exploratory data analysis tools
Proceedings of the ACM SIGKDD Workshop on Interactive Data Exploration and Analytics
Proceedings of the ACM SIGKDD Workshop on Interactive Data Exploration and Analytics
A colorful approach to text processing by example
Proceedings of the 26th annual ACM symposium on User interface software and technology
DeExcelerator: a framework for extracting relational data from partially structured documents
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Forge: generating a high performance DSL implementation from a declarative specification
Proceedings of the 12th international conference on Generative programming: concepts & experiences
Minimizing user effort in transforming data by example
Proceedings of the 19th international conference on Intelligent User Interfaces
Hi-index | 0.02 |
Though data analysis tools continue to improve, analysts still expend an inordinate amount of time and effort manipulating data and assessing data quality issues. Such "data wrangling" regularly involves reformatting data values or layout, correcting erroneous or missing values, and integrating multiple data sources. These transforms are often difficult to specify and difficult to reuse across analysis tasks, teams, and tools. In response, we introduce Wrangler, an interactive system for creating data transformations. Wrangler combines direct manipulation of visualized data with automatic inference of relevant transforms, enabling analysts to iteratively explore the space of applicable operations and preview their effects. Wrangler leverages semantic data types (e.g., geographic locations, dates, classification codes) to aid validation and type conversion. Interactive histories support review, refinement, and annotation of transformation scripts. User study results show that Wrangler significantly reduces specification time and promotes the use of robust, auditable transforms instead of manual editing.