Data quality issues such as missing, erroneous, extreme and duplicate values undermine analysis and are time-consuming to find and fix. Automated methods can help identify anomalies, but determining what constitutes an error is context-dependent and so requires human judgment. While visualization tools can facilitate this process, analysts must often manually construct the necessary views, requiring significant expertise. We present Profiler, a visual analysis tool for assessing quality issues in tabular data. Profiler applies data mining methods to automatically flag problematic data and suggests coordinated summary visualizations for assessing the data in context. The system contributes novel methods for integrated statistical and visual analysis, automatic view suggestion, and scalable visual summaries that support real-time interaction with millions of data points. We present Profiler's architecture (including modular components for custom data types, anomaly detection routines and summary visualizations) and describe its application to motion picture, natural disaster and water quality data sets.
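As a rough illustration of what automatically flagging problematic values in one column of tabular data might look like, the following minimal sketch marks missing, extreme and duplicate entries. It is not Profiler's implementation: the function name `flag_anomalies`, the use of a median/MAD modified z-score for extremes, the 3.5 threshold, and the treatment of exact repeats as duplicates are all assumptions made for this example.

```python
from collections import Counter
from statistics import median


def flag_anomalies(values, threshold=3.5):
    """Return {row_index: [reasons]} for one numeric column.

    Hypothetical helper, not part of Profiler. Missing values are
    represented as None; extremes use a median/MAD modified z-score
    (an assumed choice); duplicates are exact repeats of a value.
    """
    present = [v for v in values if v is not None]
    if not present:
        # Every entry is missing, so there is nothing else to test.
        return {i: ["missing"] for i in range(len(values))}

    med = median(present)
    # Median absolute deviation: a spread estimate that resists outliers.
    mad = median(abs(v - med) for v in present)
    counts = Counter(present)

    flags = {}
    for i, v in enumerate(values):
        reasons = []
        if v is None:
            reasons.append("missing")
        else:
            # 0.6745 rescales the MAD to be comparable to a standard
            # deviation under a normal distribution.
            if mad > 0 and 0.6745 * abs(v - med) / mad > threshold:
                reasons.append("extreme")
            if counts[v] > 1:
                reasons.append("duplicate")
        if reasons:
            flags[i] = reasons
    return flags


if __name__ == "__main__":
    column = [10.0, 12.0, 11.0, None, 12.0, 500.0]
    for row, reasons in sorted(flag_anomalies(column).items()):
        print(row, reasons)
```

Flags of this kind are only candidates. As the abstract notes, whether a flagged value is actually an error depends on context, so the output would feed a visual summary for an analyst to judge rather than an automatic fix.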