Technometrics
Constant interaction-time scatter/gather browsing of very large document collections
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Pad++: a zooming graphical interface for exploring alternate interface physics
UIST '94 Proceedings of the 7th annual ACM symposium on User interface software and technology
Reexamining the cluster hypothesis: scatter/gather on retrieval results
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
A vector space model for automatic indexing
Communications of the ACM
Domain visualization using VxInsight for science and technology management
Journal of the American Society for Information Science and Technology
Learning Algorithms for Keyphrase Extraction
Information Retrieval
Proceedings of the 8th international conference on Intelligent user interfaces
The STARLIGHT information visualization system
IV '97 Proceedings of the IEEE Conference on Information Visualisation
Visualizing the non-visual: spatial analysis and interaction with information from text documents
INFOVIS '95 Proceedings of the 1995 IEEE Symposium on Information Visualization
The Journal of Machine Learning Research
Visualization for the document space
VIS '92 Proceedings of the 3rd conference on Visualization '92
Mapping Medline Papers, Genes, and Proteins Related to Melanoma Research
IV '04 Proceedings of the Information Visualisation, Eighth International Conference
Journal of the American Society for Information Science and Technology
Incorporating non-local information into information extraction systems by Gibbs sampling
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Getting our head in the clouds: toward evaluation studies of tagclouds
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Visualizing the marrow of science
Journal of the American Society for Information Science and Technology
The folksonomy tag cloud: when is it useful?
Journal of Information Science
TIMELINES: Tag clouds and the case for vernacular visualization
interactions - Changing energy use through design
Direct manipulation interfaces
Human-Computer Interaction
The Word Tree, an Interactive Visual Concordance
IEEE Transactions on Visualization and Computer Graphics
Computer Methods and Programs in Biomedicine
Jigsaw: Supporting Investigative Analysis through Interactive Visualization
VAST '07 Proceedings of the 2007 IEEE Symposium on Visual Analytics Science and Technology
Dynamicity vs. effectiveness: studying online clustering for scatter/gather
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Studying the history of ideas using topic models
EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Active learning with statistical models
Journal of Artificial Intelligence Research
A Nested Model for Visualization Design and Validation
IEEE Transactions on Visualization and Computer Graphics
IEEE Transactions on Visualization and Computer Graphics
The automatic creation of literature abstracts
IBM Journal of Research and Development
Labeled LDA: a supervised topic model for credit attribution in multi-labeled corpora
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
TIARA: a visual exploratory text analytic system
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
FacetAtlas: Multifaceted Visualization for Rich Text Corpora
IEEE Transactions on Visualization and Computer Graphics
CueT: human-guided fast and accurate network alarm triage
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
TopicNets: Visual Analysis of Large Text Corpora with Topic Modeling
ACM Transactions on Intelligent Systems and Technology (TIST)
Docuburst: visualizing document content using language structure
EuroVis'09 Proceedings of the 11th Eurographics / IEEE - VGTC conference on Visualization
Termite: visualization techniques for assessing textual topic models
Proceedings of the International Working Conference on Advanced Visual Interfaces
The four-level nested model revisited: blocks and guidelines
Proceedings of the 2012 BELIV Workshop: Beyond Time and Errors - Novel Evaluation Methods for Visualization
Optimizing temporal topic segmentation for intelligent text visualization
Proceedings of the 2013 international conference on Intelligent user interfaces
Crowd synthesis: extracting categories and clusters from complex data
Proceedings of the 17th ACM conference on Computer supported cooperative work & social computing
Hi-index | 0.01 |
Statistical topic models can help analysts discover patterns in large text corpora by identifying recurring sets of words and enabling exploration by topical concepts. However, understanding and validating the output of these models can itself be a challenging analysis task. In this paper, we offer two design considerations - interpretation and trust - for designing visualizations based on data-driven models. Interpretation refers to the facility with which an analyst makes inferences about the data through the lens of a model abstraction. Trust refers to the actual and perceived accuracy of an analyst's inferences. These considerations derive from our experiences developing the Stanford Dissertation Browser, a tool for exploring over 9,000 Ph.D. theses by topical similarity, and a subsequent review of existing literature. We contribute a novel similarity measure for text collections based on a notion of "word-borrowing" that arose from an iterative design process. Based on our experiences and a literature review, we distill a set of design recommendations and describe how they promote interpretable and trustworthy visual analysis tools.