Rich interaction in the digital library
Communications of the ACM
OPTICS: ordering points to identify the clustering structure
SIGMOD '99 Proceedings of the 1999 ACM SIGMOD international conference on Management of data
Document clustering for electronic meetings: an experimental comparison of two techniques
Decision Support Systems - From information retrieval to knowledge management: enabling technologies and best practices
Performance Evaluation of Some Clustering Algorithms and Validity Indices
IEEE Transactions on Pattern Analysis and Machine Intelligence
Improved Ant-Based Clustering and Sorting
PPSN VII Proceedings of the 7th International Conference on Parallel Problem Solving from Nature
BiblioMapper: A Cluster-Based Information Visualization Technique
INFOVIS '98 Proceedings of the 1998 IEEE Symposium on Information Visualization
INFOVIS '04 Proceedings of the IEEE Symposium on Information Visualization
NLTK: the Natural Language Toolkit
ETMTNLP '02 Proceedings of the ACL-02 Workshop on Effective tools and methodologies for teaching natural language processing and computational linguistics - Volume 1
Exploratory search: from finding to understanding
Communications of the ACM - Supporting exploratory search
The Eurovision St Andrews collection of photographs
ACM SIGIR Forum
LDA-based document models for ad-hoc retrieval
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic labeling of multinomial topic models
Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
Top 10 algorithms in data mining
Knowledge and Information Systems
Evaluation methods for topic models
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
A comparison of extrinsic clustering evaluation metrics based on formal constraints
Information Retrieval
Parallel K-Means Clustering Based on MapReduce
CloudCom '09 Proceedings of the 1st International Conference on Cloud Computing
Proceedings of the 19th international conference on World wide web
Evaluating topic models for digital libraries
Proceedings of the 10th annual joint conference on Digital libraries
Cluster-based navigation for a virtual museum
RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
Proceedings of the VLDB Endowment
Least squares quantization in PCM
IEEE Transactions on Information Theory
Technical Section: EXOD: A tool for building and exploring a large graph of open datasets
Computers and Graphics
Hi-index | 0.00 |
Large digital libraries have become available over the past years through digitisation and aggregation projects. These large collections present a challenge to the new user who wishes to discover what is available in the collections. Subject classification can help in this task, however in large collections it is frequently incomplete or inconsistent. Automatic clustering algorithms provide a solution to this, however the question remains whether they produce clusters that are sufficiently cohesive and distinct for them to be used in supporting discovery and exploration in digital libraries. In this paper we present a novel approach to investigating cluster cohesion that is based on identifying instruders in a cluster. The results from a human-subject experiment show that clustering algorithms produce clusters that are sufficiently cohesive to be used where no (consistent) manual classification exists.