Automatic text processing
Self-organization and associative memory: 3rd edition
Self-organization and associative memory: 3rd edition
Combinatorial algorithms for integrated circuit layout
Combinatorial algorithms for integrated circuit layout
A fuzzy clustering algorithm for graph bisection
Information Processing Letters
Fast and effective text mining using linear-time document clustering
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Web searching: a process-oriented experimental study of three interactive search paradigms
Journal of the American Society for Information Science and Technology
Information Retrieval Systems: Theory and Implementation
Information Retrieval Systems: Theory and Implementation
Information Retrieval
IEEE Transactions on Pattern Analysis and Machine Intelligence
GD '96 Proceedings of the Symposium on Graph Drawing
Graph Clustering Using Multiway Ratio Cut
GD '97 Proceedings of the 5th International Symposium on Graph Drawing
On the Nature of Structure and Its Identification
WG '99 Proceedings of the 25th International Workshop on Graph-Theoretic Concepts in Computer Science
Cluster Validation with Generalized Dunn's Indices
ANNES '95 Proceedings of the 2nd New Zealand Two-Stream International Conference on Artificial Neural Networks and Expert Systems
Query-By-Keywords (QBK): Query Formulation Using Semantics and Feedback
ER '09 Proceedings of the 28th International Conference on Conceptual Modeling
On the relative hardness of clustering corpora
TSD'07 Proceedings of the 10th international conference on Text, speech and dialogue
Sense cluster based categorization and clustering of abstracts
CICLing'06 Proceedings of the 7th international conference on Computational Linguistics and Intelligent Text Processing
An approach to clustering abstracts
NLDB'05 Proceedings of the 10th international conference on Natural Language Processing and Information Systems
Hi-index | 0.00 |
Automatic document categorization plays a key role in the development of future interfaces for Web-based search. Clustering algorithms are considered as a technology that is capable of mastering this "ad-hoc" categorization task.This paper presents results of a comprehensive analysis of clustering algorithms in connection with document categorization. The contributions relate to exemplar-based, hierarchical, and density-based clustering algorithms. In particular, we contrast ideal and real clustering settings and present runtime results that are based on efficient implementations of the investigated algorithms.