This paper proposes a line of research for developing new ways to automatically characterize groups of documents, whether clusters or categories. It builds on the work of many other researchers and summarizes the most problematic issues in category labelling in order to devise possible solutions. Various lines of action are described, along with directions for future research and development.