Category labelling for automatic classification scheme generation

Authors:
Rodrigo Sánchez Jiménez
Affiliations:
Dpto. de Biblioteconomía y Documentación UCM, Facultad de Ciencias de la Información, Madrid, Spain
Venue:
FDIA'07 Proceedings of the 1st BCS IRSG conference on Future Directions in Information Access
Year:
2007

Abstract

This paper proposes a line of research for developing new ways of automatically characterizing groups of documents, whether clusters or categories. It builds on the work of many other researchers and summarizes the most problematic issues in category labelling in order to devise possible solutions. Various lines of action are described, along with future research directions and developments.