An Approach to Microscopic Clustering of Terms and Documents

Authors:
Akiko Aizawa
Affiliations:
-
Venue:
PRICAI '02 Proceedings of the 7th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Year:
2002

Citing 9
Cited 2

Comparison of hierarchic agglomerative clustering methods for document retrieval

The Computer Journal
Scatter/Gather: a cluster-based approach to browsing large document collections

SIGIR '92 Proceedings of the 15th annual international ACM SIGIR conference on Research and development in information retrieval
Document clustering using word clusters via the information bottleneck method

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Concept decompositions for large sparse text data using clustering

Machine Learning
A statistical learning learning model of text classification for support vector machines

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Co-clustering documents and words using bipartite spectral graph partitioning

Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Discovery of Web Communities Based on the Co-Occurence of References

DS '00 Proceedings of the Third International Conference on Discovery Science
A method of cluster-based indexing of textual data

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
A co-evolutionary framework for clustering in information retrieval systems

CEC '02 Proceedings of the Evolutionary Computation on 2002. CEC '02. Proceedings of the 2002 Congress - Volume 02

A method of cluster-based indexing of textual data

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 1
Using visual linkages for multilingual image retrieval

CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we present an approach to clustering in text-based information retrieval systems. The proposed method generates overlapping clusters, each of which is composed of subsets of associated terms and documents with normalized significance weights. In the paper, we first briefly introduce the probabilistic formulation of our clustering scheme and then show the procedure for cluster generation. We also report some experimental results, where the generated clusters are investigated in the framework of automatic text categorization.