This paper deals with clustering text documents in two stages: a neural network is first used for preprocessing, and non-negative factor analysis is then applied to create a given number of clusters. Experiments on part of the Reuters-21578 collection show that the requested number of clusters is created; the difference between clusters is measured as the cosine similarity between the centroids of the respective clusters. The results show that when the data are preprocessed by PCA, non-negative factor analysis divides the documents into the given number of clusters quite successfully.
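
The pipeline described above (PCA preprocessing, non-negative factorization into a fixed number of clusters, cosine similarity between cluster centroids) might be sketched roughly as follows. This is a minimal illustration, not the authors' implementation: the abstract does not specify the neural network, so PCA is computed here directly via SVD; the factorization uses the standard Lee-Seung multiplicative updates as a stand-in for the paper's non-negative factor analysis; and shifting the PCA output to make it non-negative is an assumption made only so that the factorization is applicable. All function names are hypothetical.

```python
import numpy as np


def pca_reduce(X, n_components):
    """Project the rows of X (documents) onto the top principal components."""
    Xc = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(Xc, full_matrices=False)
    return Xc @ Vt[:n_components].T


def nmf(V, k, n_iter=200, seed=0, eps=1e-9):
    """Factor non-negative V ~ W @ H with Lee-Seung multiplicative updates.

    Stand-in for the paper's non-negative factor analysis (assumption).
    """
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W = rng.random((n, k)) + eps
    H = rng.random((k, m)) + eps
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)
        W *= (V @ H.T) / (W @ H @ H.T + eps)
    return W, H


def cluster_documents(X, n_components, k):
    """Assign each document (row of X) to one of k clusters."""
    Z = pca_reduce(X, n_components)
    # ASSUMPTION: PCA scores can be negative, so shift them to be >= 0
    # before factorizing; the abstract does not say how this is handled.
    Z_nonneg = Z - Z.min()
    W, _ = nmf(Z_nonneg, k)
    # Each document goes to the factor with the largest weight.
    return W.argmax(axis=1)


def centroid_cosine_similarity(X, labels, eps=1e-12):
    """Cosine similarity between centroids of the non-empty clusters."""
    present = np.unique(labels)
    centroids = np.vstack([X[labels == c].mean(axis=0) for c in present])
    norms = np.linalg.norm(centroids, axis=1, keepdims=True)
    unit = centroids / (norms + eps)
    return unit @ unit.T
```

With a document-term matrix `X` (one row per document), `labels = cluster_documents(X, n_components=3, k=2)` gives a cluster index per document, and `centroid_cosine_similarity(X, labels)` returns a symmetric matrix whose off-diagonal entries indicate how similar the clusters are: values near 0 suggest well-separated clusters, values near 1 suggest overlapping ones.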