Document clustering based on non-negative matrix factorization
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Projected Gradient Methods for Nonnegative Matrix Factorization
Neural Computation
Toward Faster Nonnegative Matrix Factorization: A New Algorithm and Comparisons
ICDM '08 Proceedings of the 2008 Eighth IEEE International Conference on Data Mining
Detect and track latent factors with online nonnegative matrix factorization
IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
Graph Regularized Nonnegative Matrix Factorization for Data Representation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Hi-index | 0.00 |
When document corpus is very large, we often need to reduce the number of features. But it is not possible to apply conventional Non-negative Matrix Factorization(NMF) on billion by million matrix as the matrix may not fit in memory. Here we present novel Online NMF algorithm. Using Online NMF, we reduced original high-dimensional space to low-dimensional space. Then we cluster all the documents in reduced dimension using k-means algorithm. We experimentally show that by processing small subsets of documents we will be able to achieve good performance. The method proposed outperforms existing algorithms.