The vocabulary problem in human-system communication
Communications of the ACM
ACM Transactions on Mathematical Software (TOMS)
Automatic text structuring and retrieval-experiments in automatic encyclopedia searching
SIGIR '91 Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval
Large-scale information retrieval with latent semantic indexing
Information Sciences: an International Journal
The symmetric eigenvalue problem
The symmetric eigenvalue problem
RCV1: A New Benchmark Collection for Text Categorization Research
The Journal of Machine Learning Research
Understanding Search Engines: Mathematical Modeling and Text Retrieval (Software, Environments, Tools), Second Edition
Distributed, large-scale latent semantic analysis by index interpolation
Proceedings of the 3rd international conference on Scalable information systems
Visualization of temporal text collections based on Correspondence Analysis
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
The following study documents a formal evaluation of the performance tradeoffs and scalability for computing the sparse matrix singular value decomposition (SVD) as part of the Latent Semantic Analysis (LSA) of a given document collection with an out-of-core process. Most software packages capable of computing the SVD do all of their processing in-core, which involves keeping all vectors for the computation in memory. This limits the size of document collections that can be processed. The goal of the study was specifically to evaluate software capable of performing the SVD calculations out-of-core, minimizing memory usage by keeping only a small set of work vectors in memory at a time. Performance measures of interest for this study included the time of execution, both in CPU time and wall clock time, as well the memory and disk usage for computing the SVD.