On Updating Problems in Latent Semantic Indexing
SIAM Journal on Scientific Computing
Chord: A scalable peer-to-peer lookup service for internet applications
Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
A scalable content-addressable network
Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
Pastry: Scalable, Decentralized Object Location, and Routing for Large-Scale Peer-to-Peer Systems
Middleware '01 Proceedings of the IFIP/ACM International Conference on Distributed Systems Platforms Heidelberg
Peer-to-peer information retrieval using self-organizing semantic overlay networks
Proceedings of the 2003 conference on Applications, technologies, architectures, and protocols for computer communications
Efficient Semantic-Based Content Search in P2P Network
IEEE Transactions on Knowledge and Data Engineering
On scaling latent semantic indexing for large peer-to-peer systems
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Hi-index | 0.00 |
Recently published studies have shown that Latent Semantic Indexing (LSI) plays an important role in content-based full text information retrieval of P2P system. However, it is a challenging problem to generate global consistent LSI structures in P2P systems because their nodes are self-organizing and their corpora are large, dynamic and distributed on different nodes. In this paper we propose a method for building LSI structures from distributed corpora. Our method is consisted with a network model for semantic information sampling and exchanging and a Reduced-Dimension-Representation (RDR)s merging algorithm. By the signal and noise subspace model, we also provide a theoretical justification that the RDR merging algorithm is sound. A simple numerical experiment shows that our RDR merging algorithm can keep query precision on an acceptable level.