XML has become a widely accepted standard for modelling, storing, and exchanging structured documents. Exploiting document structure can notably improve retrieval performance for XML documents. A growing number of these documents are stored in Peer-to-Peer networks, which are promising self-organizing infrastructures. Documents are distributed over the Peer-to-Peer network either by being stored locally on individual peers or by being assigned to collections such as Digital Libraries. Current search methods for XML documents in Peer-to-Peer networks make little use of Information Retrieval techniques for handling vague queries and determining relevance. Our work aims to develop a search engine for XML documents in which Information Retrieval methods are enhanced with structural information. Both the documents and the global index are distributed over a Peer-to-Peer network, which provides virtually unlimited storage space. In this paper, we propose a conceptual architecture for XML Information Retrieval in Peer-to-Peer networks. Based on this general architecture, we present a component-structured architecture for a concrete search engine, which uses an extension of the Vector Space Model to compute relevance for dynamic XML documents.
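To illustrate how structural information can enter a Vector Space Model, the following is a minimal sketch and not the extension presented in the paper. It assumes that each vector dimension is an (element path, term) pair, that weights are a simple tf-idf variant, and that relevance is the cosine similarity between query and document vectors. The function names `structural_terms`, `cosine`, and `rank` are hypothetical, and everything runs on a single machine, so the distributed index is not modelled.

```python
import math
import xml.etree.ElementTree as ET
from collections import Counter


def structural_terms(xml_text):
    """Count (element path, term) pairs, e.g. ('/article/title', 'xml').

    Assumption: only the text directly inside an element is indexed;
    tail text following child elements is ignored for brevity.
    """
    counts = Counter()

    def walk(node, path):
        path = f"{path}/{node.tag}"
        for term in (node.text or "").lower().split():
            counts[(path, term)] += 1
        for child in node:
            walk(child, path)

    walk(ET.fromstring(xml_text), "")
    return counts


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(k, 0.0) for k, w in a.items())
    norm = (math.sqrt(sum(w * w for w in a.values()))
            * math.sqrt(sum(w * w for w in b.values())))
    return dot / norm if norm else 0.0


def rank(query_xml, docs):
    """Rank documents (name -> XML string) against an XML-fragment query."""
    vectors = {name: structural_terms(text) for name, text in docs.items()}
    n = len(vectors)
    # Document frequency of each (path, term) dimension.
    df = Counter(key for vec in vectors.values() for key in vec)

    def weigh(counts):
        # Simple tf-idf variant; dimensions unseen in the collection are dropped.
        return {k: tf * math.log(1 + n / df[k])
                for k, tf in counts.items() if k in df}

    query = weigh(structural_terms(query_xml))
    scores = {name: cosine(query, weigh(vec)) for name, vec in vectors.items()}
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


docs = {
    "a": "<article><title>xml retrieval</title><body>peer networks</body></article>",
    "b": "<article><title>peer networks</title><body>xml retrieval</body></article>",
}
ranking = rank("<article><title>xml</title></article>", docs)
```

In this sketch a query term only matches a document term found under the same element path, so document "a" (with "xml" in its title) ranks above document "b" (with "xml" only in its body). Exact path matching is a deliberately strict assumption; a model for vague queries would presumably also allow partial matches between paths.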