An architecture for xml information retrieval in a peer-to-peer environment

Authors:
Judith Winter;Oswald Drobnik
Affiliations:
J. W. Goethe-University, Frankfurt/Main, Germany;J. W. Goethe-University, Frankfurt/Main, Germany
Venue:
Proceedings of the ACM first Ph.D. workshop in CIKM
Year:
2007

Citing 18
Cited 1

A vector space model for automatic indexing

Communications of the ACM
A scalable content-addressable network

Proceedings of the 2001 conference on Applications, technologies, architectures, and protocols for computer communications
A survey in indexing and searching XML documents

Journal of the American Society for Information Science and Technology - XML
Querying and ranking XML documents

Journal of the American Society for Information Science and Technology - XML
Modern Information Retrieval

Modern Information Retrieval
Looking up data in P2P systems

Communications of the ACM
Chord: a scalable peer-to-peer lookup protocol for internet applications

IEEE/ACM Transactions on Networking (TON)
Searching XML documents via XML fragments

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Configurable indexing and ranking for XML information retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Information Retrieval Techniques for Peer-to-Peer Networks

Computing in Science and Engineering
Information Retrieval: Algorithms and Heuristics (The Kluwer International Series on Information Retrieval)

Information Retrieval: Algorithms and Heuristics (The Kluwer International Series on Information Retrieval)
Peer-to-peer management of XML data: issues and research challenges

ACM SIGMOD Record
Peer-to-Peer Systems and Applications (Lecture Notes in Computer Science)

Peer-to-Peer Systems and Applications (Lecture Notes in Computer Science)
A picture of search

InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
XML search: languages, INEX and scoring

ACM SIGMOD Record
Introduction to Information Retrieval

Introduction to Information Retrieval
A peer-to-peer architecture for information retrieval across digital library collections

ECDL'06 Proceedings of the 10th European conference on Research and Advanced Technology for Digital Libraries
XPeer: a self-organizing XML P2P database system

EDBT'04 Proceedings of the 2004 international conference on Current Trends in Database Technology

Routing of structured queries in large-scale distributed systems

Proceedings of the 2008 ACM workshop on Large-Scale distributed systems for information retrieval

Quantified Score

Hi-index	0.01

Visualization

Abstract

XML has become a widely accepted standard for modelling, storing, and exchanging structured documents. Taking advantage of the document structure can result in improving the retrieval performance of XML-documents notably. A growing number of these documents are stored in Peer-to-Peer networks, which are promising self-organizing infrastructures. Documents are distributed over the Peer-to-Peer network by either being stored locally on individual peers or by being assigned to collections such as Digital Libraries. Current search methods for XML-documents in Peer-to-Peer networks lack the use of Information Retrieval techniques for vague queries and relevance detection. Our work aims for the development of a search engine for XML-documents, where Information Retrieval methods are enhanced by using structural information. Documents and global index are distributed over a Peer-to-Peer Network, building a virtually unlimited storage space. In this paper, a conceptual architecture for XML Information Retrieval in Peer-to-Peer networks is proposed. Based on this general architecture, a component-structured architecture for a concrete search engine is presented, which uses an extension of the Vector Space Model to compute relevance for dynamic XML-documents.