Metadata inference for document retrieval in a distributed repository

Authors:
P. Rigaux;N. Spyratos
Affiliations:
Laboratoire de Recherche en Informatique, Université Paris-Sud Orsay, France;Laboratoire de Recherche en Informatique, Université Paris-Sud Orsay, France
Venue:
ASIAN'04 Proceedings of the 9th Asian Computing Science conference on Advances in Computer Science: dedicated to Jean-Louis Lassez on the Occasion of His 5th Cycle Birthday
Year:
2004

Citing 15
Cited 8

Mediators in the Architecture of Future Information Systems

Computer
Your mediators need data conversion!

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Ariadne: a system for constructing mediators for Internet sources

SIGMOD '98 Proceedings of the 1998 ACM SIGMOD international conference on Management of data
Annotea: an open RDF infrastructure for shared Web annotations

Proceedings of the 10th international conference on World Wide Web
RQL: a declarative query language for RDF

Proceedings of the 11th international conference on World Wide Web
EDUTELLA: a P2P networking infrastructure based on RDF

Proceedings of the 11th international conference on World Wide Web
Modern Information Retrieval

Modern Information Retrieval
Automatic metadata generation & evaluation

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
User-System Cooperation in Document Annotation Based on Information Extraction

EKAW '02 Proceedings of the 13th International Conference on Knowledge Engineering and Knowledge Management. Ontologies and the Semantic Web
SemTag and seeker: bootstrapping the semantic web via automated semantic annotation

WWW '03 Proceedings of the 12th international conference on World Wide Web
Data extraction and label assignment for web databases

WWW '03 Proceedings of the 12th international conference on World Wide Web
On deep annotation

WWW '03 Proceedings of the 12th international conference on World Wide Web
Peer-to-Peer Data Management

ICDE '02 Proceedings of the 18th International Conference on Data Engineering
Mediators over Ontology-Based Information Sources

WISE '01 Proceedings of the Second International Conference on Web Information Systems Engineering (WISE'01) Volume 1 - Volume 1
From manual to semi-automatic semantic annotation: about ontology-based text annotation tools

Proceedings of the COLING-2000 Workshop on Semantic Annotation and Intelligent Content

Annotating illuminated manuscripts: an effective tool for research and education

Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
User notification in taxonomy based digital libraries

SIGDOC '06 Proceedings of the 24th annual ACM international conference on Design of communication
A formal model of annotations of digital content

ACM Transactions on Information Systems (TOIS)
Fast user notification in large-scale digital libraries: experiments and results

ADBIS'07 Proceedings of the 11th East European conference on Advances in databases and information systems
Multilingual adaptive search for digital libraries

TPDL'11 Proceedings of the 15th international conference on Theory and practice of digital libraries: research and advanced technology for digital libraries
Querying with preferences in a digital library

Proceedings of the 2005 international conference on Federation over the Web
Preference-based query tuning through refinement/enlargement in a formal context

FoIKS'06 Proceedings of the 4th international conference on Foundations of Information and Knowledge Systems
A system architecture as a support to a flexible annotation service

DELOS'04 Proceedings of the 6th Thematic conference on Peer-to-Peer, Grid, and Service-Orientation in Digital Library Architectures

Quantified Score

Hi-index	0.00

Visualization

Abstract

This paper describes a simple data model for the composition and metadata management of documents in a distributed setting. We assume that each document resides at the local repository of its provider, so all providers’ repositories, collectively, can be thought of as a single database of documents spread over the network. Providers willing to share their documents with other providers in the network must register them with a coordinator, or mediator, and providers that search for documents matching their needs must address their queries to the mediator. The process of registering (or un-registering) a document, formulating a query to the mediator, or answering a query by the mediator, all rely on document content annotation. Content annotation depends on the nature of the document: if the document is atomic then an annotation provided explicitely by the author is sufficient, whereas if the document is composite then the author annotation should be augmented by an implied annotation, i.e., an annotation inferred from the annotations of the document’s components. The main contributions of this paper are: Providing appropriate definitions of document annotations; Providing an algorithm for the automatic computation of implied annotations; Defining the main services that the mediator should support.