Electronic journals are becoming a major source of scientific information. Researchers interested only in certain topics do not have time to scan all possibly relevant journals on a regular basis. A digital library can assist them by providing a uniform, searchable interface for electronic journals. For this purpose, the digital library must establish a catalogue of metadata on the available journals, such as the authors and titles of articles. If there is no cooperation with journal publishers, this metadata must be extracted from the publishers' Web sites, overcoming the intrinsic heterogeneity problems. Within the framework of the ongoing Natural Sciences Digital Library project at the Free University of Berlin, we have designed a wrapper-mediator mechanism that copes with the heterogeneity problems of automatic metadata acquisition. It is based on our generic HyperView methodology for the integration of Web sites. From this methodology it inherits two elegant and effective features. First, the structure of the publisher site is specified with abstract graph schemata, instead of being hard-coded in scripts for data acquisition. Second, a powerful view concept based on declarative graph-transformation rules is used for information extraction.
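The two features named above can be illustrated with a small sketch. It is not the HyperView formalism, which this abstract does not specify. It only assumes that a publisher site can be modelled as a typed graph, that a schema lists the edges allowed between node types, and that an extraction rule is plain data (a path of edge labels plus the attributes to keep). All names here (`SCHEMA`, `RULE`, `Node`, `conforms`, `apply_rule`) and the sample values are hypothetical.

```python
from dataclasses import dataclass

# Hypothetical schema: (source node type, edge label) -> target node type.
# The site structure is declared as data rather than coded into a script.
SCHEMA = {
    ("Journal", "issue"): "Issue",
    ("Issue", "article"): "Article",
}

# Hypothetical declarative rule: follow this path of edge labels from the
# root, then keep only the selected attributes of the nodes reached.
RULE = {"path": ["issue", "article"], "select": ["title", "authors"]}


@dataclass
class Node:
    type: str
    attrs: dict
    edges: list  # list of (label, Node) pairs


def conforms(node, schema=SCHEMA):
    """Return True if every edge below `node` is permitted by the schema."""
    for label, child in node.edges:
        if schema.get((node.type, label)) != child.type:
            return False
        if not conforms(child, schema):
            return False
    return True


def apply_rule(root, rule):
    """Evaluate a rule against the graph and return flat metadata records."""
    frontier = [root]
    for label in rule["path"]:
        # Move one step along every edge carrying the current label.
        frontier = [child for node in frontier
                    for (edge_label, child) in node.edges
                    if edge_label == label]
    return [{key: node.attrs.get(key) for key in rule["select"]}
            for node in frontier]


# Made-up sample graph standing in for one publisher site.
article = Node("Article", {"title": "Sample title", "authors": ["A. Author"]}, [])
issue = Node("Issue", {"number": 1}, [("article", article)])
journal = Node("Journal", {"name": "Sample Journal"}, [("issue", issue)])

records = apply_rule(journal, RULE)
```

In this sketch, supporting a second publisher would mean supplying a different schema and rule, while `conforms` and `apply_rule` stay unchanged. That is the kind of separation the abstract contrasts with hard-coded acquisition scripts.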