The Harvest information discovery and access system
Computer Networks and ISDN Systems
Networked information retrieval
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Proceedings of the third ACM conference on Digital libraries
Greenstone: a comprehensive open-source digital library software system
DL '00 Proceedings of the fifth ACM conference on Digital libraries
Probe, count, and classify: categorizing hidden web databases
SIGMOD '01 Proceedings of the 2001 ACM SIGMOD international conference on Management of data
The open archives initiative: building a low-barrier interoperability framework
Proceedings of the 1st ACM/IEEE-CS joint conference on Digital libraries
Bootstrapping for example-based data extraction
Proceedings of the tenth international conference on Information and knowledge management
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
5SL: a language for declarative specification and generation of digital libraries
Proceedings of the 2nd ACM/IEEE-CS joint conference on Digital libraries
A brief survey of web data extraction tools
ACM SIGMOD Record
Web-DL: an experience in building digital libraries from the web
Proceedings of the eleventh international conference on Information and knowledge management
DEByE - Date extraction by example
Data & Knowledge Engineering
Java MARIAN: From an OPAC to a Modern Digital Library System
SPIRE 2002 Proceedings of the 9th International Symposium on String Processing and Information Retrieval
A Framework for Generating Attribute Extractors for Web Data Sources
SPIRE 2002 Proceedings of the 9th International Symposium on String Processing and Information Retrieval
Representing and Querying Semistructured Web Data Using Nested Tables with Structural Variants
ER '02 Proceedings of the 21st International Conference on Conceptual Modeling
BDBComp: building a digital library for the Brazilian computer science community
Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries
Panorama: extending digital libraries with topical crawlers
Proceedings of the 4th ACM/IEEE-CS joint conference on Digital libraries
Automatic generation of agents for collecting hidden web pages for data extraction
Data & Knowledge Engineering - Special issue: WIDM 2002
Freebie: an open source digital library with support for textual and spatial searches
WebMedia '06 Proceedings of the 12th Brazilian Symposium on Multimedia and the web
Evaluating a digital library self-archiving service: The BDBComp user case study
Information Processing and Management: an International Journal
Clustering-based schema matching of web data for constructing digital library
ICCSA'05 Proceedings of the 2005 international conference on Computational Science and Its Applications - Volume Part II
Hi-index | 0.00 |
The Web contains a huge volume of unstructured data, which is difficult to manage. In digital libraries, on the other hand, information is explicitly organized, described, and managed. Community-oriented services are built to attend specific information needs and tasks. In this paper, we describe an environment, Web-DL, that allows the construction of digital libraries from the Web. The Web-DL environment will allow us to collect data from the Web, standardize it, and publish it through a digital library system. It provides support to services and organizational structure normally available in digital libraries, but benefiting from the breadth of the Web contents. We experimented with applying the Web-DL environment to the Networked Digital Library of Theses and Dissertations (NDLTD), thus demonstrating that the rapid construction of DLs from the Web is possible. Also, Web-DL provides an alternative as a largescale solution for interoperability between independent digital libraries.