The University of Trier maintains the DBLP (Digital Bibliography & Library Project) Computer Science Bibliography, which offers bibliographic information on more than 870,000 scientific publications. This paper describes the DBLP WebCrawler, a meta search engine that searches the web for full-text publications in PDF format for each DBLP entry. Various search engines, such as Google and Yahoo, are used as data sources. The retrieved documents are then analysed and ranked according to their relevance. The proposed system differs from systems like CiteSeer in that the DBLP WebCrawler builds upon metadata and tries to find relevant full texts, whereas CiteSeer mainly starts with full texts and extracts metadata.
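
The metadata-first idea can be illustrated with a minimal Python sketch. The abstract does not specify how the DBLP WebCrawler scores relevance, so everything here is an assumption made for illustration: the `Candidate` record, the token-overlap (Jaccard) score between the DBLP title and the title of a retrieved hit, the small bonus for URLs ending in `.pdf`, and the `merge_and_rank` helper. The search engine calls themselves are left out; the result lists are assumed to have been fetched already.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """One hit returned by a search engine (hypothetical record layout)."""
    url: str
    title: str
    source: str  # label of the engine that returned the hit


def tokens(text):
    """Lowercase alphanumeric word set of a title."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def relevance(entry_title, candidate):
    """Assumed score: Jaccard overlap of title tokens, plus a PDF bonus."""
    a, b = tokens(entry_title), tokens(candidate.title)
    if not a or not b:
        return 0.0
    score = len(a & b) / len(a | b)
    if candidate.url.lower().endswith(".pdf"):
        score += 0.1  # illustrative bonus, not taken from the paper
    return score


def merge_and_rank(entry_title, result_lists):
    """Merge hits from several engines, deduplicate by URL, sort by score."""
    best = {}
    for results in result_lists:
        for cand in results:
            score = relevance(entry_title, cand)
            # Keep the best-scoring copy when engines return the same URL.
            if cand.url not in best or score > best[cand.url][0]:
                best[cand.url] = (score, cand)
    ranked = sorted(best.values(), key=lambda pair: pair[0], reverse=True)
    return [(cand.url, round(score, 3)) for score, cand in ranked]
```

Calling `merge_and_rank` with a DBLP entry title and one list of `Candidate` hits per engine returns `(url, score)` pairs, best match first. Deduplicating by URL keeps a document that several engines report from crowding the ranking.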