Phonetic string matching: lessons from information retrieval
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Retrieval effectiveness of proper name search methods
Information Processing and Management: an International Journal
Projekt Der Deutsche Wortschatz
Linguistik und neue Medien [10. Jahrestagung der GLDV
Advances in Information Retrieval: 28th European Conference on IR Research, ECIR 2006, London, UK, April 10-12, 2006, Proceedings (Lecture Notes in Computer Science)
Generating search term variants for text collections with historic spellings
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
A cross-language approach to historic document retrieval
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Annotating historical archives of images
Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data
On lexical resources for digitization of historical documents
Proceedings of the 9th ACM symposium on Document engineering
Non-interactive OCR post-correction for giga-scale digitization projects
CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Using word sense discrimination on historic document collections
Proceedings of the 10th annual joint conference on Digital libraries
Visualization of relationships among historical persons using locational information
W2GIS'11 Proceedings of the 10th international conference on Web and wireless geographical information systems
Efficiently generating correction suggestions for garbled tokens of historical language
Natural Language Engineering
Extending the tool, or how to annotate historical language varieties
LaTeCH '11 Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
Which words do you remember? temporal properties of language use in digital archives
TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries
Hi-index | 0.00 |
We present a new approach for the retrieval of texts with non-standard spelling, which is important for historic texts e.g. in English or German. In this paper, we describe the overall architecture of our system, followed by its evaluation. Given a search term as lemma, we use a dictionary of contemporary German for finding all inflected and derived forms of the lemma. Then we apply transformation rules (derived from training data) for generating historic spelling variants. For the evaluation, we regard the resulting retrieval quality. The experimental results show that we can improve the retrieval quality for historic collections substantially.