Communications of the ACM
Communications of the ACM
Information Extraction: Techniques and Challenges
SCIE '97 International Summer School on Information Extraction: A Multidisciplinary Approach to an Emerging Information Technology
FACILE: Classifying Texts Integrating Pattern Matching and Information Extraction
IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Architectural elements of language engineering robustness
Natural Language Engineering
SRI International FASTUS system: MUC-6 test results and analysis
MUC6 '95 Proceedings of the 6th conference on Message understanding
Introduction to information extraction
AI Communications
Towards a cultural heritage digital library
Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries
Evolving GATE to meet new challenges in language engineering
Natural Language Engineering
Proceedings of the 6th ACM/IEEE-CS joint conference on Digital libraries
Proceedings of the 2007 conference on Human interface: Part II
Hi-index | 0.00 |
In this paper we show how we used robust human language technology, such as our domain-independent and customisable named entity recogniser, for automatic content annotation and indexing in two digital library applications. Each of these applications posed a unique challenge: one required adapting the language processing components to the non-standard written conventions of 18th century English, while the other presented the challenge of processing material in multiple modalities. This reusable technology could also form the basis for the creation of computational tools for the study of cultural heritage languages, such as Ancient Greek and Latin.