Whether NLP techniques improve the results of information retrieval is still considered controversial, whereas linguistic processing is already well established in cross-language information retrieval (CLIR). This paper presents Mpro-IR, a CLIR component developed as the core module of a multilingual information system for the legal domain. The component indexes not only the lexical base form but also derivational information and, for German, information about the decomposition of compounds. This information is provided by a sophisticated morpho-syntactic analyser and is exploited for query translation, query expansion, search, and document ranking. The objective of the CLEF evaluation was to assess this linguistically based retrieval approach in an unrestricted domain. The investigation focused on how derivation and decomposition can contribute to improving recall.
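The idea of indexing compound parts and derivational relatives alongside the base form can be illustrated with a minimal sketch. This is not the Mpro-IR implementation: the lexicon tables `COMPOUND_PARTS` and `DERIVATION_ROOT`, the helper names, and the sample documents are all hypothetical stand-ins for what a morpho-syntactic analyser would supply, and ranking and query translation are left out.

```python
from collections import defaultdict

# Hypothetical toy lexicon. A real system would obtain this information
# from a morpho-syntactic analyser rather than from hand-written tables.
COMPOUND_PARTS = {
    "arbeitsvertrag": ["arbeit", "vertrag"],  # German compound and its parts
}
DERIVATION_ROOT = {
    "vertraglich": "vertrag",  # derived adjective -> root
    "arbeiter": "arbeit",      # derived noun -> root
}


def index_terms(base_form):
    """Return the index keys for one lexical base form:
    the form itself, its compound parts, and their derivational roots."""
    terms = {base_form}
    terms.update(COMPOUND_PARTS.get(base_form, []))
    terms.update(DERIVATION_ROOT[t] for t in list(terms) if t in DERIVATION_ROOT)
    return terms


def build_index(docs):
    """Build an inverted index from {doc_id: [base forms]}."""
    index = defaultdict(set)
    for doc_id, base_forms in docs.items():
        for base_form in base_forms:
            for term in index_terms(base_form):
                index[term].add(doc_id)
    return index


def search(index, query_base_forms):
    """Expand each query term the same way as index terms, then collect hits."""
    hits = set()
    for base_form in query_base_forms:
        for term in index_terms(base_form):
            hits |= index.get(term, set())
    return hits


docs = {
    "d1": ["arbeitsvertrag"],
    "d2": ["vertraglich"],
    "d3": ["steuer"],
}
index = build_index(docs)
result = search(index, ["vertrag"])
```

In this toy setting, a query for "vertrag" retrieves both the document containing the compound and the one containing the derived adjective, neither of which would match on the base form alone. This is the kind of recall gain the evaluation set out to measure.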