Practical NLP-Based Text Indexing
IBERAMIA 2002 Proceedings of the 8th Ibero-American Conference on AI: Advances in Artificial Intelligence
Current research issues and trends in non-English Web searching
Information Retrieval
Integrating syntactic information by means of data fusion techniques
EUROCAST'05 Proceedings of the 10th international conference on Computer Aided Systems Theory
COLE experiments at QA@CLEF 2004 spanish monolingual track
CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images
Hi-index | 0.01 |
In this paper we consider a set of natural language processing techniques that can be used to analyze large amounts of texts, focusing on the advanced tokenizer which accounts for a number of complex linguistic phenomena, as well as for pre-tagging tasks such as proper noun recognition. We also show the results of several experiments performed in order to study the impact of the strategy chosen for the recognition of proper nouns.