Experiments on using semantic distances between words in image caption retrieval
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Context-sensitive learning methods for text categorization
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Four text classification algorithms compared on a Dutch corpus
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Phase-based information retrieval
Information Processing and Management: an International Journal
EuroWordNet: a multilingual database with lexical semantic networks
EuroWordNet: a multilingual database with lexical semantic networks
Information Retrieval
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Affix Grammars for Natural Languages
Proceedings on Attribute Grammars, Applications and Systems
Proceedings of the FREENIX Track: 2002 USENIX Annual Technical Conference
ECIR'03 Proceedings of the 25th European conference on IR research
Hi-index | 0.00 |
This paper is concerned with the use of linguistically motivated phrases as indexing terms in Information Retrieval applications. Apart from the conventional noun phrases, we propose to use verb phrases as index terms for text classification. Techniques for phrase matching through syntactic normalization and semantical matching are described. We discuss the realization of the syntactic normalization of phrases by transduction to frames. Semantical normalization is based on lexico-semantical relations, taking into account certain properties of the classification algorithms used. The ideas described here are being implemented in the Document Routing system DORO, in which statistical learning algorithms are applied to document profiles consisting of phrases. This paper describes the rationale behind work in progress, rather than presenting final results