Using WordNet to disambiguate word senses for text retrieval
SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Word sense disambiguation for free-text indexing using a massive semantic network
CIKM '93 Proceedings of the second international conference on Information and knowledge management
Query expansion using lexical-semantic relations
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
The nature of statistical learning theory
The nature of statistical learning theory
Making large-scale support vector machine learning practical
Advances in kernel methods
Learning probabilistic models of the Web (poster session)
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
An Evaluation of Statistical Approaches to Text Categorization
Information Retrieval
On feature distributional clustering for text categorization
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Journal of Intelligent Information Systems
Class-based probability estimation using a semantic hierarchy
Computational Linguistics
Feature Engineering for Text Classification
ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Support Vector Machines Based on a Semantic Kernel for Text Categorization
IJCNN '00 Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks (IJCNN'00)-Volume 5 - Volume 5
Generalizing case frames using a thesaurus and the MDL principle
Computational Linguistics
Word sense disambiguation using Conceptual Density
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Kernel-based relation extraction from investigative data
Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data
Improving text classification by a sense spectrum approach to term expansion
CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Kernel methods for word sense disambiguation and acronym expansion
AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Re-ranking models based-on small training data for spoken language understanding
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Convolution kernels on constituent, dependency and sequential structures for relation extraction
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Syntactic/semantic structures for textual entailment recognition
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Syntactic and semantic structure for opinion expression detection
CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Kernel engineering for fast and easy design of natural language applications
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Kernel Engineering for Fast and Easy Design of Natural Language Applications
Kernel-based reranking for named-entity extraction
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Learning discriminative projections for text similarity measures
CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Using syntactic and semantic structural kernels for classifying definition questions in Jeopardy!
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Structured lexical similarity via convolution kernels on dependency trees
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Hi-index | 0.00 |
Research on document similarity has shown that complex representations are not more accurate than the simple bag-of-words. Term clustering, e.g. using latent semantic indexing, word co-occurrences or synonym relations using a word ontology have been shown not very effective. In particular, when to extend the similarity function external prior knowledge is used, e.g. WordNet, the retrieval system decreases its performance. The critical issues here are methods and conditions to integrate such knowledge. In this paper we propose kernel functions to add prior knowledge to learning algorithms for document classification. Such kernels use a term similarity measure based on the WordNet hierarchy. The kernel trick is used to implement such space in a balanced and statistically coherent way. Cross-validation results show the benefit of the approach for the Support Vector Machines when few training data is available.