Effective use of WordNet semantics via kernel-based learning

Authors:
Roberto Basili;Marco Cammisa;Alessandro Moschitti
Affiliations:
University of Rome "Tor Vergata", Rome, Italy;University of Rome "Tor Vergata", Rome, Italy;University of Rome "Tor Vergata", Rome, Italy
Venue:
CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
Year:
2005

Citing 14
Cited 12

Using WordNet to disambiguate word senses for text retrieval

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Word sense disambiguation for free-text indexing using a massive semantic network

CIKM '93 Proceedings of the second international conference on Information and knowledge management
Query expansion using lexical-semantic relations

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
The nature of statistical learning theory

The nature of statistical learning theory
Making large-scale support vector machine learning practical

Advances in kernel methods
Learning probabilistic models of the Web (poster session)

SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
An Evaluation of Statistical Approaches to Text Categorization

Information Retrieval
On feature distributional clustering for text categorization

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Latent Semantic Kernels

Journal of Intelligent Information Systems
Class-based probability estimation using a semantic hierarchy

Computational Linguistics
Feature Engineering for Text Classification

ICML '99 Proceedings of the Sixteenth International Conference on Machine Learning
Support Vector Machines Based on a Semantic Kernel for Text Categorization

IJCNN '00 Proceedings of the IEEE-INNS-ENNS International Joint Conference on Neural Networks (IJCNN'00)-Volume 5 - Volume 5
Generalizing case frames using a thesaurus and the MDL principle

Computational Linguistics
Word sense disambiguation using Conceptual Density

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1

Kernel-based relation extraction from investigative data

Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data
Improving text classification by a sense spectrum approach to term expansion

CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Kernel methods for word sense disambiguation and acronym expansion

AAAI'06 proceedings of the 21st national conference on Artificial intelligence - Volume 2
Re-ranking models based-on small training data for spoken language understanding

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Convolution kernels on constituent, dependency and sequential structures for relation extraction

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
Syntactic/semantic structures for textual entailment recognition

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Syntactic and semantic structure for opinion expression detection

CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Kernel engineering for fast and easy design of natural language applications

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Kernel Engineering for Fast and Easy Design of Natural Language Applications
Kernel-based reranking for named-entity extraction

COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Learning discriminative projections for text similarity measures

CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Using syntactic and semantic structural kernels for classifying definition questions in Jeopardy!

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Structured lexical similarity via convolution kernels on dependency trees

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

Research on document similarity has shown that complex representations are not more accurate than the simple bag-of-words. Term clustering, e.g. using latent semantic indexing, word co-occurrences or synonym relations using a word ontology have been shown not very effective. In particular, when to extend the similarity function external prior knowledge is used, e.g. WordNet, the retrieval system decreases its performance. The critical issues here are methods and conditions to integrate such knowledge. In this paper we propose kernel functions to add prior knowledge to learning algorithms for document classification. Such kernels use a term similarity measure based on the WordNet hierarchy. The kernel trick is used to implement such space in a balanced and statistically coherent way. Cross-validation results show the benefit of the approach for the Support Vector Machines when few training data is available.