Reverse engineering of tree kernel feature spaces

Authors:
Daniele Pighin;Alessandro Moschitti
Affiliations:
FBK-Irst, HLT, Povo (TN), Italy;University of Trento, Povo (TN), Italy
Venue:
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
Year:
2009

Citing 28
Cited 4

Automatic labeling of semantic roles

Computational Linguistics
Kernels for Semi-Structured Data

ICML '02 Proceedings of the Nineteenth International Conference on Machine Learning
PrefixSpan: Mining Sequential Patterns by Prefix-Projected Growth

Proceedings of the 17th International Conference on Data Engineering
Estimating the Generalization Performance of an SVM Efficiently

ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Word sequence kernels

The Journal of Machine Learning Research
An introduction to variable and feature selection

The Journal of Machine Learning Research
Variable selection using svm based criteria

The Journal of Machine Learning Research
Use of the zero norm with linear models and kernel methods

The Journal of Machine Learning Research
A maximum-entropy-inspired parser

NAACL 2000 Proceedings of the 1st North American chapter of the Association for Computational Linguistics conference
New ranking algorithms for parsing and tagging: kernels over discrete structures, and the voted perceptron

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
Fast methods for kernel-based text analysis

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Accurate unlexicalized parsing

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Combined SVM-Based Feature Selection and Classification

Machine Learning
Using LTAG based features in parse reranking

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
The Proposition Bank: An Annotated Corpus of Semantic Roles

Computational Linguistics
Learning question classifiers: the role of semantic information

Natural Language Engineering
Fast On-line Kernel Learning for Trees

ICDM '06 Proceedings of the Sixth International Conference on Data Mining
Dependency tree kernels for relation extraction

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Boosting-based parse reranking with subtree features

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Domain kernels for word sense disambiguation

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Speeding up training with tree kernels for node relation labeling

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Fast and effective kernels for relational learning from texts

Proceedings of the 24th international conference on Machine learning
Structure and semantics for expressive text kernels

Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Tree kernels for semantic role labeling

Computational Linguistics
Efficient linearization of tree kernel functions

CoNLL '09 Proceedings of the Thirteenth Conference on Computational Natural Language Learning
Introduction to the CoNLL-2005 shared task: semantic role labeling

CONLL '05 Proceedings of the Ninth Conference on Computational Natural Language Learning
Combined syntactic and semantic Kernels for text classification

ECIR'07 Proceedings of the 29th European conference on IR research
Efficient convolution kernels for dependency and constituent syntactic trees

ECML'06 Proceedings of the 17th European conference on Machine Learning

On reverse feature engineering of syntactic tree kernels

CoNLL '10 Proceedings of the Fourteenth Conference on Computational Natural Language Learning
Social network extraction from texts: a thesis proposal

HLT-SS '11 Proceedings of the ACL 2011 Student Session
Verb classification using distributional similarity in syntactic and semantic structures

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
Characterizing stylistic elements in syntactic structure

EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning

Quantified Score

Hi-index	0.00

Visualization

Abstract

We present a framework to extract the most important features (tree fragments) from a Tree Kernel (TK) space according to their importance in the target kernel-based machine, e.g. Support Vector Machines (SVMs). In particular, our mining algorithm selects the most relevant features based on SVM estimated weights and uses this information to automatically infer an explicit representation of the input data. The explicit features (a) improve our knowledge on the target problem domain and (b) make large-scale learning practical, improving training and test time, while yielding accuracy in line with traditional TK classifiers. Experiments on semantic role labeling and question classification illustrate the above claims.