Is it worth submitting this run? Assess your RTE system with a good sparring partner

  • Authors:
  • Milen Kouylekov (CELI s.r.l., Turin, Italy); Yashar Mehdad (FBK-irst and University of Trento, Trento, Italy); Matteo Negri (FBK-irst, Trento, Italy)

  • Venue:
  • TIWTE '11 Proceedings of the TextInfer 2011 Workshop on Textual Entailment
  • Year:
  • 2011

Abstract

We address two issues related to the development of systems for Recognizing Textual Entailment (RTE). The first is the impossibility of capitalizing on lessons learned across the available datasets, due to the changing nature of traditional RTE evaluation settings. The second is the lack of simple ways to assess the results achieved by a system on a given training corpus and to estimate its real potential on unseen test data. Our contribution is the extension of an open-source RTE package with an automatic mechanism for exploring the large search space of possible configurations, in order to select the most promising one for a given dataset. From the developers' point of view, the efficiency and ease of use of the system, together with the good results achieved on all previous RTE datasets, make it a useful support tool, providing an immediate term of comparison against which to position the results of their own approach.
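The configuration search described in the abstract can be pictured as an exhaustive sweep over combinations of system components, scoring each combination on the training data and keeping the best. The sketch below is a hypothetical illustration of that idea, not the paper's actual tool or API: the dimension names (`DISTANCE_ALGOS`, `COST_SCHEMES`, `ENTAILMENT_RULES`) and the evaluation function are invented for the example, and the scoring is a deterministic dummy so the code runs stand-alone.

```python
# Hedged sketch of exhaustive configuration-space search for an RTE system.
# All names below are illustrative assumptions, not the paper's real interface.
from itertools import product

# Illustrative dimensions of an RTE system configuration.
DISTANCE_ALGOS = ["token_edit_distance", "tree_edit_distance", "overlap"]
COST_SCHEMES = ["uniform", "idf_weighted"]
ENTAILMENT_RULES = [False, True]  # whether to use lexical entailment rules


def evaluate_config(algo, costs, rules, train_pairs):
    """Toy stand-in for training/evaluating one configuration.

    A real system would train on `train_pairs` and return accuracy;
    here we return a deterministic dummy score so the sketch runs.
    """
    score = len(algo) * 0.01 + len(costs) * 0.02 + (0.1 if rules else 0.0)
    return score % 1.0


def best_configuration(train_pairs):
    """Exhaustively enumerate all configurations and return the best one."""
    best, best_score = None, float("-inf")
    for algo, costs, rules in product(DISTANCE_ALGOS, COST_SCHEMES, ENTAILMENT_RULES):
        score = evaluate_config(algo, costs, rules, train_pairs)
        if score > best_score:
            best, best_score = (algo, costs, rules), score
    return best, best_score
```

In practice the search space can be pruned or sampled rather than enumerated in full; the point is only that, given a scoring function over a training corpus, configuration selection reduces to an optimization over component combinations.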