A categorial variation database for English

Authors:
Nizar Habash;Bonnie Dorr
Affiliations:
University of Maryland, MD;University of Maryland, MD
Venue:
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Year:
2003

Citing 13
Cited 14

Using WordNet to disambiguate word senses for text retrieval

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Viewing morphology as an inference process

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Building a large-scale knowledge base for machine translation

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Experiments in multilingual information retrieval using the SPIDER system

SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
Corpus-based stemming using cooccurrence of word variants

ACM Transactions on Information Systems (TOIS)
DUSTer: A Method for Unraveling Cross-Language Divergences for Statistical Word-Level Alignment

AMTA '02 Proceedings of the 5th Conference of the Association for Machine Translation in the Americas on Machine Translation: From Research to Real Users
Handling Translation Divergences: Combining Statistical and Symbolic Techniques in Generation-Heavy Machine Translation

AMTA '02 Proceedings of the 5th Conference of the Association for Machine Translation in the Americas on Machine Translation: From Research to Real Users
Building a large annotated corpus of English: the penn treebank

Computational Linguistics - Special issue on using large corpora: II
Generation that exploits corpus-based statistical knowledge

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Morphological cues for lexical semantics

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
A freely available wide coverage morphological analyzer for English

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
Mapping lexical entries in a verbs database to WordNet senses

ACL '01 Proceedings of the 39th Annual Meeting on Association for Computational Linguistics
BLEU: a method for automatic evaluation of machine translation

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics

Statistical machine translation using coercive two-level syntactic transduction

EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
Inducing frame semantic verb classes from WordNet and LDOCE

ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Learning entailment rules for unary templates

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Interpretation of compound nominalisations using corpus and web statistics

MWE '06 Proceedings of the Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties
Adjective-to-verb paraphrasing in Japanese based on lexical constraints of verbs

INLG '06 Proceedings of the Fourth International Natural Language Generation Conference
Symbolic-to-statistical hybridization: extending generation-heavy machine translation

Machine Translation
Generating entailment rules from FrameNet

ACLShort '10 Proceedings of the ACL 2010 Conference Short Papers
A survey of paraphrasing and textual entailment methods

Journal of Artificial Intelligence Research
A probabilistic modeling framework for lexical entailment

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Towards strict sentence intersection: decoding and evaluation strategies

MTTG '11 Proceedings of the Workshop on Monolingual Text-To-Text Generation
Learning entailment relations by global graph structure optimization

Computational Linguistics
Towards a probabilistic model for lexical entailment

TIWTE '11 Proceedings of the TextInfer 2011 Workshop on Textual Entailment
A probabilistic lexical model for ranking textual inferences

SemEval '12 Proceedings of the First Joint Conference on Lexical and Computational Semantics - Volume 1: Proceedings of the main conference and the shared task, and Volume 2: Proceedings of the Sixth International Workshop on Semantic Evaluation
Semantic annotation for textual entailment recognition

MICAI'12 Proceedings of the 11th Mexican international conference on Advances in Computational Intelligence - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

We describe our approach to the construction and evaluation of a large-scale database called "CatVar" which contains categorial variations of English lexemes. Due to the prevalence of cross-language categorial variation in multilingual applications, our categorial-variation resource may serve as an integral part of a diverse range of natural language applications. Thus, the research reported herein overlaps heavily with that of the machine-translation, lexicon-construction, and information-retrieval communities.We apply the information-retrieval metrics of precision and recall to evaluate the accuracy and coverage of our database with respect to a human-produced gold standard. This evaluation reveals that the categorial database achieves a high degree of precision and recall. Additionally, we demonstrate that the database improves on the linkability of Porter stemmer by over 30%.