Evaluating the impact of alternative dependency graph encodings on solving event extraction tasks

Authors:
Ekaterina Buyko;Udo Hahn
Affiliations:
Friedrich-Schiller-Universität Jena, Jena, Germany;Friedrich-Schiller-Universität Jena, Jena, Germany
Venue:
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Year:
2010

Citing 11
Cited 4

Intricacies of Collins' Parsing Model

Computational Linguistics
Coarse-to-fine n-best parsing and MaxEnt discriminative reranking

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Non-projective dependency parsing using spanning tree algorithms

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Maximum entropy estimation for feature forests

HLT '02 Proceedings of the second international conference on Human Language Technology Research
Overview of BioNLP'09 shared task on event extraction

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
Extracting complex biological events with rich graph-based feature sets

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
Event extraction from trimmed dependency graphs

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
CoNLL-X shared task on multilingual dependency parsing

CoNLL-X '06 Proceedings of the Tenth Conference on Computational Natural Language Learning
Evaluating and integrating treebank parsers on a biomedical corpus

Software '05 Proceedings of the Workshop on Software
Comparative experiments on learning information extractors for proteins and their interactions

Artificial Intelligence in Medicine
Any domain parsing: automatic domain adaptation for natural language parsing

Any domain parsing: automatic domain adaptation for natural language parsing

BioNLP Shared Task 2011: supporting resources

BioNLP Shared Task '11 Proceedings of the BioNLP Shared Task 2011 Workshop
Evaluating dependency parsing: robust and heuristics-free cross-nnotation evaluation

EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Tree kernel-based protein-protein interaction extraction from biomedical literature

Journal of Biomedical Informatics
Divisible transition systems and multiplanar dependency parsing

Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

In state-of-the-art approaches to information extraction (IE), dependency graphs constitute the fundamental data structure for syntactic structuring and subsequent knowledge elicitation from natural language documents. The top-performing systems in the BioNLP 2009 Shared Task on Event Extraction all shared the idea to use dependency structures generated by a variety of parsers --- either directly or in some converted manner --- and optionally modified their output to fit the special needs of IE. As there are systematic differences between various dependency representations being used in this competition, we scrutinize on different encoding styles for dependency information and their possible impact on solving several IE tasks. After assessing more or less established dependency representations such as the Stanford and CoNLL-X dependencies, we will then focus on trimming operations that pave the way to more effective IE. Our evaluation study covers data from a number of constituency- and dependency-based parsers and provides experimental evidence which dependency representations are particularly beneficial for the event extraction task. Based on empirical findings from our study we were able to achieve the performance of 57.2% F-score on the development data set of the BioNLP Shared Task 2009.