MSR-NLP entry in BioNLP Shared Task 2011

Authors:
Chris Quirk;Pallavi Choudhury;Michael Gamon;Lucy Vanderwende
Affiliations:
Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA;Microsoft Research, Redmond, WA
Venue:
BioNLP Shared Task '11 Proceedings of the BioNLP Shared Task 2011 Workshop
Year:
2011

Citing 11
Cited 5

Introduction to Algorithms

Introduction to Algorithms
Accurate methods for the statistics of surprise and coincidence

Computational Linguistics - Special issue on using large corpora: I
Support vector machine learning for interdependent and structured output spaces

ICML '04 Proceedings of the twenty-first international conference on Machine learning
Discriminative learning and spanning tree algorithms for dependency parsing

Discriminative learning and spanning tree algorithms for dependency parsing
Feature forest models for probabilistic hpsg parsing

Computational Linguistics
Self-training for biomedical parsing

HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Extracting complex biological events with rich graph-based feature sets

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
Forest-based translation rule extraction

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Web-scale distributional similarity and entity set expansion

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Joint inference for knowledge extraction from biomedical literature

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Any domain parsing: automatic domain adaptation for natural language parsing

Any domain parsing: automatic domain adaptation for natural language parsing

Overview of Genia event task in BioNLP Shared Task 2011

BioNLP Shared Task '11 Proceedings of the BioNLP Shared Task 2011 Workshop
Overview of the epigenetics and post-translational modifications (EPI) task of BioNLP Shared Task 2011

BioNLP Shared Task '11 Proceedings of the BioNLP Shared Task 2011 Workshop
BioNLP Shared Task 2011: supporting resources

BioNLP Shared Task '11 Proceedings of the BioNLP Shared Task 2011 Workshop
New resources and perspectives for biomedical event extraction

BioNLP '12 Proceedings of the 2012 Workshop on Biomedical Natural Language Processing
A three-way perspective on scientific discourse annotation for knowledge extraction

ACL '12 Proceedings of the Workshop on Detecting Structure in Scholarly Discourse

Quantified Score

Hi-index	0.00

Visualization

Abstract

We describe the system from the Natural Language Processing group at Microsoft Research for the BioNLP 2011 Shared Task. The task focuses on event extraction, identifying structured and potentially nested events from unannotated text. Our approach follows a pipeline, first decorating text with syntactic information, then identifying the trigger words of complex events, and finally identifying the arguments of those events. The resulting system depends heavily on lexical and syntactic features. Therefore, we explored methods of maintaining ambiguities and improving the syntactic representations, making the lexical information less brittle through clustering, and of exploring novel feature combinations and feature reduction. The system ranked 4th in the GENIA task with an F-measure of 51.5%, and 3rd in the EPI task with an F-measure of 64.9%.