Evaluation and extension of maximum entropy models with inequality constraints
EMNLP '03 Proceedings of the 2003 conference on Empirical methods in natural language processing
The GENIA corpus: an annotated research abstract corpus in molecular biology domain
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Self-training for biomedical parsing
HLT-Short '08 Proceedings of the 46th Annual Meeting of the Association for Computational Linguistics on Human Language Technologies: Short Papers
Overview of BioNLP'09 shared task on event extraction
BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
Extracting complex biological events with rich graph-based feature sets
BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
Static relations: a piece in the biomedical information extraction puzzle
BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
Incorporating GENETAG-style annotation to GENIA corpus
BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
Any domain parsing: automatic domain adaptation for natural language parsing
Any domain parsing: automatic domain adaptation for natural language parsing
EVEX: a pubmed-scale resource for homology-based generalization of text mining predictions
BioNLP '11 Proceedings of BioNLP 2011 Workshop
Towards exhaustive protein modification event extraction
BioNLP '11 Proceedings of BioNLP 2011 Workshop
SimSem: fast approximate string matching in relation to semantic category disambiguation
BioNLP '11 Proceedings of BioNLP 2011 Workshop
Generalizing biomedical event extraction
BioNLP Shared Task '11 Proceedings of the BioNLP Shared Task 2011 Workshop
BioNLP '12 Proceedings of the 2012 Workshop on Biomedical Natural Language Processing
Hi-index | 0.00 |
We present the first full-scale event extraction experiment covering the titles and abstracts of all PubMed citations. Extraction is performed using a pipeline composed of state-of-the-art methods: the BANNER named entity recognizer, the McClosky-Charniak domain-adapted parser, and the Turku Event Extraction System. We analyze the statistical properties of the resulting dataset and present evaluations of the core event extraction as well as negation and speculation detection components of the system. Further, we study in detail the set of extracted events relevant to the apoptosis pathway to gain insight into the biological relevance of the result. The dataset, consisting of 19.2 million occurrences of 4.5 million unique events, is freely available for use in research at http://bionlp.utu.fi/.