Biomedical events extraction using the hidden vector state model

Authors:
Deyu Zhou;Yulan He
Affiliations:
School of Computer Science and Engineering, Southeast University, Nanjing, Jiangsu Province 210093, China;Knowledge Media Institute, The Open University, Walton Hall, Milton Keynes MK7 6AA, United Kingdom
Venue:
Artificial Intelligence in Medicine
Year:
2011

Citing 16
Cited 0

Discovering patterns to extract protein--protein interactions from full texts

Bioinformatics
Simple algorithms for complex relation extraction with applications to biomedical IE

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
A shortest path dependency kernel for relation extraction

HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Comparisons of sequence labeling algorithms and extensions

Proceedings of the 24th international conference on Machine learning
Semi-supervised learning of the hidden vector state model for extracting protein-protein interactions

Artificial Intelligence in Medicine
Extracting Protein-Protein Interactions from MEDLINE using the Hidden Vector State model

International Journal of Bioinformatics Research and Applications
Discriminative Training of the Hidden Vector State Model for Semantic Parsing

IEEE Transactions on Knowledge and Data Engineering
A graph kernel for protein-protein interaction extraction

BioNLP '08 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing
Overview of BioNLP'09 shared task on event extraction

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
Extracting complex biological events with rich graph-based feature sets

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
Molecular event extraction from link grammar parse trees

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
From protein-protein interaction to molecular event extraction

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
Supervised classification for extracting biomedical events

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
Biomedical event detection using rules, conditional random fields and parse tree distances

BioNLP '09 Proceedings of the Workshop on Current Trends in Biomedical Natural Language Processing: Shared Task
A hybrid generative/discriminative framework to train a semantic parser from an un-annotated corpus

COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Joint inference for knowledge extraction from biomedical literature

HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Objective: Biomedical events extraction concerns about events describing changes on the state of bio-molecules from literature. Comparing to the protein-protein interactions (PPIs) extraction task which often only involves the extraction of binary relations between two proteins, biomedical events extraction is much harder since it needs to deal with complex events consisting of embedded or hierarchical relations among proteins, events, and their textual triggers. In this paper, we propose an information extraction system based on the hidden vector state (HVS) model, called HVS-BioEvent, for biomedical events extraction, and investigate its capability in extracting complex events. Methods and material: HVS has been previously employed for extracting PPIs. In HVS-BioEvent, we propose an automated way to generate abstract annotations for HVS training and further propose novel machine learning approaches for event trigger words identification, and for biomedical events extraction from the HVS parse results. Results: Our proposed system achieves an F-score of 49.57% on the corpus used in the BioNLP'09 shared task, which is only 2.38% lower than the best performing system by UTurku in the BioNLP'09 shared task. Nevertheless, HVS-BioEvent outperforms UTurku's system on complex events extraction with 36.57% vs. 30.52% being achieved for extracting regulation events, and 40.61% vs. 38.99% for negative regulation events. Conclusions: The results suggest that the HVS model with the hierarchical hidden state structure is indeed more suitable for complex event extraction since it could naturally model embedded structural context in sentences.