Automatic construction of predicate-argument structure patterns for biomedical information extraction

  • Authors:
  • Akane Yakushiji;Yusuke Miyao;Tomoko Ohta;Yuka Tateisi;Jun'ichi Tsujii

  • Affiliations:
  • University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan and Fujitsu Laboratories Ltd.;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan and Kogakuin University;University of Tokyo, Hongo, Bunkyo-ku, Tokyo, Japan and University of Manchester, Manchester, UK

  • Venue:
  • EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents a method of automatically constructing information extraction patterns on predicate-argument structures (PASs) obtained by full parsing from a smaller training corpus. Because PASs represent generalized structures for syntactical variants, patterns on PASs are expected to be more generalized than those on surface words. In addition, patterns are divided into components to improve recall and we introduce a Support Vector Machine to learn a prediction model using pattern matching results. In this paper, we present experimental results and analyze them on how well protein-protein interactions were extracted from MEDLINE abstracts. The results demonstrated that our method improved accuracy compared to a machine learning approach using surface word/part-of-speech patterns.