Zone identification in biology articles as a basis for information extraction

Authors:
Yoko Mizuta;Nigel Collier
Affiliations:
National Institute of Informatics, Chiyoda-ku, Tokyo, Japan;National Institute of Informatics, Chiyoda-ku, Tokyo, Japan
Venue:
JNLPBA '04 Proceedings of the International Joint Workshop on Natural Language Processing in Biomedicine and its Applications
Year:
2004

Citing 6
Cited 7

The identification of important concepts in highly structured technical papers

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Text structuration leading to an automatic summary system: RAFI

Information Processing and Management: an International Journal
Summarizing scientific articles: experiments with relevance and rhetorical status

Computational Linguistics - Summarization
Constructing Biological Knowledge Bases by Extracting Information from Text Sources

Proceedings of the Seventh International Conference on Intelligent Systems for Molecular Biology
An annotation scheme for discourse-level argumentation in research articles

EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
An unsupervised approach to recognizing discourse relations

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics

A baseline feature set for learning rhetorical zones using full articles in the biomedical domain

ACM SIGKDD Explorations Newsletter - Natural language processing and text mining
Development, implementation, and a cognitive evaluation of a definitional question answering system for physicians

Journal of Biomedical Informatics
Argumentative feedback: a linguistically-motivated term expansion for information retrieval

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Evaluating a meta-knowledge annotation scheme for bio-events

NeSp-NLP '10 Proceedings of the Workshop on Negation and Speculation in Natural Language Processing
Identifying the information structure of scientific abstracts: an investigation of three different schemes

BioNLP '10 Proceedings of the 2010 Workshop on Biomedical Natural Language Processing
Section classification in clinical notes using supervised hidden markov model

Proceedings of the 1st ACM International Health Informatics Symposium
Mining methodologies from NLP publications: A case study in automatic terminology recognition

Computer Speech and Language

Quantified Score

Hi-index	0.00

Visualization

Abstract

Information extraction (IE) in the biomedical domain is now regarded as an essential technique for the dynamic management of factual information contained in archived journal articles and abstract collections. We aim to provide a technique serving as a basis for pinpointing and organizing factual information related to experimental results. In this paper, we enhance the idea proposed in (Mizuta and Collier, 2004); annotating articles in terms of rhetorical zones with shallow nesting. We give a qualitative analysis of the zone identification (ZI) process in biology articles. Specifically, we illustrate the linguistic and other features of each zone based on our investigation of articles selected from four major online journals. We also discuss controversial cases and nested zones, and ZI using multiple features. In doing so, we provide a stronger theoretical and practical support for our framework toward automatic ZI.