Generative content models for structural analysis of medical abstracts

Authors:
Jimmy Lin;Damianos Karakos;Dina Demner-Fushman;Sanjeev Khudanpur
Affiliations:
University of Maryland, College Park, MD;Johns Hopkins University, Baltimore, MD;University of Maryland, College Park, MD;Johns Hopkins University, Baltimore, MD
Venue:
LNLBioNLP '06 Proceedings of the HLT-NAACL BioNLP Workshop on Linking Natural Language and Biology
Year:
2006

Citing 6
Cited 4

Text generation: using discourse strategies and focus constraints to generate natural language text

Text generation: using discourse strategies and focus constraints to generate natural language text
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
Leveraging a common representation for personalized search and summarization in a medical digital library

Proceedings of the 3rd ACM/IEEE-CS joint conference on Digital libraries
The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text

Journal of Biomedical Informatics - Special issue: Unified medical language system
An unsupervised approach to recognizing discourse relations

ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
What's yours and what's mine: determining intellectual attribution in scientific text

EMNLP '00 Proceedings of the 2000 Joint SIGDAT conference on Empirical methods in natural language processing and very large corpora: held in conjunction with the 38th Annual Meeting of the Association for Computational Linguistics - Volume 13

Extracting semantics in a clinical scenario

ACSW '07 Proceedings of the fifth Australasian symposium on ACSW frontiers - Volume 68
Robust argumentative zoning for sensemaking in scholarly documents

NLP4DL'09/AT4DL'09 Proceedings of the 2009 international conference on Advanced language technologies for digital libraries
Discourse structure and computation: past, present and future

ACL '12 Proceedings of the ACL-2012 Special Workshop on Rediscovering 50 Years of Discoveries
Discourse structure and language technology

Natural Language Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

The ability to accurately model the content structure of text is important for many natural language processing applications. This paper describes experiments with generative models for analyzing the discourse structure of medical abstracts, which generally follow the pattern of "introduction", "methods", "results", and "conclusions". We demonstrate that Hidden Markov Models are capable of accurately capturing the structure of such texts, and can achieve classification accuracy comparable to that of discriminative techniques. In addition, generative approaches provide advantages that may make them preferable to discriminative techniques such as Support Vector Machines under certain conditions. Our work makes two contributions: at the application level, we report good performance on an interesting task in an important domain; more generally, our results contribute to an ongoing discussion regarding the tradeoffs between generative and discriminative techniques.