Classifying free-text triage chief complaints into syndromic categories with natural languages processing

Authors:
Wendy W. Chapman;Lee M. Christensen;Michael M. Wagner;Peter J. Haug;Oleg Ivanov;John N. Dowling;Robert T. Olszewski
Affiliations:
The RODS Laboratory, Center for Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA;Department of Medical Informatics, LDS Hospital/University of Utah, Salt Lake City, UT;The RODS Laboratory, Center for Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA;Department of Medical Informatics, LDS Hospital/University of Utah, Salt Lake City, UT;The RODS Laboratory, Center for Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA;The RODS Laboratory, Center for Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA;The RODS Laboratory, Center for Biomedical Informatics, University of Pittsburgh, Pittsburgh, PA
Venue:
Artificial Intelligence in Medicine
Year:
2005

Citing 7
Cited 14

Using a generalized instance set for automatic text categorization

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A re-examination of text categorization methods

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
An Evaluation of Statistical Approaches to Text Categorization

Information Retrieval
Machine Learning

Machine Learning
Rule-based anomaly pattern detection for detecting disease outbreaks

Eighteenth national conference on Artificial intelligence
Using nurses' natural language entries to build a concept-oriented terminology for patients' chief complaints in the emergency department

Journal of Biomedical Informatics - Special issue: Building nursing knowledge through infomatics: from concept representation to data mining
MPLUS: a probabilistic medical language understanding system

BioMed '02 Proceedings of the ACL-02 workshop on Natural language processing in the biomedical domain - Volume 3

Fever detection from free-text clinical records for biosurveillance

Journal of Biomedical Informatics
Identifying off-topic student essays without topic-specific training data

Natural Language Engineering
Ontology-enhanced automatic chief complaint classification for syndromic surveillance

Journal of Biomedical Informatics
Advanced Capabilities for Evaluating Student Writing: Detecting Off-Topic Essays Without Topic-Specific Training

Proceedings of the 2005 conference on Artificial Intelligence in Education: Supporting Learning through Intelligent and Socially Informed Technology
Methodological Review: What can natural language processing do for clinical decision support?

Journal of Biomedical Informatics
An interactive and user-centered computer system to predict physician's disease judgments in discharge summaries

Journal of Biomedical Informatics
The ngram chief complaint classifier: A novel method of automatically creating chief complaint classifiers based on international classification of diseases groupings

Journal of Biomedical Informatics
Chinese chief complaint classification for syndromic surveillance

BioSurveillance'07 Proceedings of the 2nd NSF conference on Intelligence and security informatics: BioSurveillance
Extracting information for generating a diabetes report card from free text in physicians notes

Louhi '10 Proceedings of the NAACL HLT 2010 Second Louhi Workshop on Text and Data Mining of Health Documents
Combining Bayesian Text Classification and Shrinkage to Automate Healthcare Coding: A Data Quality Analysis

Journal of Data and Information Quality (JDIQ)
Relevance ranking of intensive care nursing narratives

KES'06 Proceedings of the 10th international conference on Knowledge-Based Intelligent Information and Engineering Systems - Volume Part I
A review of public health syndromic surveillance systems

ISI'06 Proceedings of the 4th IEEE international conference on Intelligence and Security Informatics
Document-level classification of CT pulmonary angiography reports based on an extension of the ConText algorithm

Journal of Biomedical Informatics
Methodological Review: Using chief complaints for syndromic surveillance: A review of chief complaint based classifiers in North America

Journal of Biomedical Informatics

Quantified Score

Hi-index	0.00

Visualization

Abstract

Objective: Develop and evaluate a natural language processing application for classifying chief complaints into syndromic categories for syndromic surveillance.Introduction: Much of the input data for artificial intelligence applications in the medical field are free-text patient medical records, including dictated medical reports and triage chief complaints. To be useful for automated systems, the free-text must be translated into encoded form.Methods: We implemented a biosurveillance detection system from Pennsylvania to monitor the 2002 Winter Olympic Games. Because input data was in free-text format, we used a natural language processing text classifier to automatically classify free-text triage chief complaints into syndromic categories used by the biosurveillance system. The classifier was trained on 4700 chief complaints from Pennsylvania. We evaluated the ability of the classifier to classify free-text chief complaints into syndromic categories with a test set of 800 chief complaints from Utah.Results: The classifier produced the following areas under the ROC curve: Constitutional = 0.95; Gastrointestinal = 0.97; Hemorrhagic = 0.99; Neurological = 0.96; Rash = 1.0; Respiratory = 0.99; Other = 0.96. Using information stored in the system's semantic model, we extracted from the Respiratory classifications lower respiratory complaints and lower respiratory complaints with fever with a precision of 0.97 and 0.96, respectively.Conclusion: Results suggest that a trainable natural language processing text classifier can accurately extract data from free-text chief complaints for biosurveillance.