Identifying segment topics in medical dictations

Authors:
Johannes Matiasek;Jeremy Jancsary;Alexandra Klein;Harald Trost
Affiliations:
Austrian Research Institute for Artificial Intelligence, Wien, Austria;Austrian Research Institute for Artificial Intelligence, Wien, Austria;Austrian Research Institute for Artificial Intelligence, Wien, Austria;Medical University Vienna, Austria
Venue:
SRSL '09 Proceedings of the 2nd Workshop on Semantic Representation of Spoken Language
Year:
2009

Citing 4
Cited 0

The nature of statistical learning theory

The nature of statistical learning theory
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
LIBLINEAR: A Library for Large Linear Classification

The Journal of Machine Learning Research
Revealing the structure of medical dictations with conditional random fields

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this paper, we describe the use of lexical and semantic features for topic classification in dictated medical reports. First, we employ SVM classification to assign whole reports to coarse work-type categories. Afterwards, text segments and their topic are identified in the output of automatic speech recognition. This is done by assigning work-type-specific topic labels to each word based on features extracted from a sliding context window, again using SVM classification utilizing semantic features. Classifier stacking is then used for a posteriori error correction, yielding a further improvement in classification accuracy.