Identifying segment topics in medical dictations

  • Authors:
  • Johannes Matiasek;Jeremy Jancsary;Alexandra Klein;Harald Trost

  • Affiliations:
  • Austrian Research Institute for Artificial Intelligence, Wien, Austria;Austrian Research Institute for Artificial Intelligence, Wien, Austria;Austrian Research Institute for Artificial Intelligence, Wien, Austria;Medical University Vienna, Austria

  • Venue:
  • SRSL '09 Proceedings of the 2nd Workshop on Semantic Representation of Spoken Language
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper, we describe the use of lexical and semantic features for topic classification in dictated medical reports. First, we employ SVM classification to assign whole reports to coarse work-type categories. Afterwards, text segments and their topic are identified in the output of automatic speech recognition. This is done by assigning work-type-specific topic labels to each word based on features extracted from a sliding context window, again using SVM classification utilizing semantic features. Classifier stacking is then used for a posteriori error correction, yielding a further improvement in classification accuracy.