Semantic Annotation of City Transportation Information Dialogues Using CRF Method

  • Authors:
  • Agnieszka Mykowiecka;Jakub Waszczuk

  • Affiliations:
  • Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland 01-237 and Polish-Japanese Institute of Information Techniques, Warsaw, Poland 02-008;Institute of Computer Science, Polish Academy of Sciences, Warsaw, Poland 01-237

  • Venue:
  • TSD '09 Proceedings of the 12th International Conference on Text, Speech and Dialogue
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

The article presents results of an experiment consisting in automatic concept annotation of the transliterated spontaneous human-human dialogues in the city transportation domain. The data source was a corpus of dialogues collected at a Warsaw call center and annotated with about 200 concepts' types. The machine learning technique we used is the linear-chain Conditional Random Fields (CRF) sequence labeling approach. The model based on word lemmas in a window of length 5 gave results of concept recognition with an F-measure equal to 0.85.