A stochastic finite-state morphological parser for Turkish

  • Authors:
  • Haşim Sak;Tunga Güngör;Murat Saraçlar

  • Affiliations:
  • Boğaziçi University, Bebek, İstanbul, Turkey;Boğaziçi University, Bebek, İstanbul, Turkey;Boğaziçi University, Bebek, İstanbul, Turkey

  • Venue:
  • ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents the first stochastic finite-state morphological parser for Turkish. The non-probabilistic parser is a standard finite-state transducer implementation of two-level morphology formalism. A disambiguated text corpus of 200 million words is used to stochastize the morphotactics transducer, then it is composed with the morphophonemics transducer to get a stochastic morphological parser. We present two applications to evaluate the effectiveness of the stochastic parser; spelling correction and morphology-based language modeling for speech recognition.