On Morphological Analysis for Learner Language, Focusing on Russian

  • Authors:
  • Markus Dickinson

  • Affiliations:
  • Department of Linguistics, Indiana University, Bloomington, USA

  • Venue:
  • Research on Language and Computation
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe a framework for performing morphological analysis to account for learner language, focusing on Russian as an example of an inflecting language. Because a set of linguistic analyses is needed to provide feedback on potentially noisy data, there is a large amount of ambiguity for even well-formed words. Using a segmented POS lexicon as a test case, we show how to analyze subparts of words, in order to analyze variations. After describing and implementing this framework for Russian, we focus on removing undesirable analyses to keep the task feasible. This is essentially an investigation of how much overgeneration of analyses is a problem and under what assumptions it can be reduced.