A joint statistical model for simultaneous word spacing and spelling error correction for Korean

  • Authors:
  • Hyungjong Noh;Jeong-Won Cha;Gary Geunbae Lee

  • Affiliations:
  • Pohang University of Science & Technology (POSTECH), Pohang, Republic of Korea;Changwon National University, Changwon Gyeongnam, Korea;Pohang University of Science & Technology (POSTECH), Pohang, Republic of Korea

  • Venue:
  • ACL '07 Proceedings of the 45th Annual Meeting of the ACL on Interactive Poster and Demonstration Sessions
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents noisy-channel based Korean preprocessor system, which corrects word spacing and typographical errors. The proposed algorithm corrects both errors simultaneously. Using Eojeol transition pattern dictionary and statistical data such as Eumjeol n-gram and Jaso transition probabilities, the algorithm minimizes the usage of huge word dictionaries.