Techniques for automatically correcting words in text
ACM Computing Surveys (CSUR)
IEEE Transactions on Pattern Analysis and Machine Intelligence
A technique for computer detection and correction of spelling errors
Communications of the ACM
Chinese input with keyboard and eye-tracking: an anatomical study
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
A spelling correction program based on a noisy channel model
COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 2
Correcting real-word spelling errors by restoring lexical cohesion
Natural Language Engineering
Pronunciation modeling for improved spelling correction
ACL '02 Proceedings of the 40th Annual Meeting on Association for Computational Linguistics
A new statistical approach to Chinese Pinyin input
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Automatic detecting/correcting errors in Chinese text by an approximate word-matching algorithm
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
An improved error model for noisy channel spelling correction
ACL '00 Proceedings of the 38th Annual Meeting on Association for Computational Linguistics
Learning a spelling error model from search query logs
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Incorporating user behaviors in new word detection
IJCAI'09 Proceedings of the 21st international jont conference on Artifical intelligence
Capturing errors in written Chinese words
ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
Using the web for language independent spellchecking and autocorrection
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Real-word spelling correction using Google Web IT 3-grams
EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 3 - Volume 3
CICLing'08 Proceedings of the 9th international conference on Computational linguistics and intelligent text processing
Growing related words from seed via user behaviors: a re-ranking based approach
ACLstudent '10 Proceedings of the ACL 2010 Student Research Workshop
Generating confusion sets for context-sensitive error correction
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
A large scale ranker-based system for search query spelling correction
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics
Visually and phonologically similar characters in incorrect simplified Chinese words
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
CHIME: an efficient error-tolerant Cinese pinyin input method
IJCAI'11 Proceedings of the Twenty-Second international joint conference on Artificial Intelligence - Volume Volume Three
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
A unified approach to transliteration-based text input with online spelling correction
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Hi-index | 0.00 |
Chinese Pinyin input method is very important for Chinese language information processing. Users may make errors when they are typing in Chinese words. In this paper, we are concerned with the reasons that cause the errors. Inspired by the observation that pressing backspace is one of the most common user behaviors to modify the errors, we collect 54, 309, 334 error-correction pairs from a real-world data set that contains 2, 277, 786 users via backspace operations. In addition, we present a comparative analysis of the data to achieve a better understanding of users' input behaviors. Comparisons with English typos suggest that some language-specific properties result in a part of Chinese input errors.