This paper introduces a system for categorizing unknown words. The system is based on a multicomponent architecture in which each component is responsible for identifying one class of unknown words. The paper focuses on the components that identify names and spelling errors. Each component uses a decision tree architecture to combine multiple types of evidence about the unknown word. The system is evaluated on data from live closed captions, a genre replete with a wide variety of unknown words.
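To make the multicomponent idea concrete, the following is a minimal sketch in Python and not the paper's implementation. It assumes two hypothetical evidence types (capitalization and closeness to a known word) and replaces the learned decision trees with small hand-written ones. All names here (`Component`, `categorize`, `within_one_edit`, the toy lexicon) are illustrative assumptions.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Set

Evidence = Dict[str, bool]


def within_one_edit(a: str, b: str) -> bool:
    """True if a and b differ by at most one insertion, deletion, or substitution."""
    if a == b:
        return True
    if abs(len(a) - len(b)) > 1:
        return False
    i = 0
    while i < min(len(a), len(b)) and a[i] == b[i]:
        i += 1
    # Skip one character in a, in b, or in both, then the rest must match.
    return a[i + 1:] == b[i + 1:] or a[i:] == b[i + 1:] or a[i + 1:] == b[i:]


def extract_evidence(word: str, lexicon: Set[str]) -> Evidence:
    # Hypothetical evidence types; the abstract does not list the real ones.
    return {
        "capitalized": word[:1].isupper(),
        "near_known_word": any(within_one_edit(word.lower(), w) for w in lexicon),
    }


@dataclass
class Component:
    label: str                          # the one class this component identifies
    decide: Callable[[Evidence], bool]  # stand-in for a learned decision tree


def name_tree(ev: Evidence) -> bool:
    # Hand-written stand-in tree: capitalized and unlike any known word.
    if ev["capitalized"]:
        return not ev["near_known_word"]
    return False


def spelling_tree(ev: Evidence) -> bool:
    # Hand-written stand-in tree: close to a known word and not capitalized.
    if ev["near_known_word"]:
        return not ev["capitalized"]
    return False


COMPONENTS = [Component("name", name_tree), Component("spelling_error", spelling_tree)]


def categorize(word: str, lexicon: Set[str]) -> List[str]:
    """Return the labels of every component that claims the unknown word."""
    ev = extract_evidence(word, lexicon)
    return [c.label for c in COMPONENTS if c.decide(ev)]


LEXICON = {"whether", "weather", "report"}  # toy lexicon, assumed for illustration
print(categorize("wether", LEXICON))    # ['spelling_error']
print(categorize("Zyuganov", LEXICON))  # ['name']
```

Each component sees the same evidence but makes its own decision, so a new class of unknown word can be supported by adding another `Component` without touching the existing ones. In a real system the trees would presumably be induced from labeled data rather than written by hand.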