An increasing number of language and speech applications are moving towards using texts from online sources as input. Despite this growth, little work has addressed integrated approaches to cleaning dirty text from such sources. This paper presents ISSAC, a mechanism of Integrated Scoring for Spelling error correction, Abbreviation expansion and Case restoration. ISSAC was first conceived as part of the text preprocessing phase of an ontology engineering project. Evaluations on 400 chat records show that ISSAC achieves an accuracy of 96.5%, compared with 74.4% for the existing approach based on Aspell alone.