The language of a document is one of the most widely used search keys on the Internet. This article describes efficient and easily extensible solutions to the problem of identifying the language of written texts, based on closed grammatical classes (function words such as articles, prepositions, and conjunctions). An identification tool was developed that recognizes texts written in Portuguese, Spanish, French, and English.
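To make the idea of identification by closed grammatical classes concrete, here is a minimal sketch in Python. It is not the tool described in the article: the word lists, the function names (`tokenize`, `score_languages`, `identify_language`), and the simple hit-count scoring are all assumptions chosen for illustration. The sketch counts how many tokens of a text appear in each language's small list of closed-class words and returns the language with the most hits.

```python
import re

# Hypothetical, deliberately tiny closed-class word lists (articles,
# prepositions, conjunctions, pronouns, negation). These are illustrative
# assumptions, not the lists used by the article's tool.
CLOSED_CLASS_WORDS = {
    "portuguese": {"o", "a", "os", "as", "um", "uma", "de", "do", "da",
                   "em", "não", "que", "e", "com", "para", "ele", "ela"},
    "spanish": {"el", "la", "los", "las", "un", "una", "de", "del",
                "en", "no", "que", "y", "con", "para", "él", "ella"},
    "french": {"le", "la", "les", "un", "une", "de", "du", "des", "en",
               "ne", "pas", "que", "et", "avec", "pour", "il", "elle"},
    "english": {"the", "a", "an", "of", "in", "not", "that", "and",
                "with", "for", "he", "she", "it", "to"},
}


def tokenize(text):
    """Lowercase the text and split it into runs of letters (accents kept)."""
    return re.findall(r"[^\W\d_]+", text.lower())


def score_languages(text):
    """Count, per language, the tokens that are closed-class words."""
    tokens = tokenize(text)
    return {
        lang: sum(1 for token in tokens if token in words)
        for lang, words in CLOSED_CLASS_WORDS.items()
    }


def identify_language(text):
    """Return the language with the most hits, or None if nothing matched."""
    scores = score_languages(text)
    best = max(scores, key=scores.get)
    return best if scores[best] > 0 else None


if __name__ == "__main__":
    print(identify_language("O gato não está em cima da mesa e ele não quer sair."))
    print(identify_language("Le chat n'est pas sur la table et il ne veut pas sortir."))
```

Because each language is just an entry in a dictionary, supporting another language in this sketch means adding one more word list, which is one plausible reading of "easily extensible". A practical system would need larger lists and a tie-breaking rule for words shared between languages, such as "de" and "que".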