Automatic language identification of written texts

  • Authors:
  • Rafael Dueire Lins;Paulo Gonçalves

  • Affiliations:
  • Universidade Federal de Pernambuco, Recife - Pernambuco - Brazil;Universidade Federal de Pernambuco, Recife - Pernambuco - Brazil

  • Venue:
  • Proceedings of the 2004 ACM symposium on Applied computing
  • Year:
  • 2004

Quantified Score

Hi-index 0.00

Visualization

Abstract

Language identification is one of the search keys of most widespread use in the Internet. This article describes efficient and easily extensible solutions to the problem of identifying the language of written texts based on closed grammatical classes. An identification tool was developed for recognizing texts written in Portuguese, Spanish, French and English.