Automatic language identification using multivariate analysis

  • Authors:
  • J. Vinosh Babu;S. Baskaran

  • Affiliations:
  • AU-KBC Research Centre;AU-KBC Research Centre

  • Venue:
  • CICLing'05 Proceedings of the 6th international conference on Computational Linguistics and Intelligent Text Processing
  • Year:
  • 2005

Quantified Score

Hi-index 0.00

Visualization

Abstract

Identifying the language of an e-text is complicated by the existence of a number of character sets for a single language. We present a language identification system that uses the Multivariate Analysis (MVA) for dimensionality reduction and classification. We compare its performance with existing schemes viz., the N-grams and compression.