Automatic prediction of cognate orthography using support vector machines

  • Authors:
  • Andrea Mulloni

  • Affiliations:
  • University of Wolverhampton, Wolverhampton, United Kingdom

  • Venue:
  • ACL '07 Proceedings of the 45th Annual Meeting of the ACL: Student Research Workshop
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper describes an algorithm to automatically generate a list of cognates in a target language by means of Support Vector Machines. While Levenshtein distance was used to align the training file, no knowledge repository other than an initial list of cognates used for training purposes was input into the algorithm. Evaluation was set up in a cognate production scenario which mimed a real-life situation where no word lists were available in the target language, delivering the ideal environment to test the feasibility of a more ambitious project that will involve language portability. An overall improvement of 50.58% over the baseline showed promising horizons.