LexStat: automatic detection of cognates in multilingual wordlists

  • Authors: Johann-Mattis List

  • Affiliations: Heinrich Heine University Düsseldorf, Germany

  • Venue: EACL 2012: Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH
  • Year: 2012

Abstract

This paper presents a new method for automatic cognate detection in multilingual wordlists. The main idea is to combine different approaches to sequence comparison from historical linguistics and evolutionary biology into a new framework that closely models the most important aspects of the comparative method. The method is implemented as a Python program, providing a convenient tool that is publicly available, easily applicable, and open to further testing and improvement. Tests on a large gold standard of IPA-encoded wordlists show that its results are highly consistent and outperform those of previous methods.
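
To make the task concrete, the following is a minimal sketch of distance-based cognate detection in general, not the LexStat algorithm itself. It assumes a plain normalized edit distance over pre-segmented IPA forms and a simple single-linkage grouping with a fixed threshold; the function names, the example forms, and the threshold of 0.4 are all hypothetical choices made for illustration and do not come from the paper.

```python
from itertools import combinations


def edit_distance(a, b):
    """Levenshtein distance between two sequences of sound segments."""
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # match or substitution
        prev = curr
    return prev[-1]


def normalized_distance(a, b):
    """Edit distance divided by the length of the longer sequence."""
    longest = max(len(a), len(b))
    return edit_distance(a, b) / longest if longest else 0.0


def cluster_cognates(words, threshold=0.4):
    """Group words whose pairwise distance is at or below the threshold.

    `words` maps a language name to a list of segments. Grouping is
    single-linkage, implemented with a small union-find structure.
    Returns a mapping from language name to a cluster id (1, 2, ...).
    """
    langs = list(words)
    parent = {lang: lang for lang in langs}

    def find(x):
        # Follow parent links to the cluster representative.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for l1, l2 in combinations(langs, 2):
        if normalized_distance(words[l1], words[l2]) <= threshold:
            parent[find(l1)] = find(l2)

    ids = {}
    return {lang: ids.setdefault(find(lang), len(ids) + 1) for lang in langs}


# Hypothetical, simplified transcriptions for the concept "hand".
words = {
    "German": ["h", "a", "n", "t"],
    "Dutch": ["h", "ɑ", "n", "t"],
    "Spanish": ["m", "a", "n", "o"],
    "Italian": ["m", "a", "n", "o"],
}

clusters = cluster_cognates(words, threshold=0.4)
print(clusters)
```

With these toy forms, the German and Dutch words end up in one group and the Spanish and Italian words in another. A surface measure of this kind is sensitive to the chosen threshold and only captures superficial similarity, so it should be read as a baseline illustration of the problem rather than as a description of the method presented in the paper.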