Detecting shibboleths

  • Authors:
  • Jelena Prokić;Çağri Çöltekin;John Nerbonne

  • Affiliations:
  • Ludwig-Maximilians-Universität;University of Groningen;University of Groningen

  • Venue:
  • EACL 2012 Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

A shibboleth is a pronunciation, or, more generally, a variant of speech that betrays where a speaker is from (Judges 12:6). We propose a generalization of the well-known precision and recall scores to deal with the case of detecting distinctive, characteristic variants when the analysis is based on numerical difference scores. We also compare our proposal to Fisher's linear discriminant, and we demonstrate its effectiveness on Dutch and German dialect data. It is a general method that can be applied both in synchronic and diachronic linguistics that involve automatic classification of linguistic entities.