Computational dialectology in Irish Gaelic

Authors:
Brett Kessler
Affiliations:
Stanford University, Stanford CA
Venue:
EACL '95 Proceedings of the seventh conference on European chapter of the Association for Computational Linguistics
Year:
1995

Citing 0
Cited 11

Comparison and classification of dialects

EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Linguistic variation and computation

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Measuring Norwegian dialect distances using acoustic features

Speech Communication
A combined phonetic-phonological approach to estimating cross-language phoneme similarity in an ASR environment

SIGPHON '06 Proceedings of the Eighth Meeting of the ACL Special Interest Group on Computational Phonology and Morphology
The relative divergence of Dutch dialect pronunciations from their common source: an exploratory study

SigMorPhon '07 Proceedings of Ninth Meeting of the ACL Special Interest Group in Computational Morphology and Phonology
Inducing sound segment differences using Pair Hidden Markov Models

SigMorPhon '07 Proceedings of Ninth Meeting of the ACL Special Interest Group in Computational Morphology and Phonology
Linguistic distances

LD '06 Proceedings of the Workshop on Linguistic Distances
Evaluation of string distance algorithms for dialectology

LD '06 Proceedings of the Workshop on Linguistic Distances
Evaluating the pairwise string alignment of pronunciations

LaTeCH-SHELT&R '09 Proceedings of the EACL 2009 Workshop on Language Technology and Resources for Cultural Heritage, Social Sciences, Humanities, and Education
Levenshtein distances fail to identify language relationships accurately

Computational Linguistics
Improving suffix tree clustering with new ranking and similarity measures

ADMA'11 Proceedings of the 7th international conference on Advanced Data Mining and Applications - Volume Part II

Quantified Score

Hi-index	0.00

Visualization

Abstract

Dialect groupings can be discovered objectively and automatically by cluster analysis of phonetic transcriptions such as those found in a linguistic atlas. The first step in the analysis, the computation of linguistic distance between each pair of sites, can be computed as Levenshtein distance between phonetic strings. This correlates closely with the much more laborious technique of determining and counting isoglosses, and is more accurate than the more familiar metric of computing Hamming distance based on whether vocabulary entries match. In the actual clustering step, traditional agglomerative clustering works better than the top-down technique of partitioning around medoids. When agglomerative clustering of phonetic string comparison distances is applied to Gaelic, reasonable dialect boundaries are obtained, corresponding to national and (within Ireland) provincial boundaries.