Applications of graph theory to an English rhyming corpus

Authors: Morgan Sonderegger
Venue: Computer Speech and Language
Year: 2011

Abstract

How much can we infer about the pronunciation of a language, past or present, by observing which words its speakers rhyme? This paper explores the connection between pronunciation and network structure in sets of rhymes. We consider the rhyme graphs corresponding to rhyming corpora, where nodes are words and edges are observed rhymes. We describe the graph G corresponding to a corpus of ~12,000 rhymes from English poetry written c. 1900, and find a close correspondence between graph structure and pronunciation: most connected components show community structure that reflects the distinction between full and half rhymes. We build classifiers for predicting which components correspond to full rhymes, using a set of spectral and non-spectral features. Feature selection yields a small number (1-5) of spectral features, with accuracy and F-measure of ~90%, reflecting that positive components are essentially those without any good partition. We partition components of G via maximum modularity, giving a new graph, G', in which the "quality" of components, by several measures, is much higher than in G. We discuss how rhyme graphs could be used for historical pronunciation reconstruction.
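The construction described in the abstract can be illustrated with a minimal sketch, assuming a toy list of rhyme pairs. The word pairs, the function names (build_rhyme_graph, connected_components, modularity) and the hand-picked partition are all hypothetical and are not taken from the paper. The sketch builds a rhyme graph, finds its connected components, and evaluates the standard Newman modularity of a given partition of one component. It does not search for the maximum-modularity partition, and the abstract does not say how that search is carried out.

```python
from collections import defaultdict

def build_rhyme_graph(rhyme_pairs):
    """Adjacency map word -> {neighbor: weight}; weight counts observed rhymes."""
    adj = defaultdict(lambda: defaultdict(int))
    for a, b in rhyme_pairs:
        if a == b:
            continue  # ignore self-rhymes in this sketch
        adj[a][b] += 1
        adj[b][a] += 1
    return adj

def connected_components(adj):
    """Return the connected components as a list of sets of words."""
    seen, components = set(), []
    for start in adj:
        if start in seen:
            continue
        seen.add(start)
        stack, comp = [start], set()
        while stack:
            node = stack.pop()
            comp.add(node)
            for nbr in adj[node]:
                if nbr not in seen:
                    seen.add(nbr)
                    stack.append(nbr)
        components.append(comp)
    return components

def modularity(adj, communities):
    """Newman modularity Q of a partition of a weighted undirected graph."""
    two_m = sum(w for nbrs in adj.values() for w in nbrs.values())
    q = 0.0
    for comm in communities:
        # Each internal edge is visited from both endpoints, so this is 2 * L_c.
        internal = sum(w for u in comm for v, w in adj[u].items() if v in comm)
        degree = sum(w for u in comm for w in adj[u].values())
        q += internal / two_m - (degree / two_m) ** 2
    return q

# Hypothetical toy data: two tight rhyme groups joined by one half rhyme
# (above/move), plus a separate pair.
pairs = [
    ("love", "dove"), ("dove", "above"), ("love", "above"),
    ("above", "move"),
    ("move", "prove"), ("prove", "groove"), ("move", "groove"),
    ("day", "say"),
]
graph = build_rhyme_graph(pairs)
components = connected_components(graph)

# Restrict the graph to the largest component and score two partitions of it.
largest = max(components, key=len)
sub = {u: dict(graph[u]) for u in largest}
q_split = modularity(sub, [{"love", "dove", "above"}, {"move", "prove", "groove"}])
q_whole = modularity(sub, [largest])
print(len(components), q_split, q_whole)
```

In this toy case the split along the half rhyme scores higher than leaving the component whole, which matches the intuition in the abstract that a component with no good partition is more likely to consist of full rhymes.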