The nature of statistical learning theory
The nature of statistical learning theory
Fast training of support vector machines using sequential minimal optimization
Advances in kernel methods
An open graph visualization system and its applications to software engineering
Software—Practice & Experience - Special issue on discrete algorithm engineering
Handbook of Graphs and Networks: From the Genome to the Internet
Handbook of Graphs and Networks: From the Genome to the Internet
Cluster ensembles --- a knowledge reuse framework for combining multiple partitions
The Journal of Machine Learning Research
On Cheeger-type inequalities for weighted graphs
Journal of Graph Theory
Information theoretic measures for clusterings comparison: is a correction for chance necessary?
ICML '09 Proceedings of the 26th Annual International Conference on Machine Learning
EACL '09 Proceedings of the 12th Conference of the European Chapter of the Association for Computational Linguistics
ACLShort '09 Proceedings of the ACL-IJCNLP 2009 Conference Short Papers
On finding graph clusterings with maximum modularity
WG'07 Proceedings of the 33rd international conference on Graph-theoretic concepts in computer science
Unsupervised discovery of rhyme schemes
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
CMCL '11 Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics
Unsupervised rhyme scheme identification in hip hop lyrics using hidden markov models
SLSP'13 Proceedings of the First international conference on Statistical Language and Speech Processing
Hi-index | 0.00 |
Abstract: How much can we infer about the pronunciation of a language - past or present - by observing which words its speakers rhyme? This paper explores the connection between pronunciation and network structure in sets of rhymes. We consider the rhyme graphs corresponding to rhyming corpora, where nodes are words and edges are observed rhymes. We describe the graph G corresponding to a corpus of ~ 12000 rhymes from English poetry written c. 1900, and find a close correspondence between graph structure and pronunciation: most connected components show community structure that reflects the distinction between full and half rhymes. We build classifiers for predicting which components correspond to full rhymes, using a set of spectral and non-spectral features. Feature selection gives a small number (1-5) of spectral features, with accuracy and F-measure of ~90%, reflecting that positive components are essentially those without any good partition. We partition components of G via maximum modularity, giving a new graph, G', in which the ''quality'' of components, by several measures, is much higher than in G. We discuss how rhyme graphs could be used for historical pronunciation reconstruction.