The Flesch index: An easily programmable readability analysis algorithm
SIGDOC '85 Proceedings of the 4th annual international conference on Systems documentation
Recall-precision trade-off: a derivation
Journal of the American Society for Information Science
Signal detection theory: valuable tools for evaluating inductive learning
Proceedings of the sixth international workshop on Machine learning
The relationship between recall and precision
Journal of the American Society for Information Science
Automatic Ambiguity Resolution in Natural Language Processing: An Empirical Approach
Automatic Ambiguity Resolution in Natural Language Processing: An Empirical Approach
An introduction to variable and feature selection
The Journal of Machine Learning Research
A freely available wide coverage morphological analyzer for English
COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
An introduction to ROC analysis
Pattern Recognition Letters - Special issue: ROC analysis in pattern recognition
Data Mining: Practical Machine Learning Tools and Techniques, Second Edition (Morgan Kaufmann Series in Data Management Systems)
Foundations and Trends in Information Retrieval
Lexical Affect Sensing: Are Affect Dictionaries Necessary to Analyze Affect?
ACII '07 Proceedings of the 2nd international conference on Affective Computing and Intelligent Interaction
A Computational Approach to Style in American Poetry
ICDM '07 Proceedings of the 2007 Seventh IEEE International Conference on Data Mining
Automatically profiling the author of an anonymous text
Communications of the ACM - Inspiring Women in Computing
Computational methods in authorship attribution
Journal of the American Society for Information Science and Technology
Authorship attribution in the wild
Language Resources and Evaluation
Hi-index | 0.00 |
Author attribution studies have demonstrated remarkable success in applying orthographic and lexicographic features of text in a variety of discrimination problems. What might poetic features, such as syllabic stress and mood, contribute? We address this question in the context of two different attribution problems: (a) kindred: differentiate Langston Hughes’ early poems from those of kindred poets and (b) diachronic: differentiate Hughes’ early from his later poems. Using a diverse set of 535 generic text features, each categorized as poetic or nonpoetic, correlation-based greedy forward search ranked the features and a support vector machine classified the poems. A small subset of features (∼10) achieved cross-validated precision and recall as high as 87%. Poetic features (rhyme patterns particularly) were nearly as effective as nonpoetic in kindred discrimination, but less effective diachronically. In other words, Hughes used both poetic and nonpoetic features in distinctive ways and his use of nonpoetic features evolved systematically while he continued to experiment with poetic features. These findings affirm qualitative studies attesting to structural elements from Black oral tradition and Black folk music (blues) and to the internal consistency of Hughes’ early poetry. © 2012 Wiley Periodicals, Inc.