Sorting texts by readability

Authors:
Kumiko Tanaka-Ishii;Satoshi Tezuka;Hiroshi Terada
Affiliations:
-;-;-
Venue:
Computational Linguistics
Year:
2010

Citing 9
Cited 5

Making large-scale support vector machine learning practical

Advances in kernel methods
A statistical model for scientific readability

Proceedings of the tenth international conference on Information and knowledge management
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
Optimizing search engines using clickthrough data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Reading level assessment using support vector machines and statistical language models

ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Modeling local coherence: An entity-based approach

Computational Linguistics
Listwise approach to learning to rank: theory and algorithm

Proceedings of the 25th international conference on Machine learning
Revisiting readability: a unified framework for predicting text quality

EMNLP '08 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Learning to order things

Journal of Artificial Intelligence Research

A posteriori agreement as a quality measure for readability prediction systems

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
Readability annotation: replacing the expert by the crowd

IUNLPBEA '11 Proceedings of the 6th Workshop on Innovative Use of NLP for Building Educational Applications
Identifying science concepts and student misconceptions in an interactive essay writing tutor

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
Building readability lexicons with unannotated corpora

PITR '12 Proceedings of the First Workshop on Predicting and Improving Text Readability for target reader populations
Automatic extraction of core learning goals and generation of pedagogical sequences through a collection of digital library resources

Proceedings of the 13th ACM/IEEE-CS joint conference on Digital libraries

Quantified Score

Hi-index	0.00

Visualization

Abstract

This article presents a novel approach for readability assessment through sorting. A comparator that judges the relative readability between two texts is generated through machine learning, and a given set of texts is sorted by this comparator. Our proposal is advantageous because it solves the problem of a lack of training data, because the construction of the comparator only requires training data annotated with two reading levels. The proposed method is compared with regression methods and a state-of-the art classification method. Moreover, we present our application, called Terrace, which retrieves texts with readability similar to that of a given input text.