Evaluation of text coherence for electronic essay scoring systems

  • Authors:
  • E. Miltsakaki; K. Kukich

  • Affiliations:
  • University of Pennsylvania, Philadelphia, PA 19104, USA; Educational Testing Service, Princeton, NJ 08541, USA

  • Venue:
  • Natural Language Engineering
  • Year:
  • 2004

Abstract

Existing software systems for automated essay scoring can provide NLP researchers with opportunities to test certain theoretical hypotheses, including some derived from Centering Theory. In this study we employ the Educational Testing Service's e-rater essay scoring system to examine whether local discourse coherence, as defined by a measure of Centering Theory's Rough-Shift transitions, might be a significant contributor to the evaluation of essays. Rough-Shifts within students' paragraphs often occur when topics are short-lived and unconnected, and are therefore indicative of poor topic development. We show that adding the Rough-Shift-based metric to the system significantly improves its performance, better approximating human scores and enabling valuable instructional feedback to the student. These results indicate that Rough-Shifts do indeed capture a source of incoherence, one that has not been closely examined in the Centering literature. They not only justify Rough-Shifts as a valid transition type, but also support the original formulation of Centering as a measure of discourse continuity, even in pronominal-free text. Finally, our study design, which combined automated and manual NLP techniques, highlights specific areas of NLP research and development needed to engineer practical applications.
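
To make the abstract's Rough-Shift measure concrete, the Python sketch below shows one way such a metric could be computed. It classifies the standard Centering transitions (Continue, Retain, Smooth-Shift, Rough-Shift) between adjacent utterances, each represented as a ranked list of entity mentions (its Cf list), and reports the fraction of Rough-Shifts in a paragraph. This is a minimal illustration, not e-rater's or the authors' implementation: the function names, the data representation, and the grammatical-role ranking in the example are all assumptions.

    # Illustrative sketch (NOT the paper's implementation) of scoring a
    # paragraph by its proportion of Centering Rough-Shift transitions.
    # Each utterance is a ranked list of entity mentions (its Cf list);
    # ranking by grammatical role (subject first) is assumed here.

    def backward_center(prev_cf, cur_cf):
        """Cb(Un): highest-ranked entity of Cf(Un-1) realized in Un, else None."""
        cur = set(cur_cf)
        for entity in prev_cf:      # prev_cf is ranked, so the first hit wins
            if entity in cur:
                return entity
        return None

    def transition(prev_cb, prev_cf, cur_cf):
        """Classify the transition into Un, following the standard table:
        Cb unchanged (or previously undefined) -> CONTINUE / RETAIN,
        Cb changed -> SMOOTH-SHIFT / ROUGH-SHIFT, split on Cb == Cp."""
        cb = backward_center(prev_cf, cur_cf)
        cp = cur_cf[0] if cur_cf else None   # preferred center: top of Cf(Un)
        if cb == prev_cb or prev_cb is None:
            return "CONTINUE" if cb == cp else "RETAIN"
        return "SMOOTH-SHIFT" if cb == cp else "ROUGH-SHIFT"

    def rough_shift_ratio(paragraph_cfs):
        """Fraction of Rough-Shift transitions across a paragraph's utterances."""
        labels, prev_cb = [], None
        for prev, cur in zip(paragraph_cfs, paragraph_cfs[1:]):
            labels.append(transition(prev_cb, prev, cur))
            prev_cb = backward_center(prev, cur)
        return labels.count("ROUGH-SHIFT") / max(len(labels), 1)

    # Example: the second transition abandons all prior entities, so it is
    # a Rough-Shift, giving a ratio of 0.5 for the paragraph.
    paragraph = [
        ["essay", "student"],      # U1
        ["essay", "score"],        # U2: Cb = Cp = "essay" -> CONTINUE
        ["teacher", "deadline"],   # U3: no shared entity  -> ROUGH-SHIFT
    ]
    print(rough_shift_ratio(paragraph))  # 0.5

Under this sketch, a paragraph whose topics are short-lived and unconnected, as the abstract describes, yields a high Rough-Shift ratio, which is the property the authors exploit as an essay-scoring feature.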