A new dataset and method for automatically grading ESOL texts

Authors:
Helen Yannakoudakis;Ted Briscoe;Ben Medlock
Affiliations:
University of Cambridge, United Kingdom;University of Cambridge, United Kingdom;iLexIR Ltd, Cambridge, United Kingdom
Venue:
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies - Volume 1
Year:
2011

Citing 11
Cited 15

The nature of statistical learning theory

The nature of statistical learning theory
Making large-scale support vector machine learning practical

Advances in kernel methods
Space/time trade-offs in hash coding with allowable errors

Communications of the ACM
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features

ECML '98 Proceedings of the 10th European Conference on Machine Learning
Optimizing search engines using clickthrough data

Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Evaluation of text coherence for electronic essay scoring systems

Natural Language Engineering
Automated scoring using a hybrid feature identification technique

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Automated rating of ESL essays

HLT-NAACL-EDUC '03 Proceedings of the HLT-NAACL 03 workshop on Building educational applications using natural language processing - Volume 2
The second release of the RASP system

COLING-ACL '06 Proceedings of the COLING/ACL on Interactive presentation sessions
Review:

The Knowledge Engineering Review
An Unsupervised Automated Essay Scoring System

IEEE Intelligent Systems

Correction detection and error type selection as an ESL educational aid

NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Automating second language acquisition research: integrating information visualisation and machine learning

EACL 2012 Proceedings of the EACL 2012 Joint Workshop of LINGVIS & UNCLH
Modeling coherence in ESOL learner texts

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
HOO 2012: a report on the preposition and determiner error correction shared task

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
Predicting learner levels for online exercises of Hebrew

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
Informing determiner and preposition error correction with word clusters

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
NUS at the HOO 2012 shared task

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
HOO 2012 error recognition and correction shared task: Cambridge University submission report

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
Korea University system in the HOO 2012 shared task

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
KU Leuven at HOO-2012: a hybrid approach to detection and correction of determiner and preposition errors in non-native English text

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
The UI system in the HOO 2012 shared task on error correction

Proceedings of the Seventh Workshop on Building Educational Applications Using NLP
Automated essay scoring based on finite state transducer: towards ASR transcription of oral English speech

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Long Papers - Volume 1
A meta learning approach to grammatical error correction

ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
On second language tutoring through womb grammars

IWANN'13 Proceedings of the 12th international conference on Artificial Neural Networks: advances in computational intelligence - Volume Part I
Scaling short-answer grading by combining peer assessment with algorithmic scoring

Proceedings of the first ACM conference on Learning @ scale conference

Quantified Score

Hi-index	0.00

Visualization

Abstract

We demonstrate how supervised discriminative machine learning techniques can be used to automate the assessment of 'English as a Second or Other Language' (ESOL) examination scripts. In particular, we use rank preference learning to explicitly model the grade relationships between scripts. A number of different features are extracted and ablation tests are used to investigate their contribution to overall performance. A comparison between regression and rank preference models further supports our method. Experimental results on the first publically available dataset show that our system can achieve levels of performance close to the upper bound for the task, as defined by the agreement between human examiners on the same corpus. Finally, using a set of 'outlier' texts, we test the validity of our model and identify cases where the model's scores diverge from that of a human examiner.