Automated scoring using a hybrid feature identification technique

  • Authors:
  • Jill Burstein, Karen Kukich, Susanne Wolff, Chi Lu (Educational Testing Service, Princeton, NJ); Martin Chodorow (Hunter College, New York City, NY); Lisa Braden-Harder (Butler-Hill Group, Reston, VA); Mary Dee Harris (Language Technology, Inc., Austin, TX)

  • Venue:
  • COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
  • Year:
  • 1998


Abstract

This study exploits statistical redundancy inherent in natural language to automatically predict scores for essays. We use a hybrid feature identification method, combining syntactic structure analysis, rhetorical structure analysis, and topical analysis, to score essay responses from test-takers of the Graduate Management Admission Test (GMAT) and the Test of Written English (TWE). For each essay question, a stepwise linear regression analysis is run on a training set (a sample of human-scored essay responses) to extract a weighted set of predictive features for that test question. Score predictions for cross-validation sets are then computed from the set of predictive features. Exact or adjacent agreement between the Electronic Essay Rater (e-rater) score predictions and human rater scores ranged from 87% to 94% across the 15 test questions.
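The training-and-prediction pipeline the abstract describes can be sketched roughly as follows. This is a minimal illustration, not the actual e-rater implementation: the feature matrix here is synthetic, forward stepwise selection stands in for whatever stepwise variant was used, and the 1–6 score range and the `exact_or_adjacent` agreement check are assumptions based on the abstract's description.

```python
import numpy as np

def forward_stepwise(X, y, max_features=3, tol=1e-4):
    """Greedy forward selection: repeatedly add the feature that most
    reduces residual sum of squares, stopping when the improvement
    falls below tol. Returns selected column indices and coefficients
    (intercept first) of the final least-squares fit."""
    n, p = X.shape
    selected, remaining = [], list(range(p))
    best_rss = np.sum((y - y.mean()) ** 2)
    while remaining and len(selected) < max_features:
        # Score each candidate feature by the RSS of the augmented fit.
        trials = []
        for j in remaining:
            A = np.column_stack([np.ones(n), X[:, selected + [j]]])
            coef, *_ = np.linalg.lstsq(A, y, rcond=None)
            trials.append((np.sum((y - A @ coef) ** 2), j))
        rss, j = min(trials)
        if best_rss - rss < tol:
            break
        best_rss = rss
        selected.append(j)
        remaining.remove(j)
    A = np.column_stack([np.ones(n), X[:, selected]])
    coef, *_ = np.linalg.lstsq(A, y, rcond=None)
    return selected, coef

def predict_scores(X, selected, coef):
    """Predict scores and round to the discrete 1-6 essay scale
    (the scale is an assumption for this sketch)."""
    A = np.column_stack([np.ones(len(X)), X[:, selected]])
    return np.clip(np.rint(A @ coef), 1, 6)

def exact_or_adjacent(pred, human):
    """Fraction of predictions within one score point of the human rater."""
    return float(np.mean(np.abs(pred - human) <= 1))

# Synthetic demo: 200 essays, 5 candidate features, of which
# columns 0 and 2 actually drive the (hypothetical) human score.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 5))
human = np.clip(np.rint(3.5 + 0.8 * X[:, 0] + 0.5 * X[:, 2]), 1, 6)

selected, coef = forward_stepwise(X, human)
pred = predict_scores(X, selected, coef)
agreement = exact_or_adjacent(pred, human)
```

On this toy data the stepwise procedure recovers the two informative features first, and the rounded predictions fall within one point of the "human" score for nearly all essays, mirroring the exact-or-adjacent agreement metric reported in the paper.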