Google books n-gram corpus used as a grammar checker

Authors:
Rogelio Nazar;Irene Renau
Affiliations:
University Institute of Applied Linguistics, Universitat Pompeu Fabra, Barcelona, Spain;University Institute of Applied Linguistics, Universitat Pompeu Fabra, Barcelona, Spain
Venue:
EACL 2012 Proceedings of the Second Workshop on Computational Linguistics and Writing (CLW 2012): Linguistic and Cognitive Aspects of Document Creation and Document Engineering
Year:
2012

Citing 14
Cited 0

Automated postediting of documents

AAAI '94 Proceedings of the twelfth national conference on Artificial intelligence (vol. 1)
Natural Language Processing: The Plnlp Approach

Natural Language Processing: The Plnlp Approach
Responding intelligently to unparsable inputs

Computational Linguistics
The experience of developing a large-scale natural language text processing system: CRITIQUE

ANLC '88 Proceedings of the second conference on Applied natural language processing
How to detect grammatical errors in a text without parsing it

EACL '87 Proceedings of the third conference on European chapter of the Association for Computational Linguistics
Recognizing syntactic errors in the writing of second language learners

COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 2
Towards robust PATR

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Automated essay evaluation: the criterion online writing service

AI Magazine
The performance of a grammar checker with deviant language input

COLING '02 Proceedings of the 19th international conference on Computational linguistics - Volume 2
Detecting errors in English article usage by non-native speakers

Natural Language Engineering
Introduction to Information Retrieval

Introduction to Information Retrieval
The EPISTLE text-critiquing system

IBM Systems Journal
Using the web for language independent spellchecking and autocorrection

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 2 - Volume 2
Automated Grammatical Error Detection for Language Learners

Automated Grammatical Error Detection for Language Learners

Quantified Score

Hi-index	0.00

Visualization

Abstract

In this research we explore the possibility of using a large n-gram corpus (Google Books) to derive lexical transition probabilities from the frequency of word n-grams and then use them to check and suggest corrections in a target text without the need for grammar rules. We conduct several experiments in Spanish, although our conclusions also reach other languages since the procedure is corpus-driven. The paper reports on experiments involving different types of grammar errors, which are conducted to test different grammar-checking procedures, namely, spotting possible errors, deciding between different lexical possibilities and filling-in the blanks in a text.