Parse fitting and prose fixing: getting a hold on ill-formedness

Authors:
K. Jensen;G. E. Heidorn;L. A. Miller;Y. Ravin
Affiliations:
IBM Thomas J. Watson Research Center, Yorktown Heights, New York;IBM Thomas J. Watson Research Center, Yorktown Heights, New York;IBM Thomas J. Watson Research Center, Yorktown Heights, New York;IBM Thomas J. Watson Research Center, Yorktown Heights, New York
Venue:
Computational Linguistics - Special issue on ill-formed input
Year:
1983

Citing 8
Cited 36

An intelligent analyzer and understander of English

Communications of the ACM
Responding intelligently to unparsable inputs

Computational Linguistics
Relaxation techniques for parsing grammatically III-formed input in Natural Language Understanding Systems

Computational Linguistics
Flexible parsing

Computational Linguistics
The fitted parse: 100% parsing capability in a syntactic grammar of English

ANLC '83 Proceedings of the first conference on Applied natural language processing
A status report on the LRC machine

ANLC '83 Proceedings of the first conference on Applied natural language processing
Experience with an easily computed metric for ranking alternative parsess

ACL '82 Proceedings of the 20th annual meeting on Association for Computational Linguistics
An improved heuristic for ellipsis processing

ACL '82 Proceedings of the 20th annual meeting on Association for Computational Linguistics

The correction of ill-formed input using history-based expectation with applications to speech understanding

Computational Linguistics
CRITAC—an experimental system for Japanese text proofreading

IBM Journal of Research and Development
Tools and methods for computational lexicology

Computational Linguistics - Special issue of the lexicon
Automatic text indexing using complex identifiers

DOCPROCS '88 Proceedings of the ACM conference on Document processing systems
On the application of syntactic methodologies in automatic text analysis

SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
Generating text from compressed input: an intelligent interface for people with severe motor impairments

Communications of the ACM
Techniques for automatically correcting words in text

ACM Computing Surveys (CSUR)
Generalized probabilistic LR parsing of natural language (Corpora) with unification-based grammars

Computational Linguistics - Special issue on using large corpora: I
Surface-marker-based dialog modelling: A progress report on the MAREDI project

Natural Language Engineering
Creating and querying lexical data bases

ANLC '88 Proceedings of the second conference on Applied natural language processing
The experience of developing a large-scale natural language text processing system: CRITIQUE

ANLC '88 Proceedings of the second conference on Applied natural language processing
Compansion: From research prototype to practical integration

Natural Language Engineering
Evaluating parsing strategies using standardized parse files

ANLC '92 Proceedings of the third conference on Applied natural language processing
Robust parsing of natural language descriptions expressed in telegraphic style

ANLC '92 Proceedings of the third conference on Applied natural language processing
Parsing with a small dictionary for applications such as text to speech

Computational Linguistics
How to detect grammatical errors in a text without parsing it

EACL '87 Proceedings of the third conference on European chapter of the Association for Computational Linguistics
A model for preference

EACL '87 Proceedings of the third conference on European chapter of the Association for Computational Linguistics
Some chart-based techniques for parsing ill-formed input

ACL '89 Proceedings of the 27th annual meeting on Association for Computational Linguistics
On representing governed prepositions and handling "incorrect" and novel prepositions

ACL '89 Proceedings of the 27th annual meeting on Association for Computational Linguistics
Robust parsing based on discourse information: completing partial parses of ill-formed sentences on the basis of discourse information

ACL '95 Proceedings of the 33rd annual meeting on Association for Computational Linguistics
Sentence fragments regular structures

ACL '88 Proceedings of the 26th annual meeting on Association for Computational Linguistics
Syntactic approaches to automatic book indexing

ACL '88 Proceedings of the 26th annual meeting on Association for Computational Linguistics
Syntactic normalization of spontaneous speech

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 3
Schema method: a framework for correcting grammatically ill-formed input

COLING '88 Proceedings of the 12th conference on Computational linguistics - Volume 1
TTP: a fast and robust parser for natural language

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 1
Towards robust PATR

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
Structural patterns vs. string patterns for extracting semantic information from dictionaries

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 2
JDII: parsing Italian with a robust constraint grammar

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
GramCheck: a grammar and style checker

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
JDII: parsing Italian with a robust constraint grammar

COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
An agreement corrector for Russian

COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 2
A linguistic theory of robustness

COLING '90 Proceedings of the 13th conference on Computational linguistics - Volume 2
Parsing Ill-Formed Text Using an Error Grammar

Artificial Intelligence Review
TextTree construction for parser and treebank development

Software '05 Proceedings of the Workshop on Software
ULINK: a semantics-driven approach to understanding ungrammatical input

AAAI'91 Proceedings of the ninth National conference on Artificial intelligence - Volume 1
Feature constraint logic and error detection in ICALL systems

LACL'05 Proceedings of the 5th international conference on Logical Aspects of Computational Linguistics

Quantified Score

Hi-index	0.02

Visualization

Abstract

Processing syntactically ill-formed language is an important mission of the EPISTLE system, Ill-formed input is treated by this system in various ways. Misspellings are highlighted by a standard spelling checker; syntactic errors are detected and corrections are suggested; and stylistic infelicities are called to the user's attention.Central to the EPISTLE processing strategy is its technique of fitted parsing. When the rules of a conventional syntactic grammar are unable to produce a parse for an input string, this technique can be used to produce a reasonable approximate parse that can serve as input to the remaining stages of processing.This paper first describes the fitting process and gives examples of ill-formed language situations where it is called into play. We then show how a fitted parse allows EPISTLE to carry on its text-critiquing mission where conventional grammars would fail either because of input problems or because of limitations in the grammars themselves. Some inherent difficulties of the fitting technique are also discussed. In addition, we explore how style critiquing relates to the handling of ill-formed input, and how a fitted parse can be used in style checking.