Parsing the LOB corpus

Author: Carl G. de Marcken
Affiliation: MIT AI Laboratory, Cambridge, MA
Venue: ACL '90, Proceedings of the 28th Annual Meeting of the Association for Computational Linguistics
Year: 1990

Abstract

This paper presents a rapid and robust parsing system currently used to learn from large bodies of unedited text. The system contains a multivalued part-of-speech disambiguator and a novel parser employing bottom-up recognition to find the constituent phrases of larger structures that might be too difficult to analyze. The results of applying the disambiguator and parser to large sections of the Lancaster/Oslo-Bergen corpus are presented.