Lexicon and grammar in probabilistic tagging of written English

Authors:
Andrew David Beale
Affiliations:
University of Lancaster, Lancaster, England
Venue:
ACL '88 Proceedings of the 26th annual meeting on Association for Computational Linguistics
Year:
1988

Citing 3
Cited 3

Natural Language Information Processing: A Computer Grammmar of English and Its Applications

Natural Language Information Processing: A Computer Grammmar of English and Its Applications
Introduction to Automata Theory, Languages and Computability

Introduction to Automata Theory, Languages and Computability
The derivation of a grammatically indexed lexicon from the Longman Dictionary of Contemporary English

ACL '87 Proceedings of the 25th annual meeting on Association for Computational Linguistics

Tagging English text with a probabilistic model

Computational Linguistics
A corpus-based statistical approach to automatic book indexing

ANLC '92 Proceedings of the third conference on Applied natural language processing
The recognition capacity of local syntactic constraints

EACL '91 Proceedings of the fifth conference on European chapter of the Association for Computational Linguistics

Quantified Score

Hi-index	0.01

Visualization

Abstract

The paper describes the development of software for automatic grammatical analysis of unrestricted, unedited English text at the Unit for Computer Research on the English Language (UCREL) at the University of Lancaster. The work is currently funded by IBM and carried out in collaboration with colleagues at IBM UK (Winchester) and IBM Yorktown Heights. The paper will focus on the lexicon component of the word tagging system, the UCREL grammar, the databanks of parsed sentences, and the tools that have been written to support development of these components. This work has applications to speech technology, spelling correction, and other areas of natural language processing. Currently, our goal is to provide a language model using transition statistics to disambiguate alternative parses for a speech recognition device.