POST: using probabilities in language processing

Authors:
Marie Meteer;Richard Schwartz;Ralph Weischedel
Affiliations:
BBN Systems and Technologies, Cambridge, MA;BBN Systems and Technologies, Cambridge, MA;BBN Systems and Technologies, Cambridge, MA
Venue:
IJCAI'91 Proceedings of the 12th international joint conference on Artificial intelligence - Volume 2
Year:
1991

Citing 8
Cited 5

Grammatical category disambiguation by statistical optimization

Computational Linguistics
A Cache-Based Natural Language Model for Speech Recognition

IEEE Transactions on Pattern Analysis and Machine Intelligence
Portability in the Janus Natural Language Interface

HLT '89 Proceedings of the workshop on Speech and Natural Language
A stochastic parts program and noun phrase parser for unrestricted text

ANLC '88 Proceedings of the second conference on Applied natural language processing
Parsing the LOB corpus

ACL '90 Proceedings of the 28th annual meeting on Association for Computational Linguistics
Unification-based semantic interpretation in the BBN Spoken Language System

HLT '89 Proceedings of the workshop on Speech and Natural Language
Preference semantics for message understanding

HLT '89 Proceedings of the workshop on Speech and Natural Language
Augmenting a hidden Markov model for phrase-dependent word tagging

HLT '89 Proceedings of the workshop on Speech and Natural Language

Shipping departments vs. shipping pacemakers: using thematic analysis to improve tagging accuracy

AAAI'92 Proceedings of the tenth national conference on Artificial intelligence
Probabilistic parse scoring with prosodic information

ICASSP'93 Proceedings of the 1993 IEEE international conference on Acoustics, speech, and signal processing: speech processing - Volume II
Analysis of inconsistencies in cross-lingual automatic ToBI tonal accent labeling

TSD'11 Proceedings of the 14th international conference on Text, speech and dialogue
Ending-based strategies for part-of-speech tagging

UAI'94 Proceedings of the Tenth international conference on Uncertainty in artificial intelligence
Cross-lingual English Spanish tonal accent labeling using decision trees and neural networks

NOLISP'11 Proceedings of the 5th international conference on Advances in nonlinear speech processing

Quantified Score

Hi-index	0.01

Visualization

Abstract

We report here on our experiments with POST (Part of Speech Tagger) to address problems of ambiguity and of understanding unknown words. Part of speech tagging, perse, is a well understood problem. Our paper reports experiments in three important areas: handling unknown words, limiting the size of the training set, and returning a set of the most likely tags for each word rather than a single tag. We describe the algorithms that we used and the specific results of our experiments on Wall Street Journal articles and on MUC terrorist messages.