POST: using probabilities in language processing

  • Authors:
  • Marie Meteer;Richard Schwartz;Ralph Weischedel

  • Affiliations:
  • BBN Systems and Technologies, Cambridge, MA;BBN Systems and Technologies, Cambridge, MA;BBN Systems and Technologies, Cambridge, MA

  • Venue:
  • IJCAI'91 Proceedings of the 12th international joint conference on Artificial intelligence - Volume 2
  • Year:
  • 1991

Quantified Score

Hi-index 0.01

Visualization

Abstract

We report here on our experiments with POST (Part of Speech Tagger) to address problems of ambiguity and of understanding unknown words. Part of speech tagging, perse, is a well understood problem. Our paper reports experiments in three important areas: handling unknown words, limiting the size of the training set, and returning a set of the most likely tags for each word rather than a single tag. We describe the algorithms that we used and the specific results of our experiments on Wall Street Journal articles and on MUC terrorist messages.