Dependency Language Modeling Using KNN and PLSI

  • Authors:
  • Hiram Calvo;Kentaro Inui;Yuji Matsumoto

  • Affiliations:
  • Center for Computing Research, National Polytechnic Institute, Mexico 07738 and Nara Institute of Science and Technology, Nara, Japan 630-0192;Nara Institute of Science and Technology, Nara, Japan 630-0192;Nara Institute of Science and Technology, Nara, Japan 630-0192

  • Venue:
  • MICAI '09 Proceedings of the 8th Mexican International Conference on Artificial Intelligence
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we present a comparison of two language models based on dependency triples. We explore using the verb only for predicting the most plausible argument as in selectional preferences, as well as using both the verb and argument for predicting another argument. This latter causes a problem of data sparseness that must be solved by different techniques for data smoothing. Based on our results on the K-Nearest Neighbor model (KNN) algorithm we conclude that adding more information is useful for attaining higher precision, while the PLSI model was inconveniently sensitive to this information, yielding better results for the simpler model (using the verb only). Our results suggest that combining the strengths of both algorithms would provide best results.