Co-occurrence vectors from corpora vs. distance vectors from dictionaries

  • Authors:
  • Yoshiki Niwa;Yoshihiko Nitta

  • Affiliations:
  • Advanced Research Laboratory, Hitachi, Ltd., Saitama, Japan;Advanced Research Laboratory, Hitachi, Ltd., Saitama, Japan

  • Venue:
  • COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 1
  • Year:
  • 1994

Quantified Score

Hi-index 0.00

Visualization

Abstract

A comparison was made of vectors derived by using ordinary co-occurrence statistics from large text corpora and of vectors derived by measuring the interword distances in dictionary definitions. The precision of word sense disambiguation by using co-occurrence vectors from the 1987 Wall Street Journal (20M total words) was higher than that by using distance vectors from the Collins English Dictionary (60K head words + 1.6M definition words). However, other experimental results suggest that distance vectors contain some different semantic information from co-occurrence vectors.