Recent advances in HNC's context vector information retrieval technology

  • Authors:
  • Marc R. Ilgen;David A. Rushall

  • Affiliations:
  • HNC Software, Inc., San Diego, CA;HNC Software, Inc., San Diego, CA

  • Venue:
  • TIPSTER '96 Proceedings of a workshop on held at Vienna, Virginia: May 6-8, 1996
  • Year:
  • 1996

Quantified Score

Hi-index 0.00

Visualization

Abstract

Over the past few years, HNC has developed a neural network based, vector space approach to text retrieval. This approach, embodied in a system called MatchPlus, allows the user to retrieve information on the basis of meaning and context of a free text query. The MatchPlus system uses a neural network based, constrained self-organization technique to learn word stem interrelationships directly from a training corpus, thereby eliminating the need for hand crafted linguistic knowledge bases and their often substantial maintenance requirements. This paper presents results from recent enhancements to the basic MatchPlus concept. These enhancements include the development of a one step learning law that greatly reduces the amount of time and/or computational resources required to train the system, and the development of a prototype multilingual (English and Spanish) text retrieval system.