Using semantic and phonetic term similarity for spoken document retrieval and spoken query processing

  • Authors:
  • Fabio Crestani

  • Affiliations:
  • Department of Computer Science, University of Strathclyde, 26 Richmond Street, Glasgow G1 1XH, Scotland, UK

  • Venue:
  • Technologies for constructing intelligent systems
  • Year:
  • 2002

Quantified Score

Hi-index 0.00

Visualization

Abstract

In classical Information Retrieval systems a relevant document will not be retrieved in response to a query if the document and query representations do not share at least one term. This problem is known as "term mismatch". A similar problem can be found in spoken document retrieval and spoken query processing, where terms misrecognized by the speech recognition process can hinder the retrieval of potentially relevant documents. I will call this problem "term misrecognition", by analogy to the term mismatch problem.This paper presents two classes of retrieval models that attempt to tackle both the term mismatch and the term misrecognition problems at retrieval time using term similarity information. The models make effective use of complete or partial knowledge of semantic and phonetic term similarity evaluated using statistical methods for the corpus.