Domain-Specific Information Retrieval Based on Improved Language Model

  • Authors:
  • Kai Kang;Kunhui Lin;Changle Zhou;Feng Guo

  • Affiliations:
  • Xiamen Univ. , Xiamen 361005, Fujian, China;Xiamen Univ. , Xiamen 361005, Fujian, China;Xiamen Univ. , Xiamen 361005, Fujian, China;Xiamen Univ. , Xiamen 361005, Fujian, China

  • Venue:
  • FSKD '07 Proceedings of the Fourth International Conference on Fuzzy Systems and Knowledge Discovery - Volume 02
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

There are two key ingredients in the general framework of language models used in information retrieval, one is importance weighting, the other is word relationship computing. A series of improvements are made to these ingredients of the general framework of language models which is used in domain-specific information retrieval. First, an EM algorithm is proposed to estimate the importance weights of query terms, and the Bayesian smoothing is used to compute the productive probabilities of important terms. Next, a new algorithm based on Dynamic Bayesian Network is proposed for obtaining the explanation probabilities between terms. Experiment shows that the improved model performs remarkably better for domain-specific information retrieval than some traditional retrieval techniques, and the extended framework has good expansibility.