Word sense language model for information retrieval

Authors:
Liqi Gao;Yu Zhang;Ting Liu;Guiping Liu
Affiliations:
Information Retrieval Laboratory, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, P.R. China;Information Retrieval Laboratory, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, P.R. China;Information Retrieval Laboratory, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, P.R. China;Information Retrieval Laboratory, School of Computer Science and Technology, Harbin Institute of Technology, Harbin, P.R. China
Venue:
AIRS'06 Proceedings of the Third Asia conference on Information Retrieval Technology
Year:
2006

Citing 10
Cited 2

Lexical ambiguity and information retrieval

ACM Transactions on Information Systems (TOIS)
Using WordNet to disambiguate word senses for text retrieval

SIGIR '93 Proceedings of the 16th annual international ACM SIGIR conference on Research and development in information retrieval
Word sense disambiguation and information retrieval

SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
A language modeling approach to information retrieval

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A hidden Markov model information retrieval system

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
Information retrieval as statistical translation

Proceedings of the 22nd annual international ACM SIGIR conference on Research and development in information retrieval
A general language model for information retrieval

Proceedings of the eighth international conference on Information and knowledge management
Two-stage language models for information retrieval

SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Word sense disambiguation in information retrieval revisited

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Dependence language model for information retrieval

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval

A conceptual representation of documents and queries for information retrieval systems by using light ontologies

Expert Systems with Applications: An International Journal
Fine-grained topic detection in news search results

Proceedings of the 27th Annual ACM Symposium on Applied Computing

Quantified Score

Hi-index	0.01

Visualization

Abstract

This paper proposes a word sense language model based method for information retrieval. This method, differing from most of traditional ones, combines word senses defined in a thesaurus with a classic statistical model. The word sense language model regards the word sense as a form of linguistic knowledge, which is helpful in handling mismatch caused by synonym and data sparseness due to data limit. Experimental results based on TREC-Mandarin corpus show that this method gains 12.5% improvement on MAP over traditional tf-idf retrieval method but 5.82% decrease on MAP compared to a classic language model. A combination result of this method and the language model yields 8.92% and 7.93% increases over either respectively. We present analysis and discussions on the not-so-exciting results and conclude that a higher performance of word sense language model will owe to high accurate of word sense labeling. We believe that linguistic knowledge such as word sense of a thesaurus will help IR improve ultimately in many ways.