Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Pivoted document length normalization
SIGIR '96 Proceedings of the 19th annual international ACM SIGIR conference on Research and development in information retrieval
A vector space model for automatic indexing
Communications of the ACM
A probabilistic model of information retrieval: development and comparative experiments
Information Processing and Management: an International Journal
A study of smoothing methods for language models applied to Ad Hoc information retrieval
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Document normalization revisited
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Probabilistic models of information retrieval based on measuring the divergence from randomness
ACM Transactions on Information Systems (TOIS)
Term Frequency Normalization via Pareto Distributions
Proceedings of the 24th BCS-IRSG European Colloquium on IR Research: Advances in Information Retrieval
Usefulness of hyperlink structure for query-biased topic distillation
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
A decision mechanism for the selective combination of evidence in topic distillation
Information Retrieval
InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
A syntactically-based query reformulation technique for information retrieval
Information Processing and Management: an International Journal
Parameter sensitivity in the probabilistic model for ad-hoc retrieval
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Evolved term-weighting schemes in Information Retrieval: an analysis of the solution space
Artificial Intelligence Review
Artificial Intelligence Review
Learning to Rank for Information Retrieval
Foundations and Trends in Information Retrieval
A study of information retrieval on accumulative social descriptions using the generation features
Proceedings of the 18th ACM conference on Information and knowledge management
Probabilistic static pruning of inverted files
ACM Transactions on Information Systems (TOIS)
Probabilistic document length priors for language models
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
The static absorbing model for the web
Journal of Web Engineering
CSUSM experiments in GeoCLEF2005: monolingual and bilingual tasks
CLEF'05 Proceedings of the 6th international conference on Cross-Language Evalution Forum: accessing Multilingual Information Repositories
Term frequency normalisation tuning for BM25 and DFR models
ECIR'05 Proceedings of the 27th European conference on Advances in Information Retrieval Research
The university of glasgow at CLEF 2004: French monolingual information retrieval with terrier
CLEF'04 Proceedings of the 5th conference on Cross-Language Evaluation Forum: multilingual Information Access for Text, Speech and Images
Class normalization in centroid-based text categorization
Information Sciences: an International Journal
Frequentist and bayesian approach to information retrieval
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Document length normalization using effective level of term frequency in large collections
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Monolingual and bilingual experiments in GeoCLEF2006
CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
A constraint to automatically regulate document-length normalisation
Proceedings of the 21st ACM international conference on Information and knowledge management
About learning models with multiple query-dependent features
ACM Transactions on Information Systems (TOIS)
Hi-index | 0.00 |
Most current term frequency normalization approaches for information retrieval involve the use of parameters. The tuning of these parameters has an important impact on the overall performance of the information retrieval system. Indeed, a small variation in the involved parameter(s) could lead to an important variation in the precision/recall values. Most current tuning approaches are dependent on the document collections. As a consequence, the effective parameter value cannot be obtained for a given new collection without extensive training data. In this paper, we propose a novel and robust method for the tuning of term frequency normalization parameter(s), by measuring the normalization effect on the within document frequency of the query terms. As an illustration, we apply our method on Amati \& Van Rijsbergen's so-called normalization 2. The experiments for the ad-hoc TREC-6,7,8 tasks and TREC-8,9,10 Web tracks show that the new method is independent of the collections and able to provide reliable and good performance.