Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Automatic text processing: the transformation, analysis, and retrieval of information by computer
Automatic text processing: the transformation, analysis, and retrieval of information by computer
Evaluation of an inference network-based retrieval model
ACM Transactions on Information Systems (TOIS) - Special issue on research and development in information retrieval
Probabilistic models in information retrieval
The Computer Journal - Special issue on information retrieval
Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
On modeling information retrieval with probabilistic inference
ACM Transactions on Information Systems (TOIS)
On relevance weights with little relevance information
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Exploring the similarity space
ACM SIGIR Forum
A language modeling approach to information retrieval
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
A study of smoothing methods for language models applied to Ad Hoc information retrieval
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Probabilistic models of information retrieval based on measuring the divergence from randomness
ACM Transactions on Information Systems (TOIS)
An exploration of axiomatic approaches to information retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Gravitation-based model for information retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
A practical system of keyphrase extraction for web pages
Proceedings of the 14th ACM international conference on Information and knowledge management
ACM Transactions on Asian Language Information Processing (TALIP)
CWS: a comparative web search system
Proceedings of the 15th international conference on World Wide Web
InfoScale '06 Proceedings of the 1st international conference on Scalable information systems
Semantic term matching in axiomatic approaches to information retrieval
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
eTuner: tuning schema matching software using synthetic scenarios
The VLDB Journal — The International Journal on Very Large Data Bases
Estimation, sensitivity, and generalization in parameterized retrieval models
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Text mining techniques for patent analysis
Information Processing and Management: an International Journal
Proceedings of the 16th international conference on World Wide Web
An exploration of proximity measures in information retrieval
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
A study of Poisson query generation model for information retrieval
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
IDF revisited: a simple new derivation within the Robertson-Spärck Jones probabilistic model
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
An empirical study of tokenization strategies for biomedical information retrieval
Information Retrieval
Semantic annotation of frequent patterns
ACM Transactions on Knowledge Discovery from Data (TKDD)
Natural language processing for information retrieval: the time is ripe (again)
Proceedings of the ACM first Ph.D. workshop in CIKM
Data allocation scheme based on term weight for P2P information retrieval
Proceedings of the 9th annual ACM international workshop on Web information and data management
Parameter sensitivity in the probabilistic model for ad-hoc retrieval
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Speeding Coordination by Combining Analytical and Inductive Learning
WI-IATW '07 Proceedings of the 2007 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Workshops
Automatic online news issue construction in web environment
Proceedings of the 17th international conference on World Wide Web
An outranking approach for information retrieval
Information Retrieval
A new probabilistic retrieval model based on the dirichlet compound multinomial distribution
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Inferring semantic query relations from collective user behavior
Proceedings of the 17th ACM conference on Information and knowledge management
Active relevance feedback for difficult queries
Proceedings of the 17th ACM conference on Information and knowledge management
A generative retrieval model for structured documents
Proceedings of the 17th ACM conference on Information and knowledge management
Information retrieval from digital libraries in SQL
Proceedings of the 10th ACM workshop on Web information and data management
Artificial Intelligence Review
Statistical Language Models for Information Retrieval A Critical Review
Foundations and Trends in Information Retrieval
Measuring constraint violations in information retrieval
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Bridging Language Modeling and Divergence from Randomness Models: A Log-Logistic Model for IR
ICTIR '09 Proceedings of the 2nd International Conference on Theory of Information Retrieval: Advances in Information Retrieval Theory
Effective and efficient structured retrieval
Proceedings of the 18th ACM conference on Information and knowledge management
A comparative study of methods for estimating query language models with pseudo feedback
Proceedings of the 18th ACM conference on Information and knowledge management
Retrieval constraints and word frequency distributions: a log-logistic model for IR
Proceedings of the 18th ACM conference on Information and knowledge management
Discovering the discriminative views: measuring term weights for sentiment analysis
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 1 - Volume 1
Language models for web object retrieval
WiCOM'09 Proceedings of the 5th International Conference on Wireless communications, networking and mobile computing
Improving probabilistic information retrieval by modeling burstiness of words
Information Processing and Management: an International Journal
Proceedings of the 19th international conference on World wide web
Utilizing passage-based language models for ad hoc document retrieval
Information Retrieval
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Information-based models for ad hoc IR
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Multi-style language model for web scale information retrieval
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
On the query reformulation technique for effective MEDLINE document retrieval
Journal of Biomedical Informatics
Examining the information retrieval process from an inductive perspective
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Using chi-square statistics to measure similarities for text categorization
Expert Systems with Applications: An International Journal
A method for weighting multi-valued features in content-based filtering
IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part III
Retrieval constraints and word frequency distributions a log-logistic model for IR
Information Retrieval
Diagnostic Evaluation of Information Retrieval Models
ACM Transactions on Information Systems (TOIS)
Learning to model relatedness for news recommendation
Proceedings of the 20th international conference on World wide web
Improving access to large patent corpora
Transactions on large-scale data- and knowledge-centered systems II
Improving access to large patent corpora
Transactions on large-scale data- and knowledge-centered systems II
When documents are very long, BM25 fails!
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Do IR models satisfy the TDC retrieval constraint
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Efficient keyword extraction for meaningful document perception
Proceedings of the 11th ACM symposium on Document engineering
Upper-bound approximations for dynamic pruning
ACM Transactions on Information Systems (TOIS)
Is document frequency important for PRF?
ICTIR'11 Proceedings of the Third international conference on Advances in information retrieval theory
Lower-bounding term frequency normalization
Proceedings of the 20th ACM international conference on Information and knowledge management
Adaptive term frequency normalization for BM25
Proceedings of the 20th ACM international conference on Information and knowledge management
Query aspect based term weighting regularization in information retrieval
ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
A new measure for query disambiguation using term co-occurrences
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
ACM Transactions on Information Systems (TOIS)
Predicting Query Performance by Query-Drift Estimation
ACM Transactions on Information Systems (TOIS)
Document length normalization using effective level of term frequency in large collections
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Joint relevance and freshness learning from clickthroughs for news search
Proceedings of the 21st international conference on World Wide Web
An early warning system for unrecognized drug side effects discovery
Proceedings of the 21st international conference companion on World Wide Web
AIRS'11 Proceedings of the 7th Asia conference on Information Retrieval Technology
Information Retrieval
Automatic Home Medical Product Recommendation
Journal of Medical Systems
Relation based term weighting regularization
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
A log-logistic model-based interpretation of TF normalization of BM25
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Axiomatic analysis of translation language model for information retrieval
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
An information-based cross-language information retrieval model
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Combining relevancy and methodological quality into a single ranking for evidence-based medicine
Information Sciences: an International Journal
An exploration of ranking heuristics in mobile local search
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
A constraint to automatically regulate document-length normalisation
Proceedings of the 21st ACM international conference on Information and knowledge management
Proceedings of the 21st ACM international conference on Information and knowledge management
A framework for the theoretical evaluation of XML retrieval
Journal of the American Society for Information Science and Technology
Estimation of the collection parameter of information models for IR
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Concept based query recommendation
AusDM '11 Proceedings of the Ninth Australasian Data Mining Conference - Volume 121
Composition of TF normalizations: new insights on scoring functions for ad hoc IR
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
About learning models with multiple query-dependent features
ACM Transactions on Information Systems (TOIS)
A Theoretical Analysis of Pseudo-Relevance Feedback Models
Proceedings of the 2013 Conference on the Theory of Information Retrieval
Axiometrics: An Axiomatic Approach to Information Retrieval Effectiveness Metrics
Proceedings of the 2013 Conference on the Theory of Information Retrieval
Revisiting Exhaustivity and Specificity Using Propositional Logic and Lattice Theory
Proceedings of the 2013 Conference on the Theory of Information Retrieval
Exploiting Forum Thread Structures to Improve Thread Clustering
Proceedings of the 2013 Conference on the Theory of Information Retrieval
Graph-of-word and TW-IDF: new approach to ad hoc IR
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Hi-index | 0.00 |
Empirical studies of information retrieval methods show that good retrieval performance is closely related to the use of various retrieval heuristics, such as TF-IDF weighting. One basic research question is thus what exactly are these "necessary" heuristics that seem to cause good retrieval performance. In this paper, we present a formal study of retrieval heuristics. We formally define a set of basic desirable constraints that any reasonable retrieval function should satisfy, and check these constraints on a variety of representative retrieval functions. We find that none of these retrieval functions satisfies all the constraints unconditionally. Empirical results show that when a constraint is not satisfied, it often indicates non-optimality of the method, and when a constraint is satisfied only for a certain range of parameter values, its performance tends to be poor when the parameter is out of the range. In general, we find that the empirical performance of a retrieval formula is tightly related to how well it satisfies these constraints. Thus the proposed constraints provide a good explanation of many empirical observations and make it possible to evaluate any existing or new retrieval formula analytically.