Some simple effective approximations to the 2-Poisson model for probabilistic weighted retrieval
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
Effective retrieval of structured documents
SIGIR '94 Proceedings of the 17th annual international ACM SIGIR conference on Research and development in information retrieval
A flexible model for retrieval of SGML documents
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Structured information retrieval in XML documents
Proceedings of the 2002 ACM symposium on Applied computing
Combining document representations for known-item search
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Searching XML documents via XML fragments
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
A machine learning model for information retrieval with structured documents
MLDM'03 Proceedings of the 3rd international conference on Machine learning and data mining in pattern recognition
Automatic extraction of titles from general documents using machine learning
Proceedings of the 5th ACM/IEEE-CS joint conference on Digital libraries
Efficient and self-tuning incremental query expansion for top-k query processing
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Title extraction from bodies of HTML documents and its application to web page retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Controlling overlap in content-oriented XML retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Relevance weighting for query independent evidence
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Gravitation-based model for information retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Indexing and ranking in Geo-IR systems
Proceedings of the 2005 workshop on Geographic information retrieval
Recommended reading for IR research students
ACM SIGIR Forum
Improving web search ranking by incorporating user behavior information
SIGIR '06 Proceedings of the 29th annual international ACM SIGIR conference on Research and development in information retrieval
Automatic extraction of titles from general documents using machine learning
Information Processing and Management: an International Journal
Voting for candidates: adapting data fusion techniques for an expert search task
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Optimisation methods for ranking functions with multiple parameters
CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
Web page title extraction and its application
Information Processing and Management: an International Journal
Combining fields for query expansion and adaptive query expansion
Information Processing and Management: an International Journal
Proceedings of the 16th international conference on World Wide Web
Spark: top-k keyword query in relational databases
Proceedings of the 2007 ACM SIGMOD international conference on Management of data
Enhancing relevance scoring with chronological term rank
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Improving retrieval accuracy by weighting document types with clickthrough data
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Scalable music recommendation by search
Proceedings of the 15th international conference on Multimedia
Computing block importance for searching on web sites
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Automatic feature selection in the markov random field model for information retrieval
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
SoftRank: optimizing non-smooth rank metrics
WSDM '08 Proceedings of the 2008 International Conference on Web Search and Data Mining
Mining the search trails of surfing crowds: identifying relevant websites from user activity
Proceedings of the 17th international conference on World Wide Web
Lexical cohesion and term proximity in document ranking
Information Processing and Management: an International Journal
Learning to rank with SoftRank and Gaussian processes
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Classifiers without borders: incorporating fielded text from neighboring web pages
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Emulating query-biased summaries using document titles
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Product retrieval for grocery stores
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Inter and intra-document contexts applied in polyrepresentation for best match IR
Information Processing and Management: an International Journal
Voting techniques for expert search
Knowledge and Information Systems
Tapping on the potential of q&a community by recommending answer providers
Proceedings of the 17th ACM conference on Information and knowledge management
A two-stage text mining model for information filtering
Proceedings of the 17th ACM conference on Information and knowledge management
Key blog distillation: ranking aggregates
Proceedings of the 17th ACM conference on Information and knowledge management
Natural language retrieval of grocery products
Proceedings of the 17th ACM conference on Information and knowledge management
Routing of structured queries in large-scale distributed systems
Proceedings of the 2008 ACM workshop on Large-Scale distributed systems for information retrieval
Distributed, large-scale latent semantic analysis by index interpolation
Proceedings of the 3rd international conference on Scalable information systems
Geographic features in web search retrieval
Proceedings of the 2nd international workshop on Geographic information retrieval
FuzzyFresh: A Fuzzy Logic Approach to the Ranking of Structured Documents
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Integrating Structure in the Probabilistic Model for Information Retrieval
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Consistent phrase relevance measures
Proceedings of the 2nd International Workshop on Data Mining and Audience Intelligence for Advertising
Refining component description by leveraging user query logs
Journal of Systems and Software
Understanding user's query intent with wikipedia
Proceedings of the 18th international conference on World wide web
Online expansion of rare queries for sponsored search
Proceedings of the 18th international conference on World wide web
An Operable Email Based Intelligent Personal Assistant
World Wide Web
Nullification test collections for web spam and SEO
Proceedings of the 5th International Workshop on Adversarial Information Retrieval on the Web
Effectively Searching Maps in Web Documents
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
A Probabilistic Retrieval Model for Semistructured Data
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Investigating Learning Approaches for Blog Post Opinion Retrieval
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Selective Application of Query-Independent Features in Web Information Retrieval
ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Query dependent pseudo-relevance feedback based on wikipedia
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Building enriched document representations using aggregated anchor text
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Using anchor texts with their hyperlink structure for web search
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Grocery Product Recommendations from Natural Language Inputs
UMAP '09 Proceedings of the 17th International Conference on User Modeling, Adaptation, and Personalization: formerly UM and AH
Analyzing Document Retrievability in Patent Retrieval Settings
DEXA '09 Proceedings of the 20th International Conference on Database and Expert Systems Applications
Mining Negative Relevance Feedback for Information Filtering
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Retrieval experiments using pseudo-desktop collections
Proceedings of the 18th ACM conference on Information and knowledge management
A machine learning approach for improved BM25 retrieval
Proceedings of the 18th ACM conference on Information and knowledge management
The Probabilistic Relevance Framework: BM25 and Beyond
Foundations and Trends in Information Retrieval
Modelling field dependencies on structured documents with fuzzy logic
FUZZ-IEEE'09 Proceedings of the 18th international conference on Fuzzy Systems
Multinomial randomness models for retrieval with document fields
ECIR'07 Proceedings of the 29th European conference on IR research
Setting per-field normalisation hyper-parameters for the named-page finding search task
ECIR'07 Proceedings of the 29th European conference on IR research
A Bayesian approach for learning document type relevance
ECIR'07 Proceedings of the 29th European conference on IR research
The anatomy of an ad: structured indexing and retrieval for sponsored search
Proceedings of the 19th international conference on World wide web
INEX+DBPEDIA: a corpus for semantic search evaluation
Proceedings of the 19th international conference on World wide web
Document clustering of scientific texts using citation contexts
Information Retrieval
Extending weighting models with a term quality measure
SPIRE'07 Proceedings of the 14th international conference on String processing and information retrieval
Augmenting human memory using personal lifelogs
Proceedings of the 1st Augmented Human International Conference
Book search experiments: investigating IR methods for the indexing and retrieval of books
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Probabilistic document length priors for language models
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
Applying maximum entropy to known-item email retrieval
ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
Crosslanguage retrieval based on Wikipedia statistics
CLEF'08 Proceedings of the 9th Cross-language evaluation forum conference on Evaluating systems for multilingual and multimodal information access
Ranking using multiple document types in desktop search
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
How good is a span of terms?: exploiting proximity to improve web retrieval
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Multi-style language model for web scale information retrieval
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
A framework for BM25F-based XML retrieval
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Mining positive and negative patterns for relevance feature discovery
Proceedings of the 16th ACM SIGKDD international conference on Knowledge discovery and data mining
Mining Historic Query Trails to Label Long and Rare Search Engine Queries
ACM Transactions on the Web (TWEB)
LETOR: A benchmark collection for research on learning to rank for information retrieval
Information Retrieval
Understanding the semantic structure of noun phrase queries
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
A study of information retrieval weighting schemes for sentiment analysis
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Using BM25F for semantic search
Proceedings of the 3rd International Semantic Search Workshop
Structured data retrieval using cover density ranking
Proceedings of the 2nd International Workshop on Keyword Search on Structured Data
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
INEX'09 Proceedings of the Focused retrieval and evaluation, and 8th international conference on Initiative for the evaluation of XML retrieval
Achieving high precisions with peer-to-peer is possible!
INEX'09 Proceedings of the Focused retrieval and evaluation, and 8th international conference on Initiative for the evaluation of XML retrieval
University of waterloo at INEX 2009: ad hoc, book, entity ranking, and link-the-wiki tracks
INEX'09 Proceedings of the Focused retrieval and evaluation, and 8th international conference on Initiative for the evaluation of XML retrieval
Biomedical information retrieval: the BioTracer approach
ITBAM'10 Proceedings of the First international conference on Information technology in bio- and medical informatics
CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
UNIBA-SENSE @ CLEF 2009: robust WSD task
CLEF'09 Proceedings of the 10th cross-language evaluation forum conference on Multilingual information access evaluation: text retrieval experiments
Scalable clustering of news search results
Proceedings of the fourth ACM international conference on Web search and data mining
LambdaMerge: merging the results of query reformulations
Proceedings of the fourth ACM international conference on Web search and data mining
Towards a collection-based results diversification
RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
VisualWikiCurator: human and machine intelligencefor organizing wiki content
Proceedings of the 16th international conference on Intelligent user interfaces
Mail2Wiki: posting and curating Wiki content from email
Proceedings of the 16th international conference on Intelligent user interfaces
Topic Distillation with Query-Dependent Link Connections and Page Characteristics
ACM Transactions on the Web (TWEB)
AskHERMES: An online question answering system for complex clinical questions
Journal of Biomedical Informatics
Enhancing web search with entity intent
Proceedings of the 20th international conference companion on World wide web
Identifying primary content from web pages and its application to web search ranking
Proceedings of the 20th international conference companion on World wide web
Modeling term proximity for probabilistic information retrieval models
Information Sciences: an International Journal
VisualWikiCurator: a corporate Wiki plugin
CHI '11 Extended Abstracts on Human Factors in Computing Systems
Improving retrievability and recall by automatic corpus partitioning
Transactions on large-scale data- and knowledge-centered systems II
Improving retrievability and recall by automatic corpus partitioning
Transactions on large-scale data- and knowledge-centered systems II
Bridging link and query intent to enhance web search
Proceedings of the 22nd ACM conference on Hypertext and hypermedia
Fractional similarity: cross-lingual feature selection for search
ECIR'11 Proceedings of the 33rd European conference on Advances in information retrieval
A pattern mining approach for information filtering systems
Information Retrieval
Incorporating web browsing activities into anchor texts for web search
Information Retrieval
A source independent framework for research paper recommendation
Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Learning to rank for freshness and relevance
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Predicting web searcher satisfaction with existing community-based answers
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Indexing strategies for graceful degradation of search quality
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Negation for document re-ranking in ad-hoc retrieval
ICTIR'11 Proceedings of the Third international conference on Advances in information retrieval theory
Relaxed global term weights for XML element search
INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval
XML retrieval more efficient using double scoring scheme
INEX'10 Proceedings of the 9th international conference on Initiative for the evaluation of XML retrieval: comparative evaluation of focused retrieval
Supporting biomedical information retrieval: the bioTracer approach
Transactions on large-scale data- and knowledge-centered systems IV
Efficiency optimizations for interpolating subqueries
Proceedings of the 20th ACM international conference on Information and knowledge management
Learning to rank results in relational keyword search
Proceedings of the 20th ACM international conference on Information and knowledge management
Adaptive term frequency normalization for BM25
Proceedings of the 20th ACM international conference on Information and knowledge management
Indexing and weighting of multilingual and mixed documents
Proceedings of the South African Institute of Computer Scientists and Information Technologists Conference on Knowledge, Innovation and Leadership in a Diverse, Multidisciplinary Environment
Relevance feedback between hypertext and Semantic Web search: Frameworks and evaluation
Web Semantics: Science, Services and Agents on the World Wide Web
Field-weighted XML retrieval based on BM25
INEX'05 Proceedings of the 4th international conference on Initiative for the Evaluation of XML Retrieval
Mail2Wiki: low-cost sharing and early curation from email to wikis
Proceedings of the 5th International Conference on Communities and Technologies
A two-stage decision model for information filtering
Decision Support Systems
Effective query formulation with multiple information sources
Proceedings of the fifth ACM international conference on Web search and data mining
Evaluating search in personal social media collections
Proceedings of the fifth ACM international conference on Web search and data mining
Mining anchor text trends for retrieval
ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Evaluating the potential of explicit phrases for retrieval quality
ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
ESWC'10 Proceedings of the 7th international conference on The Semantic Web: research and Applications - Volume Part II
Knowledge modeling in prior art search
IRFC'10 Proceedings of the First international Information Retrieval Facility conference on Adbances in Multidisciplinary Retrieval
XML-Structured documents: retrievable units and inheritance
FQAS'06 Proceedings of the 7th international conference on Flexible Query Answering Systems
Towards more accurate retrieval of duplicate bug reports
ASE '11 Proceedings of the 2011 26th IEEE/ACM International Conference on Automated Software Engineering
Survey on web spam detection: principles and algorithms
ACM SIGKDD Explorations Newsletter
Using anchor text for homepage and topic distillation search tasks
Journal of the American Society for Information Science and Technology
A unified context model for web image retrieval
ACM Transactions on Multimedia Computing, Communications, and Applications (TOMCCAP)
Text classifiers for automatic articles categorization
ICAISC'12 Proceedings of the 11th international conference on Artificial Intelligence and Soft Computing - Volume Part II
A schema-driven approach for knowledge-oriented retrieval and query formulation
KEYS '12 Proceedings of the Third International Workshop on Keyword Search on Structured Data
A field relevance model for structured document retrieval
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
On the modeling of entities for ad-hoc entity search in the web of data
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
A log-logistic model-based interpretation of TF normalization of BM25
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
When simple is (more than) good enough: effective semantic search with (almost) no semantics
ECIR'12 Proceedings of the 34th European conference on Advances in Information Retrieval
Effective query generation and postprocessing strategies for prior art patent search
Journal of the American Society for Information Science and Technology
How to search in MPEG-7 based semantic descriptions: an evaluation of metrics
Multimedia Tools and Applications
Spoken Content Retrieval: A Survey of Techniques and Technologies
Foundations and Trends in Information Retrieval
Extending BM25 with multiple query operators
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Relevance as a subjective and situational multidimensional concept
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
ChatNoir: a search engine for the ClueWeb09 corpus
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Rewarding term location information to enhance probabilistic information retrieval
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Duplicate bug report detection with a combination of information retrieval and topic modeling
Proceedings of the 27th IEEE/ACM International Conference on Automated Software Engineering
The University of Lisbon at CLEF 2006 ad-hoc task
CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
Dublin City University at CLEF 2006: cross-language speech retrieval (CL-SR) experiments
CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
The University of Lisbon at GeoCLEF 2006
CLEF'06 Proceedings of the 7th international conference on Cross-Language Evaluation Forum: evaluation of multilingual and multi-modal information retrieval
Learning to rank duplicate bug reports
Proceedings of the 21st ACM international conference on Information and knowledge management
Effective retrieval model for entity with multi-valued attributes: BM25MF and beyond
EKAW'12 Proceedings of the 18th international conference on Knowledge Engineering and Knowledge Management
Modeling geographic, temporal, and proximity contexts for improving geotemporal search
Journal of the American Society for Information Science and Technology
High performance query expansion using adaptive co-training
Information Processing and Management: an International Journal
Exploiting user comments for audio-visual content indexing and retrieval
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Aggregating evidence from hospital departments to improve medical records search
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Improving ESA with document similarity
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Multimedia information seeking through search and hyperlinking
Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
A supervised machine learning classification algorithm for research articles
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Copulas for information retrieval
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Reading contexts for structured documents retrieval
Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
Relevance in microblogs: enhancing tweet retrieval using hyperlinked documents
Proceedings of the 10th Conference on Open Research Areas in Information Retrieval
About learning models with multiple query-dependent features
ACM Transactions on Information Systems (TOIS)
Behavioral dynamics on the web: Learning, modeling, and prediction
ACM Transactions on Information Systems (TOIS)
Exploiting Forum Thread Structures to Improve Thread Clustering
Proceedings of the 2013 Conference on the Theory of Information Retrieval
The Impacts of Structural Difference and Temporality of Tweets on Retrieval Effectiveness
ACM Transactions on Information Systems (TOIS)
A study of supervised term weighting scheme for sentiment analysis
Expert Systems with Applications: An International Journal
Text mining in negative relevance feedback
Web Intelligence and Agent Systems
Web Intelligence and Agent Systems
Hi-index | 0.00 |
This paper describes a simple way of adapting the BM25 ranking formula to deal with structured documents. In the past it has been common to compute scores for the individual fields (e.g. title and body) independently and then combine these scores (typically linearly) to arrive at a final score for the document. We highlight how this approach can lead to poor performance by breaking the carefully constructed non-linear saturation of term frequency in the BM25 function. We propose a much more intuitive alternative which weights term frequencies before the non-linear term frequency saturation function is applied. In this scheme, a structured document with a title weight of two is mapped to an unstructured document with the title content repeated twice. This more verbose unstructured document is then ranked in the usual way. We demonstrate the advantages of this method with experiments on Reuters Vol1 and the TREC dotGov collection.