Inter and intra-document contexts applied in polyrepresentation for best match IR

Authors:
Mette Skov;Birger Larsen;Peter Ingwersen
Affiliations:
Information Interaction and Information Architecture, Royal School of Library and Information Science, Birketinget 6, DK-2300 Copenhagen S, Denmark;Information Interaction and Information Architecture, Royal School of Library and Information Science, Birketinget 6, DK-2300 Copenhagen S, Denmark;Information Interaction and Information Architecture, Royal School of Library and Information Science, Birketinget 6, DK-2300 Copenhagen S, Denmark
Venue:
Information Processing and Management: an International Journal
Year:
2008

Citing 19
Cited 11

Optimizing convenient online access to bibliographic databases

Information Services and Use
Descriptor and citation retrieval in the medical behavioral sciences literature:retrieval overlaps and novelty distribution

Journal of the American Society for Information Science
Information retrieval interaction

Information retrieval interaction
Relevance odds of retrieval overlaps from seven search fields

Information Processing and Management: an International Journal
Combining the evidence of multiple query representations for information retrieval

TREC-2 Proceedings of the second conference on Text retrieval conference
The impact of query structure and query expansion on retrieval performance

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
The anatomy of a large-scale hypertextual Web search engine

WWW7 Proceedings of the seventh international conference on World Wide Web 7
Predicting the effectiveness of Naïve data fusion on the basis of system characteristics

Journal of the American Society for Information Science
Cumulated gain-based evaluation of IR techniques

ACM Transactions on Information Systems (TOIS)
The Co-Effects of Query Structure and Expansion on RetrievalPerformance in Probabilistic Text Retrieval

Information Retrieval
Simple BM25 extension to multiple weighted fields

Proceedings of the thirteenth ACM international conference on Information and knowledge management
The loquacious user: a document-independent source of terms for query expansion

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Evaluating implicit feedback models using searcher simulations

ACM Transactions on Information Systems (TOIS)
Using searcher simulations to redesign a polyrepresentative implicit feedback interface

Information Processing and Management: an International Journal
The polyrepresentation continuum in IR

IIiX Proceedings of the 1st international conference on Information interaction in context
Inter and intra-document contexts applied in polyrepresentation

IIiX Proceedings of the 1st international conference on Information interaction in context
Improving high accuracy retrieval by eliminating the uneven correlation effect in data fusion

Journal of the American Society for Information Science and Technology
Evaluating XML retrieval effectiveness at INEX

ACM SIGIR Forum
The Turn: Integration of Information Seeking and Retrieval in Context

The Turn: Integration of Information Seeking and Retrieval in Context

Using Multiple Query Aspects to Build Test Collections without Human Relevance Judgments

ECIR '09 Proceedings of the 31th European Conference on IR Research on Advances in Information Retrieval
Generative model-based metasearch for data fusion in information retrieval

Proceedings of the 9th ACM/IEEE-CS joint conference on Digital libraries
Supporting polyrepresentation in a quantum-inspired geometrical retrieval framework

Proceedings of the third symposium on Information interaction in context
A subjective logic formalisation of the principle of polyrepresentation for information needs

Proceedings of the third symposium on Information interaction in context
Reconsideration of the simulated work task situation: a context instrument for evaluation of information retrieval interaction

Proceedings of the third symposium on Information interaction in context
On the potential search effectiveness of MeSH (medical subject headings) terms

Proceedings of the third symposium on Information interaction in context
A model for generating related weighted Boolean queries

IEA/AIE'10 Proceedings of the 23rd international conference on Industrial engineering and other applications of applied intelligent systems - Volume Part III
Using anchor text for homepage and topic distillation search tasks

Journal of the American Society for Information Science and Technology
Granules of words to represent text: an approach based on fuzzy relations and spectral clustering

ICCSA'12 Proceedings of the 12th international conference on Computational Science and Its Applications - Volume Part IV
Citations and references as keys to relevance ranking in interactive IR

Proceedings of the 4th Information Interaction in Context Symposium
Extending term suggestion with author names

TPDL'12 Proceedings of the Second international conference on Theory and Practice of Digital Libraries

Quantified Score

Hi-index	0.01

Visualization

Abstract

The principle of polyrepresentation offers a theoretical framework for handling multiple contexts in information retrieval (IR). This paper presents an empirical laboratory study of polyrepresentation in restricted mode of the information space with focus on inter and intra-document features. The Cystic Fibrosis test collection indexed in the best match system InQuery constitutes the experimental setting. Overlaps between five functionally and/or cognitively different document representations are identified. Supporting the principle of polyrepresentation, results show that in general overlaps generated by three or four representations of different nature have higher precision than those generated from two representations or the single fields. This result pertains to both structured and unstructured query mode in best match retrieval, however, with the latter query mode demonstrating higher performance. The retrieval overlaps containing search keys from the bibliographic references provide the best retrieval performance and minor MeSH terms the worst. It is concluded that a highly structured query language is necessary when implementing the principle of polyrepresentation in a best match IR system because the principle is inherently Boolean. Finally a re-ranking test shows promising results when search results are re-ranked according to precision obtained in the overlaps whilst re-ranking by citations seems less useful when integrated into polyrepresentative applications.