Exemplary documents: a foundation for information retrieval design

Authors:
David C. Blair;Steven O. Kimbrough
Affiliations:
Computer and Information Systems, Graduate School of Business, University of Michigan, Ann Arbor, MI;The Wharton School, University of Pennsylvania, 1322 Steinberg Hall-Dietrich Hall, Philadelphia, PA
Venue:
Information Processing and Management: an International Journal
Year:
2002

Citing 6
Cited 5

An evaluation of retrieval effectiveness for a full-text document-retrieval system

Communications of the ACM
Another look at automatic text-retrieval systems

Communications of the ACM
Indeterminacy in the subject access to documents

Information Processing and Management: an International Journal
Language and representation in information retrieval

Language and representation in information retrieval
Full-text information retrieval: further analysis and clarification

Information Processing and Management: an International Journal
Information retrieval and the philosophy of language

The Computer Journal - Special issue on information retrieval

Topic modeling for mediated access to very large document collections

Journal of the American Society for Information Science and Technology
Eliciting better information need descriptions from users of information search systems

Information Processing and Management: an International Journal
Just email it to me!: why things get lost in shared file repositories

GROUP '07 Doctoral Consortium papers
Just email it to me! why things get lost in shared file repositories

INTERACT'07 Proceedings of the 11th IFIP TC 13 international conference on Human-computer interaction - Volume Part II
A bibliometric-based semi-automatic approach to identification of candidate thesaurus terms: parsing and filtering of noun phrases from citation contexts

CoLIS'05 Proceedings of the 5th international conference on Context: conceptions of Library and Information Sciences

Quantified Score

Hi-index	0.00

Visualization

Abstract

Documents are generally represented for retrieval by either extracting index terms from them or by creating and selecting from an external set of candidate terms. There are many procedures for doing this, but while work continues along these dimensions, there have been relatively few attempts to change this basic process. Of particular importance is the creation of indexing schemes for retrieval systems in nonlibrary contexts. Here, the cost of developing an indexing scheme independent of the documents to be retrieved is often considered too high to implement. As a result, simple full-text retrieval or, to a lesser extent, automatic extractive or associative indexing methods are the predominant methods used in nonlibrary contexts. This paper suggests an alternative document representation method based on what we call exemplary documents. Exemplary documents are those documents that describe or exhibit the intellectual structure of a particular field of interest. In so doing, they provide both an indexing vocabulary for that area and, more importantly, a narrative context in which the indexing terms have a clearer meaning. Further, it is much easier to develop an indexing scheme by using exemplary documents than it is to do so from scratch.