An evaluation of retrieval effectiveness for a full-text document-retrieval system
Communications of the ACM
Indeterminacy in the subject access to documents
Information Processing and Management: an International Journal
The vocabulary problem in human-system communication
Communications of the ACM
An extended relational document retrieval model
Information Processing and Management: an International Journal - The Potential for Improvments in Commerical Document Retrieval Systems
Language and representation in information retrieval
Language and representation in information retrieval
Full-text information retrieval: further analysis and clarification
Information Processing and Management: an International Journal
Foundations of Probabilistic and Utility-Theoretic Indexing
Journal of the ACM (JACM)
The data-document distinction in information retrieval
Communications of the ACM
Information Retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Information Processing and Management: an International Journal
Some thoughts on the reported results of TREC
Information Processing and Management: an International Journal
Information Processing and Management: an International Journal
Some thoughts on the reported results of TREC
Information Processing and Management: an International Journal
Bibliographic database access using free-text and controlled vocabulary: an evaluation
Information Processing and Management: an International Journal
The data-document distinction revisited
ACM SIGMIS Database
Evaluating epistemic uncertainty under incomplete assessments
Information Processing and Management: an International Journal
Patterns and measures of digitalisation in business unit communication
International Journal of Business Information Systems
Query side evaluation: an empirical analysis of effectiveness and effort
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
With the growing focus on what is collectively known as "knowledge management", a shift continues to take place in commercial information system development: a shift away from the well-understood data retrieval/database model, to the more complex and challenging development of commercial document/ information retrieval models. While document retrieval has had a long and rich legacy of research, its impact on commercial applications has been modest. At the enterprise level most large organizations have little understanding of, or commitment to, high quality document access and management. Part of the reason for this is that we still do not have a good framework for understanding the major factors which affect the performance of large-scale corporate document retrieval systems. The thesis of this discussion is that document retrieval - specifically, access to intellectual content - is a complex process which is most strongly influenced by three factors: the size of the document collection; the type of search (exhaustive, existence or sample); and, the determinacy of document representation. Collectively, these factors can be used to provide a useful framework for, or taxonomy of, document retrieval, and highlight some of the fundamental issues facing the design and development of commercial document retrieval systems. This is the first of a series of three articles. Part II (D.C. Blair, The challenge of commercial document retrieval. Part II. A strategy for document searching based on identifiable document partitions, Information Processing and Management, 2001b, this issue) will discuss the implications of this framework for search strategy, and Part III (D.C. Blair, Some thoughts on the reported results of Text REtrieval Conference (TREC), Information Processing and Management, 2002, forthcoming) will consider the importance of the TREC results for our understanding of operating information retrieval systems.