Text representation, central to information processing, must be descriptive and discriminative. Although some of the many techniques for constructing document representations may outperform others on certain tasks, no single technique is consistently better than the rest. Representations are still problematic. Evaluation techniques are needed to penetrate foundational questions about term behavior in representation. A study that applies the shape recovery analysis method is reported here as an evaluative tool to compare different indexing schemes. Three weight coefficients are used to rank indexing terms and are compared to the documents' full text. Two of the weight coefficients are novel and the third relies on the chi-squared distribution. Multidimensional scaling reduces the dimensional space of the document surrogates to a two-dimensional Cartesian space. Ten concentric circles, spaced at 10% intervals of relevant data points and starting at the centroid, are used to construct a precision-recall curve. ANOVA is applied to the 4 x 11 matrix of test data to test whether the four treatments yield the same P-R result. A post hoc Tukey HSD multiple comparisons test among pairwise treatments is also used to discover homogeneous groups. The findings show the value of the methodology for studying term weighting schemes and their descriptiveness and discriminative power, as well as the potential strength of the novel coefficients introduced. © 2008 Wiley Periodicals, Inc.
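The concentric-circle construction of the precision-recall curve can be illustrated with a minimal sketch. It is not the authors' implementation: the function name `shape_recovery_pr_curve` is hypothetical, the circles are assumed to be centered on the centroid of the relevant points, and each circle's radius is assumed to be the smallest one that encloses the next 10% of relevant points. The abstract does not say how the eleventh point of the 4 x 11 matrix is obtained, so only the ten circle-based points are computed here.

```python
import math


def shape_recovery_pr_curve(points, relevant, levels=10):
    """Return (recall, precision) pairs, one per concentric circle.

    points   -- list of (x, y) coordinates, e.g. the output of a 2-D MDS
    relevant -- list of booleans, True where the point is a relevant document
    levels   -- number of circles (10 gives recall steps of 10%)
    """
    rel_pts = [p for p, r in zip(points, relevant) if r]
    if not rel_pts:
        raise ValueError("need at least one relevant point")

    # Assumption: the circles share the centroid of the relevant points.
    cx = sum(x for x, _ in rel_pts) / len(rel_pts)
    cy = sum(y for _, y in rel_pts) / len(rel_pts)

    # Distance of every point (relevant or not) from that centroid.
    dist = [math.hypot(x - cx, y - cy) for x, y in points]
    rel_dist = sorted(d for d, r in zip(dist, relevant) if r)
    n_rel = len(rel_dist)

    curve = []
    for k in range(1, levels + 1):
        # Relevant points the k-th circle must enclose (integer ceiling).
        need = max(1, -(-k * n_rel // levels))
        radius = rel_dist[need - 1]
        # Everything within the circle counts as retrieved.
        inside = [r for d, r in zip(dist, relevant) if d <= radius]
        rel_inside = sum(inside)
        curve.append((rel_inside / n_rel, rel_inside / len(inside)))
    return curve


# Toy data: four relevant points around the origin and one non-relevant point.
toy_points = [(1, 0), (-1, 0), (0, 3), (0, -3), (0, 2)]
toy_relevant = [True, True, True, True, False]
toy_curve = shape_recovery_pr_curve(toy_points, toy_relevant, levels=2)
```

With one such curve per treatment (the three weight coefficients plus full text), the rows could be stacked into the treatment-by-recall-level matrix that the ANOVA and the Tukey HSD test compare.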