Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
Language and representation in information retrieval
Language and representation in information retrieval
ACM SIGIR Forum
Information retrieval: data structures and algorithms
Information retrieval: data structures and algorithms
Ranking documents in thesaurus-based boolean retrieval systems
Information Processing and Management: an International Journal
Information storage and retrieval
Information storage and retrieval
Journal of the American Society for Information Science - Special issue: management of imprecision and uncertainty
Acrophile: an automated acronym extractor and server
DL '00 Proceedings of the fifth ACM conference on Digital libraries
Scalable browsing for large collections: a case study
DL '00 Proceedings of the fifth ACM conference on Digital libraries
ACM SIGIR Forum
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
From research to application: the cite natural language information retrieval system
SIGIR '82 Proceedings of the 5th annual ACM conference on Research and development in information retrieval
Hierarchically Classifying Documents Using Very Few Words
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Improving Text Classification by Shrinkage in a Hierarchy of Classes
ICML '98 Proceedings of the Fifteenth International Conference on Machine Learning
Cooperative Indexing Classification and Evaluation in BoW
CooplS '02 Proceedings of the 7th International Conference on Cooperative Information Systems
The SMART Retrieval System—Experiments in Automatic Document Processing
The SMART Retrieval System—Experiments in Automatic Document Processing
Cha-Cha: a system for organizing intranet search results
USITS'99 Proceedings of the 2nd conference on USENIX Symposium on Internet Technologies and Systems - Volume 2
Interactive Timeline Viewer (ItLv): A Tool to Visualize Variants Among Documents
Visual Interfaces to Digital Libraries [JCDL 2002 Workshop]
Hierarchical indexing and flexible element retrieval for structured document
ECIR'03 Proceedings of the 25th European conference on IR research
Navigating among search results: an information content approach
WISE'07 Proceedings of the 8th international conference on Web information systems engineering
Hi-index | 0.00 |
BoW is an on-line bibliographical repository based on a hierarchical c oncept index to which entries are linked. Searching in the repository should therefore return matching topics from the hierarchy, rather than just a list of entries. Likewise, when new entries are inserted, a search for relevant topics to which they should be linked is required. We develop a vector-based algorithm that creates keyword vectors for the set of competing topics at each node in the hierarchy, and show how its performance improves when domain-specific features are added (such as special handling of topic titles and author names). The results of a 7-fold cross validation on a corpus of some 3,500 entries with a 5-level index are hit ratios in the range of 89-95%, and most of the misclassifications are indeed ambiguous to begin with.