Machine-aided indexing at NASA
Information Processing and Management: an International Journal
Design and implementation of automatic indexing for information retrieval with Arabic documents
Journal of the American Society for Information Science
Information Processing and Management: an International Journal
AUTINDEX: an automatic multilingual indexing system
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Journal of the American Society for Information Science and Technology
Learning Algorithms for Keyphrase Extraction
Information Retrieval
Improving text categorization using the importance of sentences
Information Processing and Management: an International Journal
The role of manually-assigned keywords in query expansion
Information Processing and Management: an International Journal
Variations in use of meta tag descriptions by web pages in different languages
Information Processing and Management: an International Journal
Hi-index | 0.00 |
In this article, the authors analyze the keywords given by authors of scientific articles and the descriptors assigned to the articles to ascertain the presence of the keywords in the descriptors. Six-hundred forty INSPEC (Information Service for Physics, Engineering, and Computing), CAB (Current Agriculture Bibliography) abstracts, ISTA (Information Science and Technology Abstracts), and LISA (Library and Information Science Abstracts) database records were consulted. After detailed comparisons, it was found that keywords provided by authors have an important presence in the database descriptors studied; nearly 25% of all the keywords appeared in exactly the same form as descriptors, with another 21% though normalized, still detected in the descriptors. This means that almost 46% of keywords appear in the descriptors, either as such or after normalization. Elsewhere, three distinct indexing policies appear, one represented by INSPEC and LISA (indexers seem to have freedom to assign the descriptors they deem necessary); another is represented by CAB (no record has fewer than four descriptors and, in general, a large number of descriptors is employed). In contrast, in ISTA, a certain institutional code exists towards economy in indexing because 84% of records contain only four descriptors. © 2007 Wiley Periodicals, Inc.