The application of thesauri in networked environments is seriously hampered by the difficulty of introducing new concepts and terminology into the formal controlled vocabulary, a step that is critical for enhancing retrieval capability. The author describes an automated process for adding new terms to thesauri as entry vocabulary by analyzing the association between words and phrases extracted from bibliographic titles and the subject descriptors in the metadata records. (Subject descriptors are terms assigned from the controlled vocabulary of a thesaurus to describe the subjects of the objects, e.g., books or articles, that the metadata records represent.) The investigated approach uses a corpus of metadata for scientific and technical (S&T) publications, in which the titles contain substantive words for key topics. The method has three steps: (a) extracting words and phrases from the title field of the metadata; (b) identifying and selecting specific, meaningful keywords based on the associated controlled vocabulary terms from the thesaurus used to catalog the objects; and (c) inserting the selected keywords into the thesaurus as new terms (most of them in hierarchical relationships with existing concepts), thereby updating the thesaurus with terminology currently used in the literature. The effectiveness of the method was demonstrated by an experiment with the Chinese Classification Thesaurus (CCT) and bibliographic data in China Machine-Readable Cataloging Record (MARC) format (CNMARC) provided by Peking University Library. This approach is equally effective in large-scale collections and in other languages. © 2006 Wiley Periodicals, Inc.
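The three steps (a) to (c) can be illustrated with a minimal Python sketch. It is not the author's implementation: the abstract does not state the association measure, so the sketch assumes a simple co-occurrence share (the fraction of a candidate's occurrences that fall under one descriptor). The whitespace tokenizer, the thresholds `min_count` and `min_share`, the function names, and the toy English records are all hypothetical; real CNMARC titles in Chinese would first need word segmentation.

```python
from collections import Counter, defaultdict


def extract_candidates(title, max_len=2):
    """Step (a): word n-grams (1..max_len) from a title.

    Assumption: whitespace tokenization, which is not suitable for Chinese.
    """
    words = title.lower().split()
    return {
        " ".join(words[i:i + n])
        for n in range(1, max_len + 1)
        for i in range(len(words) - n + 1)
    }


def select_keywords(records, min_count=2, min_share=0.8):
    """Step (b): keep candidates that co-occur mostly with one descriptor.

    Assumption: association is measured as a plain co-occurrence share.
    """
    cand_total = Counter()            # records containing each candidate
    pair_count = defaultdict(Counter) # candidate -> descriptor -> count
    for title, descriptors in records:
        for cand in extract_candidates(title):
            cand_total[cand] += 1
            for descriptor in descriptors:
                pair_count[cand][descriptor] += 1

    selected = {}
    for cand, total in cand_total.items():
        if total < min_count or not pair_count[cand]:
            continue
        descriptor, together = pair_count[cand].most_common(1)[0]
        if together / total >= min_share and cand != descriptor:
            selected[cand] = descriptor
    return selected


def insert_terms(thesaurus, selected):
    """Step (c): attach each selected keyword under its descriptor.

    Assumption: the thesaurus is a dict mapping a descriptor to a set of
    subordinate or entry terms.
    """
    for term, descriptor in selected.items():
        if descriptor in thesaurus and term not in thesaurus:
            thesaurus[descriptor].add(term)
    return thesaurus


# Toy records: (title, subject descriptors). Invented for illustration only.
records = [
    ("neural network training", ["machine learning"]),
    ("deep neural network models", ["machine learning"]),
    ("network routing protocols", ["computer networks"]),
]
selected = select_keywords(records)
thesaurus = insert_terms(
    {"machine learning": set(), "computer networks": set()}, selected
)
```

In this toy data, "network" appears under two different descriptors and falls below the share threshold, so it is rejected as too general, whereas "neural network" appears only under "machine learning" and is attached there. A stricter `min_share` trades coverage for specificity.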