An information retrieval system for software components
ACM SIGIR Forum
The effectiveness of a nonsyntatic approach to automatic phrase indexing for document retrieval
Journal of the American Society for Information Science
Software reuse: emerging technology
The Reuse system: cataloging and retrieval of reusable software
Software reuse: emerging technology
A knowledge-base environment for the development of software parts composition systems
ICSE '89 Proceedings of the 11th international conference on Software engineering
NLH/E: a natural language help system
ICSE '89 Proceedings of the 11th international conference on Software engineering
Full text indexing based on lexical relations an application: software libraries
SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
On the application of syntactic methodologies in automatic text analysis
SIGIR '89 Proceedings of the 12th annual international ACM SIGIR conference on Research and development in information retrieval
An incremental conceptual clustering algorithm that reduces input-ordering bias
Advances in artificial intelligence
OOPSLA '91 Conference proceedings on Object-oriented programming systems, languages, and applications
An Information Retrieval Approach for Automatically Constructing Software Libraries
IEEE Transactions on Software Engineering
Information Retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Lexical navigation: visually prompted query expansion and refinement
DL '97 Proceedings of the second ACM international conference on Digital libraries
Multi-level document classifications with self-organising maps
IDEAL'05 Proceedings of the 6th international conference on Intelligent Data Engineering and Automated Learning
Hi-index | 0.00 |
The two basic requirements for achieving software reuse are: (1) to provide a sufficient number of components over a spectrum of domains that can be reused as is (black-box reuse) or easily adapted (white-box reuse), and (2) to organize components such that code close to the users' needs is easy to locate. Many attempts have been made at addressing the first issue, (e.g. UNIX) however, as far as the second requirement is concerned very few library systems are available or attractive enough to make actual reuse faster than rewriting from scratch.With the increasing size of natural-language documentation in modern software component collections, there is a growing interest in applying IR techniques to the construction of software libraries. For instance recent tools such as INFoEXPLORER for the IBM RS/6000 series, or ANSWERBOOK for Sun Sparc workstations, provide standard IR techniques for searching on-line documentation. However, due to the nature of software documentation and of reuse requirements, some specific IR techniques can be devised to significantly enhance retrieval effectiveness.In this paper, we identify the necessary requirements to be met by the software collection in order to apply an IR approach, and we describe the specificity of reuse as compared to other applications. As an example, we describe GuRu, an information storage and retrieval system for reuse. This system is only briefly described as it has already presented elsewhere, and we rather concentrate on showing how GuRu satisfies the specific needs of reuse. GURU is currently used at the IBM T.J. Watson Research Center by a growing pool of users. We also provide an experimental test collection with relevance judgments for the AIx 3 command set to be used as a starting ground for evaluating retrieval effectiveness in reuse applications. Finally, we compare GURU's indexing scheme to two other schemes using this test collection.