Software library construction from an IR perspective

  • Authors:
  • Yoëlle S. Maarek

  • Affiliations:
  • -

  • Venue:
  • ACM SIGIR Forum
  • Year:
  • 1991

Quantified Score

Hi-index 0.00

Visualization

Abstract

The two basic requirements for achieving software reuse are: (1) to provide a sufficient number of components over a spectrum of domains that can be reused as is (black-box reuse) or easily adapted (white-box reuse), and (2) to organize components such that code close to the users' needs is easy to locate. Many attempts have been made at addressing the first issue, (e.g. UNIX) however, as far as the second requirement is concerned very few library systems are available or attractive enough to make actual reuse faster than rewriting from scratch.With the increasing size of natural-language documentation in modern software component collections, there is a growing interest in applying IR techniques to the construction of software libraries. For instance recent tools such as INFoEXPLORER for the IBM RS/6000 series, or ANSWERBOOK for Sun Sparc workstations, provide standard IR techniques for searching on-line documentation. However, due to the nature of software documentation and of reuse requirements, some specific IR techniques can be devised to significantly enhance retrieval effectiveness.In this paper, we identify the necessary requirements to be met by the software collection in order to apply an IR approach, and we describe the specificity of reuse as compared to other applications. As an example, we describe GuRu, an information storage and retrieval system for reuse. This system is only briefly described as it has already presented elsewhere, and we rather concentrate on showing how GuRu satisfies the specific needs of reuse. GURU is currently used at the IBM T.J. Watson Research Center by a growing pool of users. We also provide an experimental test collection with relevance judgments for the AIx 3 command set to be used as a starting ground for evaluating retrieval effectiveness in reuse applications. Finally, we compare GURU's indexing scheme to two other schemes using this test collection.