Les experiences d'indexation a l'inist

  • Authors:
  • J. Royaute;L. Schmitt;E. Olivetan

  • Affiliations:
  • INIST - CNRS, Vandoeuvre Les Nancy, France;INIST - CNRS, Vandoeuvre Les Nancy, France;INIST - CNRS, Vandoeuvre Les Nancy, France

  • Venue:
  • COLING '92 Proceedings of the 14th conference on Computational linguistics - Volume 3
  • Year:
  • 1992

Quantified Score

Hi-index 0.00

Visualization

Abstract

We talk, in this paper, about the operation of indexation at INIST. We present two experiments carried out within the Department of Research and New Products that aim at the automation of indexation process. The first one comes within the scope of scientometric studies on text database. We have developped a software toolbox with which we can build a chain of treatments up to the generation of hyperdocuments. Therefore, indexation from a large corpus of source documents is the first module of that chain. In this part, we use linguistic and statistical methods to produce keywords from a stream of data. Linguitic heuristics are used to extract compound nouns or noun phrases from the text and combinational treatments determine the importance of each term according to the document. Keywords are here the input of an hypertext system. The second one is the development of a workstation for the information specialist integrating a computer-aided indexing system on title and abstract in bibliographical records. This indexing process works on a single bibliographical record and combines both linguistic methods and artificial intelligence (keywords generation). We use the same extraction module based on linguistic and add a knowledge based system to deduce implicit keywords. Finally, we show that the original specifications and purpose of each experiment are different and we start a discussion on the interest of these methods in relation to the kind of indexation wanted and the qualities expected from automatic indexing systems.