Automatic processing of foreign language documents

  • Authors:
  • G. Salton

  • Affiliations:
  • Cornell University, Ithaca, N.Y.

  • Venue:
  • COLING '69 Proceedings of the 1969 conference on Computational linguistics
  • Year:
  • 1969

Quantified Score

Hi-index 0.00

Visualization

Abstract

Experiments conducted over the last few years with the SMART document retrieval system have shown that fully automatic text processing methods using relatively simple linguistic tools are as effective for purposes of document indexing, classification, search, and retrieval as the more elaborate manual methods normally used in practice. Up to now, all experiments were carried out entirely with English language queries and documents.The present study describes an extension of the SMART procedures to German language materials. A multi-lingual thesaurus is used for the analysis of documents and search requests, and tools are provided which make it possible to process English language documents against German queries, and vice versa. The methods are evaluated, and it is shown that the effectiveness of the mixed language processing is approximately equivalent to that of the standard process operating within a single language only.