Hardware for searching very large text databases

  • Authors:
  • Roger Haskin

  • Affiliations:
  • -

  • Venue:
  • CAW '80 Proceedings of the fifth workshop on Computer architecture for non-numeric processing
  • Year:
  • 1980

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper discusses the problem of searching very large text databases. It is shown that conventional techniques for searching current databases cannot be scaled up to larger ones, and that it is necessary to build hardware to search the database in parallel if reasonable search times are expected. The part of the search process requiring the highest bandwidth is scanning the database to detect instances of search terms. Methods of doing this in hardware that have been mentioned in the literature are examined, and design criteria for term matchers are discussed. A new design that uses a nondeterministic finite state automaton to control matching, is introduced, its operation is explained, and the practicality of using it in a real system is discussed.