A search engine accepting on-line updates

  • Authors:
  • Mauricio Marin;Carolina Bonacic;Veronica Gil Costa;Carlos Gomez

  • Affiliations:
  • Yahoo! Research, Santiago, University of Chile;ARTECS, Complutense University of Madrid, Spain;DCC, University of San Luis, Argentina;Yahoo! Research, Santiago, University of Chile

  • Venue:
  • Euro-Par'07 Proceedings of the 13th international Euro-Par conference on Parallel Processing
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

We describe and evaluate the performance of a parallel search engine that is able to cope efficiently with concurrent read/write operations. Read operations come in the usual form of queries submitted to the search engine and write ones come in the form of new documents added to the text collection in an on-line manner, namely the insertions are embedded into the main stream of user queries in an unpredictable arrival order but with query results respecting causality. The search engine is built upon distributed inverted files for which we propose generic strategies for load balance and concurrency control.