A hybrid cache and prefetch mechanism for scientific literature search engines

  • Authors:
  • Huajing Li;Wang-Chien Lee;Anand Sivasubramaniam;C. Lee Giles

  • Affiliations:
  • Department of Computer Science and Engineering, Pennsylvania State University, State College, PA;Department of Computer Science and Engineering, Pennsylvania State University, State College, PA;Department of Computer Science and Engineering, Pennsylvania State University, State College, PA;Department of Computer Science and Engineering and The School of Information Sciences and Technology, Pennsylvania State University, State College, PA

  • Venue:
  • ICWE'07 Proceedings of the 7th international conference on Web engineering
  • Year:
  • 2007

Quantified Score

Hi-index 0.00

Visualization

Abstract

CiteSeer, a scientific literature search engine that focuses on documents in the computer science and information science domains, suffers from scalability issue on the number of requests and the size of indexed documents, which increased dramatically over the years. CiteSeerχ is an effort to re-architect the search engine. In this paper, we present our initial design of a framework for caching query results, indices, and documents. This design is based on analysis of logged workload in CiteSeer. Our experiments based on mock client requests that simulate actual user behaviors confirm that our approach works well in enhancing system performances.