CiteSeerx: a cloud perspective

  • Authors:
  • Pradeep B. Teregowda;Bhuvan Urgaonkar;C. Lee Giles

  • Affiliations:
  • Pennsylvania State University;Pennsylvania State University;Pennsylvania State University

  • Venue:
  • HotCloud'10 Proceedings of the 2nd USENIX conference on Hot topics in cloud computing
  • Year:
  • 2010

Abstract

Information retrieval applications are good candidates for hosting in a cloud infrastructure. CiteSeerx, a digital library and search engine, was built with the goal of efficiently disseminating scientific information and literature over the web. The framework for CiteSeerx, as an application of the SeerSuite software, is designed with extensibility and scalability as fundamental features. This loosely coupled architecture with service-oriented interfaces allows the whole or parts of SeerSuite to be readily placed in the cloud. We briefly discuss the architecture, approaches, and advantages of hosting CiteSeerx in the cloud. We present initial results on the costs of migrating the whole or parts of CiteSeerx to two popular cloud offerings and discuss the effort involved.