CiteSeer: an automatic citation indexing system
Proceedings of the third ACM conference on Digital libraries
Focused crawling: a new approach to topic-specific Web resource discovery
WWW '99 Proceedings of the eighth international conference on World Wide Web
Data mining using high performance data clouds: experimental studies using sector and sphere
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
The cost of doing science on the cloud: the Montage example
Proceedings of the 2008 ACM/IEEE conference on Supercomputing
IEEE Intelligent Systems
Large-scale collaborative analysis and extraction of web data
Proceedings of the VLDB Endowment
The Eucalyptus Open-Source Cloud-Computing System
CCGRID '09 Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid
Search-as-a-service: Outsourced search over outsourced storage
ACM Transactions on the Web (TWEB)
Cloud Computing: A Digital Libraries Perspective
CLOUD '10 Proceedings of the 2010 IEEE 3rd International Conference on Cloud Computing
Disaster recovery as a cloud service: economic benefits & deployment challenges
HotCloud'10 Proceedings of the 2nd USENIX conference on Hot topics in cloud computing
WebApps'10 Proceedings of the 2010 USENIX conference on Web application development
Early experience with the distributed nebula cloud
Proceedings of the fourth international workshop on Data-intensive distributed computing
To move or not to move: the economics of cloud computing
HotCloud'11 Proceedings of the 3rd USENIX conference on Hot topics in cloud computing
An untold story of redundant clouds: making your service deployment truly reliable
Proceedings of the 9th Workshop on Hot Topics in Dependable Systems
Hi-index | 0.00 |
Information retrieval applications are good candidates for hosting in a cloud infrastructure. CiteSeerx a digital library and search engine was built with the goal of efficiently disseminating scientific information and literature over the web. The framework for CiteSeerx as an application of the SeerSuite software is a design built with extensibility and scalability as fundamental features. This loosely coupled architecture with service oriented interfaces allows the whole or parts of SeerSuite to readily be placed in the cloud. We discuss in brief the architecture, approaches, and advantages of hosting CiteSeerx in the cloud. We present initial results on costs of migrating whole or parts of CiteSeerx to two popular cloud offerings as well as discuss the effort involved.