Cloudpress 2.0: a next generation news retrieval system on the cloud with a built-in summarizer

  • Authors:
  • Arockia Anand Raj;T. Mala

  • Affiliations:
  • Anna University, Chennai, India;Anna University, Chennai, India

  • Venue:
  • Proceedings of the International Conference on Advances in Computing, Communications and Informatics
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

Enormous amount of news articles are added and updated on the Internet round-the-clock. This requires frequent and intensive processing by the news retrieval system. The news retrieval systems in use today, barely meet this requirement. Cloudpress 2.0 presented in this paper, is designed and implemented to be scalable, robust and fault tolerant. It is designed to exploit MapReduce paradigm for fetching, processing, organizing and summarizing all the news articles and to use the power of the Cloud computing. Furthermore, it uses novel approaches for parallel processing, for storing the news articles in a distributed database and for visualizing them as a 3D visual. It also includes a novel query expansion feature for searching the news articles. Cloudpress 2.0 also allows on-the-fly, extractive summarization of news articles based on the input query.