Estimation of execution time of data-intensive out-of-core processes

  • Authors:
  • Tamás Schrádi;Ákos Dudás;Sándor Juhász

  • Affiliations:
  • Budapest University of Technology and Economics, Department of Automation and Applied Informatics, Budapest, Hungary;Budapest University of Technology and Economics, Department of Automation and Applied Informatics, Budapest, Hungary;Budapest University of Technology and Economics, Department of Automation and Applied Informatics, Budapest, Hungary

  • Venue:
  • ACACOS'12 Proceedings of the 11th WSEAS international conference on Applied Computer and Applied Computational Science
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

In this paper we investigate various out-of-core methods aiming to process large datasets efficiently using state-of-the-art, personal computers. A dataset is considered large in case it does not fit into the main memory. First an eager method will be shown to demonstrate the incapability and inefficiency of direct in-memory processing of huge amounts of data. Afterward two out-of-core extensions will be introduced that use the secondary storage to overcome the difficulties caused by the limited memory. The Periodic Partial Result Merging algorithm operates with smaller chunks, which fit in the main memory and continuously propagates the results on the secondary storage. The K-way Merge technique follows a similar principle, but it separates the processing and the merging phases. The two proposed methods proved to be suitable to process large datasets efficiently in a fault tolerant way. A comparative evaluation of the out-of-core algorithms and a novel model for estimation of their execution time will also be given. The goodness of the model will be validated by comparing its estimation to the results of practical measurements.