Comparing high level mapreduce query languages

  • Authors:
  • Robert J. Stewart;Phil W. Trinder;Hans-Wolfgang Loidl

  • Affiliations:
  • Mathematical and Computer Sciences, Heriot Watt University;Mathematical and Computer Sciences, Heriot Watt University;Mathematical and Computer Sciences, Heriot Watt University

  • Venue:
  • APPT'11 Proceedings of the 9th international conference on Advanced parallel processing technologies
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

The MapReduce parallel computational model is of increasing importance. A number of High Level Query Languages (HLQLs) have been constructed on top of the Hadoop MapReduce realization, primarily Pig, Hive, and JAQL. This paper makes a systematic performance comparison of these three HLQLs, focusing on scale up, scale out and runtime metrics. We further make a language comparison of the HLQLs focusing on conciseness and computational power. The HLQL development communities are engaged in the study, which revealed technical bottlenecks and limitations described in this document, and it is impacting their development.