MapReduce-Based data stream processing over large history data

  • Authors:
  • Kaiyuan Qi;Zhuofeng Zhao;Jun Fang;Yanbo Han

  • Affiliations:
  • Cloud Computing Research Center, North China University of Technology, Beijing, China,Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China;Cloud Computing Research Center, North China University of Technology, Beijing, China;Cloud Computing Research Center, North China University of Technology, Beijing, China;Cloud Computing Research Center, North China University of Technology, Beijing, China

  • Venue:
  • ICSOC'12 Proceedings of the 10th international conference on Service-Oriented Computing
  • Year:
  • 2012

Quantified Score

Hi-index 0.00

Visualization

Abstract

With the development of Internet of Things applications based on sensor data, how to process high speed data stream over large scale history data brings a new challenge. This paper proposes a new programming model RTMR, which improves the real-time capability of traditional batch processing based MapReduce by preprocessing and caching, along with pipelining and localizing. Furthermore, to adapt the topologies to application characteristics and cluster environments, a model analysis based RTMR cluster constructing method is proposed. The benchmark built on the urban vehicle monitoring system shows RTMR can provide the real-time capability and scalability for data stream processing over large scale data.