MapReduce: simplified data processing on large clusters
Communications of the ACM - 50th anniversary issue: 1958 - 2008
Hadoop: The Definitive Guide
Optimizing Multiway Joins in a Map-Reduce Environment
IEEE Transactions on Knowledge and Data Engineering
Implementation of MapReduce-based image conversion module in cloud computing environment
ICOIN '12 Proceedings of the The International Conference on Information Network 2012
Hi-index | 0.00 |
This paper presents a parallel processing model based on Hadoop platform for large-scale images processing, which aims to make use of the advantages of high reliability and high scalability of Hadoop distributed platform for distributed memory and distributed computing, so as to achieve the purpose of fast processing of large-scale images. The Hadoop streaming technology is used in the model. The main operations are written on shell script as the mapper of Hadoop streaming, then an assigned filelist is used as the Hadoop streaming's input. The large numbers of image files are delivered to cluster computers for concurrent image processing. The model has been implemented using virtual machines. A set of experimental results and analysis are provided.