Scaling-Up Item-Based Collaborative Filtering Recommendation Algorithm Based on Hadoop

  • Authors:
  • Jing Jiang;Jie Lu;Guangquan Zhang;Guodong Long

  • Affiliations:
  • -;-;-;-

  • Venue:
  • SERVICES '11 Proceedings of the 2011 IEEE World Congress on Services
  • Year:
  • 2011

Quantified Score

Hi-index 0.00

Visualization

Abstract

Collaborative filtering (CF) techniques have achieved widespread success in E-commerce nowadays. The tremendous growth of the number of customers and products in recent years poses some key challenges for recommender systems in which high quality recommendations are required and more recommendations per second for millions of customers and products need to be performed. Thus, the improvement of scalability and efficiency of collaborative filtering (CF) algorithms become increasingly important and difficult. In this paper, we developed and implemented a scaling-up item-based collaborative filtering algorithm on MapReduce, by splitting the three most costly computations in the proposed algorithm into four Map-Reduce phases, each of which can be independently executed on different nodes in parallel. We also proposed efficient partition strategies not only to enable the parallel computation in each Map-Reduce phase but also to maximize data locality to minimize the communication cost. Experimental results effectively showed the good performance in scalability and efficiency of the item-based CF algorithm on a Hadoop cluster.