Differential data analysis for recommender systems

  • Authors:
  • Richard Chow;Hongxia Jin;Bart Knijnenburg;Gokay Saldamli

  • Affiliations:
  • Intel Corporation, Santa Clara, CA, USA;Samsung Electronics R&D, San Jose, CA, USA;UC Irvine, Irvine, CA, USA;Samsung Electronics R&D, San Jose, CA, USA

  • Venue:
  • Proceedings of the 7th ACM conference on Recommender systems
  • Year:
  • 2013

Quantified Score

Hi-index 0.00

Visualization

Abstract

We present techniques to characterize which data contributes most to the accuracy of a recommendation algorithm. Our main technique is called differential data analysis. The name is inspired by other sorts of differential analysis, such as differential power analysis and differential cryptanalysis, where insight comes through analysis of slightly differing inputs. In differential data analysis we chunk the data and compare results in the presence or absence of each chunk. We apply differential data analysis to two datasets and three different attributes. The first attribute is called user hardship. This is a novel attribute, particularly relevant to location datasets, that indicates how burdensome a data point was to achieve. The second and third attributes are more standard: timestamp and user rating. For user rating, we confirm previous work concerning the increased importance to the recommender of high and low user ratings.