Differential data analysis for recommender systems

Authors:
Richard Chow;Hongxia Jin;Bart Knijnenburg;Gokay Saldamli
Affiliations:
Intel Corporation, Santa Clara, CA, USA;Samsung Electronics R&D, San Jose, CA, USA;UC Irvine, Irvine, CA, USA;Samsung Electronics R&D, San Jose, CA, USA
Venue:
Proceedings of the 7th ACM conference on Recommender systems
Year:
2013

Abstract

We present techniques to characterize which data contributes most to the accuracy of a recommendation algorithm. Our main technique is called differential data analysis. The name is inspired by other kinds of differential analysis, such as differential power analysis and differential cryptanalysis, where insight comes from analyzing slightly differing inputs. In differential data analysis, we partition the data into chunks and compare the recommender's results with and without each chunk. We apply differential data analysis to two datasets and three different attributes. The first attribute, user hardship, is novel and particularly relevant to location datasets: it indicates how burdensome a data point was to obtain. The second and third attributes are more standard: timestamp and user rating. For user rating, we confirm previous work showing that high and low user ratings are of increased importance to the recommender.
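The chunk-and-compare idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation. The record layout `(user, item, rating)`, the function names `chunk_by` and `differential_analysis`, and the toy item-mean predictor scored by mean absolute error are all assumptions made for illustration. Any recommender and accuracy metric could be substituted for `item_mean_mae`.

```python
from collections import defaultdict
from typing import Callable, Dict, Hashable, List, Sequence, Tuple

# Assumed record layout (not from the paper): (user, item, rating).
Rating = Tuple[str, str, float]


def chunk_by(data: Sequence[Rating],
             key: Callable[[Rating], Hashable]) -> Dict[Hashable, List[Rating]]:
    """Group records into chunks that share the same attribute value."""
    chunks: Dict[Hashable, List[Rating]] = defaultdict(list)
    for record in data:
        chunks[key(record)].append(record)
    return dict(chunks)


def item_mean_mae(train: Sequence[Rating], test: Sequence[Rating]) -> float:
    """Toy stand-in recommender: predict each item's mean training rating
    and return the mean absolute error on a non-empty test set."""
    sums: Dict[str, float] = defaultdict(float)
    counts: Dict[str, int] = defaultdict(int)
    for _, item, rating in train:
        sums[item] += rating
        counts[item] += 1
    # Fall back to the global mean for items unseen in training.
    global_mean = sum(r for _, _, r in train) / len(train) if train else 0.0
    errors = []
    for _, item, rating in test:
        n = counts.get(item, 0)
        prediction = sums[item] / n if n else global_mean
        errors.append(abs(prediction - rating))
    return sum(errors) / len(errors)


def differential_analysis(train: Sequence[Rating],
                          test: Sequence[Rating],
                          key: Callable[[Rating], Hashable],
                          evaluate: Callable[[Sequence[Rating], Sequence[Rating]], float]
                          ) -> Dict[Hashable, float]:
    """For each chunk, return (error without the chunk) - (baseline error).

    A positive value means the error rose when the chunk was absent,
    so the chunk was contributing to accuracy.
    """
    baseline = evaluate(train, test)
    deltas = {}
    for label in chunk_by(train, key):
        reduced = [r for r in train if key(r) != label]
        deltas[label] = evaluate(reduced, test) - baseline
    return deltas


# Usage on made-up data, chunking by a coarse rating attribute.
train = [("u1", "a", 5.0), ("u2", "a", 4.0), ("u3", "a", 1.0), ("u1", "b", 2.0)]
test = [("u4", "a", 5.0)]
deltas = differential_analysis(
    train, test, key=lambda r: "high" if r[2] >= 4 else "low", evaluate=item_mean_mae
)
print(deltas)
```

The same loop applies to the other attributes named in the abstract by changing only the `key` function, for example bucketing records by timestamp, provided the records carry that field.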