An Empirical Study on the Effect of Different Similarity Measures on User-Based Collaborative Filtering Algorithms

Authors:
Ashish Sureka;Pranav Prabhakar Mirajkar
Affiliations:
Software Engineering and Technology Labs (SETLabs), Infosys Technologies Limited, India;Software Engineering and Technology Labs (SETLabs), Infosys Technologies Limited, India
Venue:
PRICAI '08 Proceedings of the 10th Pacific Rim International Conference on Artificial Intelligence: Trends in Artificial Intelligence
Year:
2008

Citing 5
Cited 0

An Empirical Analysis of Design Choices in Neighborhood-Based Collaborative Filtering Algorithms

Information Retrieval
Amazon.com Recommendations: Item-to-Item Collaborative Filtering

IEEE Internet Computing
Toward the Next Generation of Recommender Systems: A Survey of the State-of-the-Art and Possible Extensions

IEEE Transactions on Knowledge and Data Engineering
Evaluating similarity measures: a large-scale study in the orkut social network

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
A new similarity measure for collaborative filtering to alleviate the new user cold-starting problem

Information Sciences: an International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Computation of similarity between user profiles (user rating vectors) is one of the core components of user-based (k-nearest-neighborhood based) collaborative filtering algorithms. Present techniques work by identifying or selecting a similarity function by the designer of the recommendation engine and keeping it fixed throughout the collaborative filtering process and using the same function to compute the neighborhood of every user. However, we found that there is no single similarity measure that gives best predictive accuracy for all users. We see this as a limitation of current systems. For the same user, applying different similarity functions results in different predictive accuracy. We propose that the accuracy of user-based collaborative filtering recommendation engines can be further increased by learning an optimal similarity function for a particular user and by applying different similarity measure for different users. We present an empirical study on the effect of eleven different similarity measures on the predictive accuracy of user-based collaborative filtering algorithms.