Time dependency of data quality for collaborative filtering algorithms

Authors:
Toon De Pessemier;Simon Dooms;Tom Deryckere;Luc Martens
Affiliations:
IBBT-Ghent University, Ghent, Belgium;IBBT-Ghent University, Ghent, Belgium;IBBT-Ghent University, Ghent, Belgium;IBBT-Ghent University, Ghent, Belgium
Venue:
Proceedings of the fourth ACM conference on Recommender systems
Year:
2010

Citing 5
Cited 2

Amazon.com Recommendations: Item-to-Item Collaborative Filtering

IEEE Internet Computing
Evaluating collaborative filtering recommender systems

ACM Transactions on Information Systems (TOIS)
I tube, you tube, everybody tubes: analyzing the world's largest user generated content video system

Proceedings of the 7th ACM SIGCOMM conference on Internet measurement
Programming collective intelligence

Programming collective intelligence
Do Metrics Make Recommender Algorithms?

WAINA '09 Proceedings of the 2009 International Conference on Advanced Information Networking and Applications Workshops

A recommendation model for handling dynamics in user profile

ICDCIT'12 Proceedings of the 8th international conference on Distributed Computing and Internet Technology
Effectiveness of the data generated on different time in latent factor model

Proceedings of the 7th ACM conference on Recommender systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

The efficiency of personal suggestions generated by collaborative filtering techniques is highly dependent on the quality and quantity of the available consumption data. Extending data sets with additional consumption data (from the past) might enrich the user profiles and generally leads to more accurate recommendations. Although if a considerable amount of profile information is already available and detailed personal preferences can be derived, supplementary consumption data may not have any (or a very limited) added value for the recommendation algorithm. These additional consumption data increase the required storage capacity and the computational load to generate the personal recommendations. Moreover, since personal preferences and the relevance of content items may vary over time, older consumption data might be outdated and lead to inaccurate recommendations. Therefore, we investigate which consumption data are (the most) relevant to feed the conventional collaborative filtering algorithms. For provider-generated content systems, we demonstrate that the accuracy of collaborative filtering algorithms increases by extending user profiles with additional older consumption data. In contrast, we witness the opposite effect for user-generated content systems: involving older consumption data has a negative influence on the recommender accuracy. These results are important for website owners who intend to employ a recommendation system at a minimum storage and computation cost.