Time dependency of data quality for collaborative filtering algorithms

  • Authors:
  • Toon De Pessemier;Simon Dooms;Tom Deryckere;Luc Martens

  • Affiliations:
  • IBBT-Ghent University, Ghent, Belgium;IBBT-Ghent University, Ghent, Belgium;IBBT-Ghent University, Ghent, Belgium;IBBT-Ghent University, Ghent, Belgium

  • Venue:
  • Proceedings of the fourth ACM conference on Recommender systems
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

The efficiency of personal suggestions generated by collaborative filtering techniques is highly dependent on the quality and quantity of the available consumption data. Extending data sets with additional consumption data (from the past) might enrich the user profiles and generally leads to more accurate recommendations. Although if a considerable amount of profile information is already available and detailed personal preferences can be derived, supplementary consumption data may not have any (or a very limited) added value for the recommendation algorithm. These additional consumption data increase the required storage capacity and the computational load to generate the personal recommendations. Moreover, since personal preferences and the relevance of content items may vary over time, older consumption data might be outdated and lead to inaccurate recommendations. Therefore, we investigate which consumption data are (the most) relevant to feed the conventional collaborative filtering algorithms. For provider-generated content systems, we demonstrate that the accuracy of collaborative filtering algorithms increases by extending user profiles with additional older consumption data. In contrast, we witness the opposite effect for user-generated content systems: involving older consumption data has a negative influence on the recommender accuracy. These results are important for website owners who intend to employ a recommendation system at a minimum storage and computation cost.