Using mixture models for collaborative filtering

Authors:
Jon Kleinberg;Mark Sandler
Affiliations:
Cornell University, Ithaca, NY;Cornell University, Ithaca, NY
Venue:
STOC '04 Proceedings of the thirty-sixth annual ACM symposium on Theory of computing
Year:
2004

Citing 10
Cited 15

Recommender systems

Communications of the ACM
Matrix computations (3rd ed.)

Matrix computations (3rd ed.)
Distributional clustering of words for text classification

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Spectral analysis of data

STOC '01 Proceedings of the thirty-third annual ACM symposium on Theory of computing
Competitive recommendation systems

STOC '02 Proceedings of the thiry-fourth annual ACM symposium on Theory of computing
Amazon.com Recommendations: Item-to-Item Collaborative Filtering

IEEE Internet Computing
Latent Class Models for Collaborative Filtering

IJCAI '99 Proceedings of the Sixteenth International Joint Conference on Artificial Intelligence
Convergent algorithms for collaborative filtering

Proceedings of the 4th ACM conference on Electronic commerce
Recommendation Systems: A Probabilistic Analysis

FOCS '98 Proceedings of the 39th Annual Symposium on Foundations of Computer Science
Empirical analysis of predictive algorithms for collaborative filtering

UAI'98 Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence

On the use of linear programming for unsupervised text classification

Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Tensor-CUR decompositions for tensor-based data

Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Measuring playlist diversity for recommendation systems

Proceedings of the 1st ACM workshop on Audio and music computing multimedia
Hierarchical mixture models: a probabilistic analysis

Proceedings of the 13th ACM SIGKDD international conference on Knowledge discovery and data mining
A rigorous analysis of population stratification with limited data

SODA '07 Proceedings of the eighteenth annual ACM-SIAM symposium on Discrete algorithms
Sampling algorithms and coresets for ℓp regression

Proceedings of the nineteenth annual ACM-SIAM symposium on Discrete algorithms
Addressing cold-start problem in recommendation systems

Proceedings of the 2nd international conference on Ubiquitous information management and communication
Theory research at Google

ACM SIGACT News
Comparing State-of-the-Art Collaborative Filtering Systems

MLDM '07 Proceedings of the 5th international conference on Machine Learning and Data Mining in Pattern Recognition
A random walk method for alleviating the sparsity problem in collaborative filtering

Proceedings of the 2008 ACM conference on Recommender systems
Collaborative scoring with dishonest participants

Proceedings of the twenty-second annual ACM symposium on Parallelism in algorithms and architectures
Coresets and sketches for high dimensional subspace approximation problems

SODA '10 Proceedings of the twenty-first annual ACM-SIAM symposium on Discrete Algorithms
Shopping for products you don't know you need

Proceedings of the fourth ACM international conference on Web search and data mining
A novel protocol for communicating reputation in p2p networks

iTrust'06 Proceedings of the 4th international conference on Trust Management
Mining influential bloggers: From general to domain specific, from explicit to implicit

International Journal of Knowledge-based and Intelligent Engineering Systems

Quantified Score

Hi-index	0.00

Visualization

Abstract

A collaborative filtering system at an e-commerce site or similar service uses data about aggregate user behavior to make recommendations tailored to specific user interests. We develop recommendation algorithms with provable performance guarantees in a probabilistic mixture model for collaborative filtering proposed by Hoffman and Puzicha. We identify certain novel parameters of mixture models that are closely connected with the best achievable performance of a recommendation algorithm; we show that for any system in which these parameters are bounded, it is possible to give recommendations whose quality converges to optimal as the amount of data grows.All our bounds depend on a new measure of independence that can be viewed as an L1-analogue of the smallest singular value of a matrix. Using this, we introduce a technique based on generalized pseudoinverse matrices and linear programming for handling sets of high-dimensional vectors. We also show that standard approaches based on L2-spectral methods are not strong enough to yield comparable results, thereby suggesting some inherent limitations of spectral analysis.