WordNet: a lexical database for English
Communications of the ACM
Algorithm 457: finding all cliques of an undirected graph
Communications of the ACM
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Similarity Search in High Dimensions via Hashing
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Greedy approximation algorithms for finding dense components in a graph
APPROX '00 Proceedings of the Third International Workshop on Approximation Algorithms for Combinatorial Optimization
The Journal of Machine Learning Research
Using twitter to recommend real-time topical news
Proceedings of the third ACM conference on Recommender systems
Executing SPARQL Queries over the Web of Linked Data
ISWC '09 Proceedings of the 8th International Semantic Web Conference
Recommending twitter users to follow using content and collaborative filtering approaches
Proceedings of the fourth ACM conference on Recommender systems
Discovering users' topics of interest on twitter: a first look
AND '10 Proceedings of the fourth workshop on Analytics for noisy unstructured text data
Hi-index | 0.00 |
In this paper we present a Friend Recommender System for micro-blogging. Traditional batch processing of massive amounts of data makes it difficult to provide a near-real time friend recommender system or even a system that can properly scale to millions of users. In order to overcome these issues, we have designed a solution that represents user-generated micro posts as a set of pseudo-cliques. These graphs are assigned a hash value using an original Concept-Sensitive Hash function, a new sub-kind of Locally-Sensitive Hash functions. Finally, since the user profiles are represented as a binary footprint, the pairwise comparison of footprints using the Hamming distance provides scalability to the recommender system. The paper goes with an online application relying on a large Twitter dataset, so that the reader can freely experiment the system.