Learning semantic content-based profiles for cross-language recommendations

Authors:
Pasquale Lops;Cataldo Musto;Fedelucio Narducci;Marco de Gemmis;Pierpaolo Basile;Giovanni Semeraro
Affiliations:
University of Bari "Aldo Moro", Italy;University of Bari "Aldo Moro", Italy;University of Bari "Aldo Moro", Italy;University of Bari "Aldo Moro", Italy;University of Bari "Aldo Moro", Italy;University of Bari "Aldo Moro", Italy
Venue:
Proceedings of the First Workshop on Personalised Multilingual Hypertext Retrieval
Year:
2011

Citing 6
Cited 2

Phrasal translation and query expansion techniques for cross-language information retrieval

Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Machine learning in automated text categorization

ACM Computing Surveys (CSUR)
Integrating tags in a semantic content-based recommender

Proceedings of the 2008 ACM conference on Recommender systems
Computing semantic relatedness using Wikipedia-based explicit semantic analysis

IJCAI'07 Proceedings of the 20th international joint conference on Artifical intelligence
A Wikipedia-based multilingual retrieval model

ECIR'08 Proceedings of the IR research, 30th European conference on Advances in information retrieval
MARS: a MultilAnguage Recommender System

Proceedings of the 1st International Workshop on Information Heterogeneity and Fusion in Recommender Systems

Personalised multilingual hypertext retrieval: an overview

Proceedings of the First Workshop on Personalised Multilingual Hypertext Retrieval
PMHR 2011: the first workshop on personalised multilingual hypertext retrieval

ACM SIGIR Forum

Quantified Score

Hi-index	0.00

Visualization

Abstract

The exponential growth of the Web is the most influential factor that contributes to the increasing importance of cross-lingual text retrieval and filtering systems. Indeed, relevant information exists in different languages, thus users need to find documents in languages different from the one the query is formulated in. In this context, an emerging requirement is to sift through the increasing flood of multilingual text: this poses a renewed challenge for designing effective multilingual Information Filtering systems. Content-based filtering systems adapt their behavior to individual users by learning their preferences from documents that were already deemed relevant. The learning process aims to construct a profile of the user that can be later exploited in selecting/recommending relevant items. User profiles are generally represented using keywords in a specific language. For example, if a user likes movies whose plots are written in Italian, a content-based filtering algorithm will learn a profile for that user which contains Italian words, thus failing in recommending movies whose plots are written in English, although they might be definitely interesting. Moreover, keywords suffer of typical Information Retrieval-related problems such as polysemy and synonymy. In this paper, we propose a language-independent content-based recommender system, called MARS (MultilAnguage Recommender System), that builds cross-language user profiles, by shifting the traditional text representation based on keywords, to a more complex language-independent representation based on word meanings. The proposed strategy relies on a knowledge-based word sense disambiguation technique that exploits MultiWordNet as sense inventory. As a consequence, content-based user profiles become language-independent and can be exploited for recommending items represented in a language different from the one used in the content-based user profile. Experiments conducted in a movie recommendation scenario show the effectiveness of the approach.