Tag recommendation for large-scale ontology-based information systems

Authors:
Roman Prokofyev;Alexey Boyarsky;Oleg Ruchayskiy;Karl Aberer;Gianluca Demartini;Philippe Cudré-Mauroux
Affiliations:
eXascale Infolab, University of Fribourg, Switzerland;Ecole Polytechnique Fédérale de Lausanne, Switzerland,Instituut-Lorentz for Theoretical Physics, U. Leiden, The Netherlands,Bogolyubov Institute for Theoretical Physics, Kiev, Ukraine;CERN TH-Division, PH-TH, Geneva, Switzerland;Ecole Polytechnique Fédérale de Lausanne, Switzerland;eXascale Infolab, University of Fribourg, Switzerland;eXascale Infolab, University of Fribourg, Switzerland
Venue:
ISWC'12 Proceedings of the 11th international conference on The Semantic Web - Volume Part II
Year:
2012

Citing 10
Cited 1

Term-weighting approaches in automatic text retrieval

Information Processing and Management: an International Journal
An algorithm for suffix stripping

Readings in information retrieval
Introduction to Modern Information Retrieval

Introduction to Modern Information Retrieval
Latent dirichlet allocation

The Journal of Machine Learning Research
AutoTag: a collaborative approach to automated tag assignment for weblog posts

Proceedings of the 15th international conference on World Wide Web
The JIGSAW Algorithm for Word Sense Disambiguation and Semantic Indexing of Documents

AI*IA '07 Proceedings of the 10th Congress of the Italian Association for Artificial Intelligence on AI*IA 2007: Artificial Intelligence and Human-Oriented Computing
Semantic Provenance for eScience: Managing the Deluge of Scientific Data

IEEE Internet Computing
Tag recommendations in social bookmarking systems

AI Communications
Neighborhood-Based Tag Prediction

ESWC 2009 Heraklion Proceedings of the 6th European Semantic Web Conference on The Semantic Web: Research and Applications
Combining inverted indices and structured search for ad-hoc object retrieval

SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval

Tag recommendation for open source software

Frontiers of Computer Science: Selected Publications from Chinese Universities

Quantified Score

Hi-index	0.00

Visualization

Abstract

We tackle the problem of improving the relevance of automatically selected tags in large-scale ontology-based information systems. Contrary to traditional settings where tags can be chosen arbitrarily, we focus on the problem of recommending tags (e.g., concepts) directly from a collaborative, user-driven ontology. We compare the effectiveness of a series of approaches to select the best tags ranging from traditional IR techniques such as TF/IDF weighting to novel techniques based on ontological distances and latent Dirichlet allocation. All our experiments are run against a real corpus of tags and documents extracted from the ScienceWise portal, which is connected to ArXiv.org and is currently used by growing number of researchers. The datasets for the experiments are made available online for reproducibility purposes.