Text clustering can effectively improve the search results and user experience of information retrieval systems. Traditional text clustering approaches are based on the vector space model, in which a document is represented as a vector using a term-frequency-based weighting scheme. The main disadvantage of this model is that it cannot fully exploit semantic correlations between social annotations and document contents, because a term-frequency-based weighting scheme captures only the number of occurrences of each term in the document. Social annotations of web pages, however, carry fundamental and valuable semantic information and can therefore be used to improve information retrieval systems. In this paper, we investigate and evaluate several extended vector space models that combine social annotations with web page text. In particular, we propose a novel vector space model that computes the semantic correlations between social annotations and web page words. Compared with other vector space models, our experiments show that using semantic correlations between social tags and web page words improves clustering accuracy, with the RI score increasing by 4% to 7%.
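As a rough illustration of the baseline the abstract describes, the sketch below builds raw term-frequency vectors and then extends them with social tags by simple weighted addition. This is not the semantic-correlation model proposed in the paper, whose details the abstract does not give; the function names, the `tag_weight` parameter, and the toy pages and tags are all assumptions made for the example.

```python
import math
from collections import Counter


def tf_vector(tokens):
    """Raw term-frequency vector as a sparse mapping: term -> count."""
    return Counter(tokens)


def combined_vector(page_tokens, tag_tokens, tag_weight=1.0):
    """Hypothetical extended vector: page term counts plus weighted tag counts.

    This additive scheme is an assumption for illustration only; it is not
    the correlation-based weighting proposed in the paper.
    """
    vec = Counter()
    for term, count in tf_vector(page_tokens).items():
        vec[term] += float(count)
    for tag, count in tf_vector(tag_tokens).items():
        vec[tag] += tag_weight * count
    return dict(vec)


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(weight * v.get(term, 0.0) for term, weight in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0
    return dot / (norm_u * norm_v)


# Toy data (invented): two pages that share the ambiguous word "python".
page_a = "python tutorial loops functions".split()
page_b = "snake habitat python reptile".split()
tags_a = ["programming", "code"]
tags_b = ["animals", "wildlife"]

text_sim = cosine(tf_vector(page_a), tf_vector(page_b))
combined_sim = cosine(
    combined_vector(page_a, tags_a),
    combined_vector(page_b, tags_b),
)
```

In this toy case the text-only vectors look fairly similar because both pages contain "python", while the tag-extended vectors are pulled apart by the disjoint tags. A clustering algorithm that uses cosine similarity would therefore be less likely to group the two pages together.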