Assessing the quality of textual features in social media

Authors:
Flavio Figueiredo;Henrique Pinto;Fabiano Belém;Jussara Almeida;Marcos Gonçalves;David Fernandes;Edleno Moura
Affiliations:
Universidade Federal de Minas Gerais, Department of Computer Science, Belo Horizonte, MG, Brazil;Universidade Federal de Minas Gerais, Department of Computer Science, Belo Horizonte, MG, Brazil;Universidade Federal de Minas Gerais, Department of Computer Science, Belo Horizonte, MG, Brazil;Universidade Federal de Minas Gerais, Department of Computer Science, Belo Horizonte, MG, Brazil;Universidade Federal de Minas Gerais, Department of Computer Science, Belo Horizonte, MG, Brazil;Universidade Federal do Amazonas, Department of Computer Science, Manaus, AM, Brazil;Universidade Federal do Amazonas, Department of Computer Science, Manaus, AM, Brazil
Venue:
Information Processing and Management: an International Journal
Year:
2013


Abstract

Social media accounts for a growing fraction of the content retrieved daily by Web users. However, the potentially low quality of user-generated content poses a challenge to information retrieval services, which rely mostly on the user-generated textual features (particularly tags) commonly associated with multimedia objects. This paper presents what is, to the best of our knowledge, the most comprehensive study to date of the relative quality of textual features in social media. We analyze four features (title, tags, description and user comments) in four popular applications: YouTube, Yahoo! Video, LastFM and CiteULike. Our study is based on an extensive characterization of data crawled from the four applications with respect to usage, amount and semantics of content, descriptive and discriminative power, and content and information diversity across features. It also includes a series of object classification and tag recommendation experiments, which serve as case studies of two important information retrieval tasks and show how these tasks are affected by the quality of the textual features. We analyze classification and recommendation effectiveness in light of our characterization results. Our findings provide valuable insights for future research and for the design of Web 2.0 applications and services.