Identifying Document Topics Using the Wikipedia Category Network
WI '06 Proceedings of the 2006 IEEE/WIC/ACM International Conference on Web Intelligence
Robust classification of rare queries using web knowledge
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
Why we twitter: understanding microblogging usage and communities
Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis
Proceedings of the first workshop on Online social networks
Proceedings of the 17th ACM SIGSPATIAL International Conference on Advances in Geographic Information Systems
Is it really about me?: message content in social awareness streams
Proceedings of the 2010 ACM conference on Computer supported cooperative work
Text Classification Methodologies Applied to Micro-Text in Military Chat
ICMLA '09 Proceedings of the 2009 International Conference on Machine Learning and Applications
What is Twitter, a social network or a news media?
Proceedings of the 19th international conference on World wide web
Short text classification in twitter to improve information filtering
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Classifying sentiment in microblogs: is brevity an advantage?
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Tokenizing micro-blogging messages using a text classification approach
AND '10 Proceedings of the fourth workshop on Analytics for noisy unstructured text data
Microblogging after a major disaster in China: a case study of the 2010 Yushu earthquake
Proceedings of the ACM 2011 conference on Computer supported cooperative work
Who says what to whom on twitter
Proceedings of the 20th international conference on World wide web
Who gives a tweet?: evaluating microblog content value
Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work
Hi-index | 0.00 |
Microblogging is now popular among everyday web users in China who have a common name called grass roots in Sina-Weibo, a major microblogging service similar to Twitter. In this paper, we investigate the properties of messages published by this group of users and classify the messages into various topic categories using text classification methods based on the Bag of Words (BOW) model. We find that, using Naïve Bayes, it is possible to achieve high accuracy in recognizing the topic of a message but the popularity of a message cannot be reliably predicated based on its contents. These findings are also further explored with visualization techniques.