Short text classification in Twitter to improve information filtering

Authors:
Bharath Sriram;Dave Fuhry;Engin Demir;Hakan Ferhatosmanoglu;Murat Demirbas
Affiliations:
Ohio State University, Columbus, OH, USA;Ohio State University, Columbus, OH, USA;Ohio State University, Columbus, OH, USA;Ohio State University, Columbus, OH, USA;University at Buffalo, SUNY, NY, USA
Venue:
Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval
Year:
2010

Abstract

In microblogging services such as Twitter, users may become overwhelmed by the raw data. One solution to this problem is to classify short text messages. Because short texts do not provide sufficient word occurrences, traditional classification methods such as "Bag-Of-Words" have limitations. To address this problem, we propose using a small set of domain-specific features extracted from the author's profile and text. The proposed approach effectively classifies the text into a predefined set of generic classes such as News, Events, Opinions, Deals, and Private Messages.
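
The idea of replacing a sparse bag-of-words vector with a handful of domain-specific signals can be illustrated with a minimal sketch. The abstract does not enumerate the features, so everything below is an assumption made for illustration: the feature names, the regular expressions, the small word list, the `Tweet` type, and the `NEWS_AUTHORS` set are hypothetical and are not taken from the paper.

```python
import re
from dataclasses import dataclass

# All patterns and lists below are hypothetical, chosen only to illustrate
# "a small set of domain-specific features"; they are not the paper's features.
MENTION = re.compile(r"@\w+")
URL = re.compile(r"https?://\S+")
CURRENCY_OR_PERCENT = re.compile(r"[$€£%]")
TIME_PHRASE = re.compile(
    r"\b(today|tonight|tomorrow|\d{1,2}(:\d{2})?\s?(am|pm))\b", re.IGNORECASE
)
OPINION_WORDS = {"love", "hate", "great", "awful", "best", "worst"}
NEWS_AUTHORS = {"example_news"}  # stand-in for information from author profiles


@dataclass
class Tweet:
    author: str
    text: str


def extract_features(tweet: Tweet) -> dict[str, int]:
    """Map a tweet to a few binary features instead of a sparse word vector."""
    text = tweet.text
    words = re.findall(r"[a-z']+", text.lower())
    return {
        # Author-derived signal (assumed proxy for a News-like source).
        "author_in_news_list": int(tweet.author.lower() in NEWS_AUTHORS),
        # "@user" at the very start often marks a directed, private-style message.
        "starts_with_mention": int(bool(MENTION.match(text))),
        # "@user" anywhere after the first character.
        "has_inner_mention": int(any(m.start() > 0 for m in MENTION.finditer(text))),
        # Currency or percent signs may hint at deals.
        "has_currency_or_percent": int(bool(CURRENCY_OR_PERCENT.search(text))),
        # Time expressions may hint at events.
        "has_time_phrase": int(bool(TIME_PHRASE.search(text))),
        # Opinionated vocabulary may hint at opinions.
        "has_opinion_word": int(any(w in OPINION_WORDS for w in words)),
        "has_url": int(bool(URL.search(text))),
    }
```

A feature dictionary like this could then be turned into a fixed-length vector and passed to any standard classifier trained on tweets labeled with the target classes (News, Events, Opinions, Deals, Private Messages); the choice of classifier is left open here because the abstract does not specify one.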