Microblogs such as Twitter have become an increasingly popular source of real-time information. Users tend to keep up to date with developments in the topics they are interested in. In this paper, we present an effective real-time tweet filtering system for topic tracking in social media streams. We combine a background corpus with a foreground corpus to handle the cold-start problem. We then build a Content Model to describe the characteristics of tweets: we use link information to expand each tweet's content and enrich its semantic information, and we analyze the influence of tweet quality, measured by a group of well-defined symbols. Moreover, a Pseudo Relevance Feedback approach triggered by a fixed-width temporal sliding window adapts the system to changes in topics over time. Experimental results on the Tweet11 corpus indicate that our system achieves good performance on both the T11SU and F-0.5 metrics, and that it outperforms the best system in the TREC 2012 real-time filtering pilot task.
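For readers unfamiliar with the two evaluation metrics, the following is a minimal sketch of how they are commonly computed from per-topic counts. It assumes the usual TREC filtering definitions, which the abstract does not spell out: a linear utility of 2·TP − FP, normalized by the maximum achievable utility 2·(TP + FN), with a lower bound of −0.5 before rescaling to [0, 1], and F-0.5 as the F-beta measure with beta = 0.5. The function names `f_beta` and `t11su` and the `min_nu` parameter are illustrative, not taken from the paper.

```python
def f_beta(tp: int, fp: int, fn: int, beta: float = 0.5) -> float:
    """F-beta from raw counts; beta < 1 weights precision above recall."""
    b2 = beta * beta
    denom = (1 + b2) * tp + b2 * fn + fp
    # Convention assumed here: score 0.0 when nothing is retrieved or relevant.
    return (1 + b2) * tp / denom if denom else 0.0


def t11su(tp: int, fp: int, fn: int, min_nu: float = -0.5) -> float:
    """Scaled utility, assuming the linear utility T11U = 2*TP - FP."""
    max_u = 2 * (tp + fn)  # utility of a perfect run on this topic
    if max_u == 0:
        raise ValueError("utility is undefined for a topic with no relevant tweets")
    nu = (2 * tp - fp) / max_u  # normalized utility, at most 1.0
    # Clip very poor runs at min_nu, then rescale to the range [0, 1].
    return (max(nu, min_nu) - min_nu) / (1 - min_nu)


if __name__ == "__main__":
    # Hypothetical counts for one topic: 8 relevant tweets delivered,
    # 4 non-relevant tweets delivered, 2 relevant tweets missed.
    print(round(f_beta(8, 4, 2), 4))
    print(round(t11su(8, 4, 2), 4))
```

Under these assumed definitions, a system that delivers nothing scores 1/3 on the scaled utility rather than 0, so a run has to beat that baseline before its filtering decisions count as useful.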