On the approximability of minimizing nonzero variables or unsatisfied relations in linear systems
Theoretical Computer Science
Atomic Decomposition by Basis Pursuit
SIAM Journal on Scientific Computing
Concept decompositions for large sparse text data using clustering
Machine Learning
Topic Detection and Tracking: Event-Based Information Organization
Topic Detection and Tracking: Event-Based Information Organization
Lambertian Reflectance and Linear Subspaces
IEEE Transactions on Pattern Analysis and Machine Intelligence
The Journal of Machine Learning Research
CVPR '05 Proceedings of the 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05) - Volume 1 - Volume 01
A language model approach to keyphrase extraction
MWE '03 Proceedings of the ACL 2003 workshop on Multiword expressions: analysis, acquisition and treatment - Volume 18
Introduction to Information Retrieval
Introduction to Information Retrieval
Robust Face Recognition via Sparse Representation
IEEE Transactions on Pattern Analysis and Machine Intelligence
Twitter power: Tweets as electronic word of mouth
Journal of the American Society for Information Science and Technology
Online Learning for Matrix Factorization and Sparse Coding
The Journal of Machine Learning Research
Emerging topic detection on Twitter based on temporal and social terms evaluation
Proceedings of the Tenth International Workshop on Multimedia Data Mining
Dense error correction via l1-minimization
IEEE Transactions on Information Theory
Streaming first story detection with application to Twitter
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Approximation accuracy, gradient methods, and error bound for structured convex optimization
Mathematical Programming: Series A and B - 20th International Symposium on Mathematical Programming – ISMP 2009
Decomposing background topics from keywords by principal component pursuit
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Probabilistic latent semantic analysis
UAI'99 Proceedings of the Fifteenth conference on Uncertainty in artificial intelligence
Foundations and Trends® in Machine Learning
-SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation
IEEE Transactions on Signal Processing
Highly Robust Error Correction byConvex Programming
IEEE Transactions on Information Theory
HotDigg: finding recent hot topics from digg
DASFAA'12 Proceedings of the 17th international conference on Database Systems for Advanced Applications - Volume Part I
TM-LDA: efficient online modeling of latent topic transitions in social media
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Term Weighting Schemes for Emerging Event Detection
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Towards Topic Trend Prediction on a Topic Evolution Model with Social Connection
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Emerging topic detection for organizations from microblogs
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Proceedings of the 2nd International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications
Entity linking at the tail: sparse signals, unknown entities, and phrase models
Proceedings of the 7th ACM international conference on Web search and data mining
A time-based collective factorization for topic discovery and monitoring in news
Proceedings of the 23rd international conference on World wide web
Novel document detection for massive data streams using distributed dictionary learning
IBM Journal of Research and Development
Hi-index | 0.00 |
Streaming user-generated content in the form of blogs, microblogs, forums, and multimedia sharing sites, provides a rich source of data from which invaluable information and insights maybe gleaned. Given the vast volume of such social media data being continually generated, one of the challenges is to automatically tease apart the emerging topics of discussion from the constant background chatter. Such emerging topics can be identified by the appearance of multiple posts on a unique subject matter, which is distinct from previous online discourse. We address the problem of identifying emerging topics through the use of dictionary learning. We propose a two stage approach respectively based on detection and clustering of novel user-generated content. We derive a scalable approach by using the alternating directions method to solve the resulting optimization problems. Empirical results show that our proposed approach is more effective than several baselines in detecting emerging topics in traditional news story and newsgroup data. We also demonstrate the practical application to social media analysis, based on a study on streaming data from Twitter.