With the recent rise in the popularity and size of social media, there is a growing need for systems that can extract useful information from this volume of data. We address the problem of detecting new events in a stream of Twitter posts. To make event detection feasible on web-scale corpora, we present an algorithm based on locality-sensitive hashing that overcomes the limitations of traditional approaches while maintaining competitive results. In particular, a comparison with a state-of-the-art system on the first story detection task shows that we achieve over an order of magnitude speedup in processing time while retaining comparable performance. Event detection experiments on a collection of 160 million Twitter posts show that celebrity deaths are the fastest spreading news on Twitter.
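To illustrate the general idea of using locality-sensitive hashing for first story detection, here is a minimal sketch. It is not the system described in the abstract: it assumes generic random-hyperplane hashing over term-count vectors, and the class name `LshFirstStoryDetector`, its parameters (`num_bits`, `num_tables`, `seed`), and the novelty score (one minus the highest cosine similarity to any earlier document that shares a bucket) are all hypothetical choices made for this example.

```python
import math
import random
from collections import Counter, defaultdict


class LshFirstStoryDetector:
    """Toy first story detector built on random-hyperplane LSH (illustrative only)."""

    def __init__(self, num_bits=8, num_tables=4, seed=0):
        self.num_bits = num_bits
        self.num_tables = num_tables
        self.rng = random.Random(seed)
        # term -> one Gaussian weight per (table, bit); filled lazily as terms appear
        self.planes = {}
        # one hash table per independent set of hyperplanes
        self.tables = [defaultdict(list) for _ in range(num_tables)]

    def _vector(self, text):
        """Unit-length term-count vector stored as a dict."""
        counts = Counter(text.lower().split())
        norm = math.sqrt(sum(c * c for c in counts.values())) or 1.0
        return {term: c / norm for term, c in counts.items()}

    def _weights(self, term):
        if term not in self.planes:
            total = self.num_bits * self.num_tables
            self.planes[term] = [self.rng.gauss(0.0, 1.0) for _ in range(total)]
        return self.planes[term]

    def _keys(self, vec):
        """One bucket key per table: the sign pattern of the hyperplane dot products."""
        total = self.num_bits * self.num_tables
        dots = [0.0] * total
        for term, value in vec.items():
            weights = self._weights(term)
            for i in range(total):
                dots[i] += value * weights[i]
        bits = [d >= 0.0 for d in dots]
        return [
            tuple(bits[t * self.num_bits:(t + 1) * self.num_bits])
            for t in range(self.num_tables)
        ]

    def process(self, text):
        """Score a document against earlier ones in its buckets, then index it.

        Returns a novelty score in [0, 1]; higher means more likely a first story.
        """
        vec = self._vector(text)
        keys = self._keys(vec)
        best = 0.0
        # Compare only with documents that collide in at least one table,
        # instead of scanning every document seen so far.
        for table, key in zip(self.tables, keys):
            for other in table[key]:
                sim = sum(v * other.get(term, 0.0) for term, v in vec.items())
                best = max(best, sim)
        for table, key in zip(self.tables, keys):
            table[key].append(vec)
        return 1.0 - best


detector = LshFirstStoryDetector()
for post in ["earthquake hits city", "earthquake hits city", "football match tonight"]:
    print(round(detector.process(post), 3), post)
```

The point of the sketch is that each incoming post is compared only with earlier posts that share a bucket, so the cost per post depends on bucket sizes rather than on the full history. A real streaming system would also need to bound bucket sizes and memory, which this example does not attempt.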