As the number and size of large timestamped collections (e.g. sequences of digitized newspapers, periodicals, blogs) increase, the problem of efficiently indexing and searching such data becomes more important. Term burstiness has been extensively researched as a mechanism to address event detection in the context of such collections. In this paper, we explore how burstiness information can be further utilized to enhance the search process. We present a novel approach to model the burstiness of a term, using discrepancy theory concepts. This allows us to build a parameter-free, linear-time approach to identify the time intervals of maximum burstiness for a given term. Finally, we describe the first burstiness-driven search framework and thoroughly evaluate our approach in the context of different scenarios.
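To make the idea of a parameter-free, linear-time burst interval concrete, the sketch below shows one way such a computation could look. It is an illustration under stated assumptions, not the paper's actual model: the abstract does not give the scoring function, so we assume each timestamp is scored by the difference between the term's observed share of occurrences and a uniform baseline of 1/n, and we find the highest-scoring contiguous interval with a single maximum-sum scan. The function name `max_burst_interval` and the uniform baseline are hypothetical choices made for this example.

```python
from typing import List, Tuple


def max_burst_interval(freqs: List[float]) -> Tuple[int, int, float]:
    """Return (start, end, score) for the contiguous interval whose summed
    discrepancy is largest. Indices are inclusive.

    ASSUMPTION (not from the source): the discrepancy at timestamp t is
    freqs[t] / sum(freqs) - 1 / len(freqs), i.e. the observed share of the
    term's occurrences minus the share expected under a uniform baseline.
    """
    n = len(freqs)
    total = sum(freqs)
    if n == 0 or total <= 0:
        raise ValueError("need a non-empty sequence with positive total frequency")

    baseline = 1.0 / n
    best_score = float("-inf")
    best_start = best_end = 0
    run_score = 0.0   # score of the best interval ending at the current timestamp
    run_start = 0

    # Single left-to-right pass (maximum-sum contiguous interval), so O(n) time.
    for t, f in enumerate(freqs):
        score = f / total - baseline
        if run_score <= 0:
            # A non-positive prefix cannot help; start a new interval here.
            run_score = score
            run_start = t
        else:
            run_score += score
        if run_score > best_score:
            best_score = run_score
            best_start, best_end = run_start, t

    return best_start, best_end, best_score


if __name__ == "__main__":
    # Hypothetical daily counts of one term; the spike covers days 3 and 4.
    daily_counts = [1, 1, 1, 9, 8, 1, 1, 1]
    print(max_burst_interval(daily_counts))
```

The scan takes no threshold or window-size argument, which is the sense in which this sketch is parameter-free; a different baseline (for example, one proportional to the collection's overall volume per timestamp) would change only the `score` line.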