Attention, intentions, and the structure of discourse
Computational Linguistics
IEEE/ACM Transactions on Networking (TON)
Agents that reduce work and information overload
Communications of the ACM
ACM Transactions on Information Systems (TOIS) - Special issue on social science perspectives on IS
A rule-based message filtering system
ACM Transactions on Information Systems (TOIS)
Email overload: exploring personal information management of email
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Email—the good, the bad, and the ugly
Communications of the ACM
Threading electronic mail: a preliminary study
Information Processing and Management: an International Journal - Special issue: methods and tools for the automatic construction of hypertext
EmVis—a visual e-mail analysis tool
NPIV '97 Proceedings of the 1997 workshop on New paradigms in information visualization and manipulation
Concept features in Re:Agent, an intelligent Email agent
AGENTS '98 Proceedings of the second international conference on Autonomous agents
TOPIC ISLANDS—a wavelet-based text visualization system
Proceedings of the conference on Visualization '98
A study of retrospective and on-line event detection
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
On-line new event detection and tracking
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
MailCat: an intelligent assistant for organizing e-mail
Proceedings of the third annual conference on Autonomous Agents
Principles of mixed-initiative user interfaces
Proceedings of the SIGCHI conference on Human Factors in Computing Systems
Statistical Models for Text Segmentation
Machine Learning - Special issue on natural language learning
Event detection from time series data
KDD '99 Proceedings of the fifth ACM SIGKDD international conference on Knowledge discovery and data mining
Extracting significant time varying features from text
Proceedings of the eighth international conference on Information and knowledge management
The Hierarchical Hidden Markov Model: Analysis and Applications
Machine Learning
Mail-by-example: a visual query interface for email management
AVI '00 Proceedings of the working conference on Advanced visual interfaces
Automatic generation of overview timelines
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Improving text categorization methods for event tracking
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Principles of data mining
Finding simple intensity descriptions from event sequence data
Proceedings of the seventh ACM SIGKDD international conference on Knowledge discovery and data mining
Querying and mining data streams: you only get one look a tutorial
Proceedings of the 2002 ACM SIGMOD international conference on Management of data
ICDE '95 Proceedings of the Eleventh International Conference on Data Engineering
Incremental Learning in SwiftFile
ICML '00 Proceedings of the Seventeenth International Conference on Machine Learning
Finding Frequent Items in Data Streams
ICALP '02 Proceedings of the 29th International Colloquium on Automata, Languages and Programming
ThemeRiver: Visualizing Theme Changes over Time
INFOVIS '00 Proceedings of the IEEE Symposium on Information Vizualization 2000
Visualizing Sequential Patterns for Text Mining
INFOVIS '00 Proceedings of the IEEE Symposium on Information Vizualization 2000
Knowledge discovery in time series databases
IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics
Proceedings of the 15th international conference on World Wide Web
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
ACM Transactions on the Web (TWEB)
Towards automatic extraction of event and place semantics from flickr tags
SIGIR '07 Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval
BlogScope: a system for online analysis of high volume text streams
VLDB '07 Proceedings of the 33rd international conference on Very large data bases
Efficient concept clustering for ontology learning using an event life cycle on the web
Proceedings of the 2008 ACM symposium on Applied computing
Towards mining past content of Web pages
The New Review of Hypermedia and Multimedia - Web Archiving
A bayesian mixture model with linear regression mixing proportions
Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining
Event detection with common user interests
Proceedings of the 10th ACM workshop on Web information and data management
Methods for extracting place semantics from Flickr tags
ACM Transactions on the Web (TWEB)
Searching for events in the blogosphere
Proceedings of the 18th international conference on World wide web
Adaptive burst detection in a stream engine
Proceedings of the 2009 ACM symposium on Applied Computing
Measuring evolving data streams' behavior through their intrinsic dimension
New Generation Computing
Detecting Temporal Trends of Technical Phrases by Using Importance Indices and Linear Regression
ISMIS '09 Proceedings of the 18th International Symposium on Foundations of Intelligent Systems
Change (Detection) You Can Believe in: Finding Distributional Shifts in Data Streams
IDA '09 Proceedings of the 8th International Symposium on Intelligent Data Analysis: Advances in Intelligent Data Analysis VIII
Trends Analysis of Topics Based on Temporal Segmentation
DaWaK '09 Proceedings of the 11th International Conference on Data Warehousing and Knowledge Discovery
Constructing comprehensive summaries of large event sequences
ACM Transactions on Knowledge Discovery from Data (TKDD)
STORIES in Time: A Graph-Based Interface for News Tracking and Discovery
WI-IAT '09 Proceedings of the 2009 IEEE/WIC/ACM International Joint Conference on Web Intelligence and Intelligent Agent Technology - Volume 03
Event detection from flickr data through wavelet-based spatial analysis
Proceedings of the 18th ACM conference on Information and knowledge management
Comparing Temporal Behavior of Phrases on Multiple Indexes with a Burst Word Detection Method
RSFDGrC '09 Proceedings of the 12th International Conference on Rough Sets, Fuzzy Sets, Data Mining and Granular Computing
Detecting temporal patterns of technical phrases by using importance indices in a research documents
SMC'09 Proceedings of the 2009 IEEE international conference on Systems, Man and Cybernetics
APWeb/WAIM'07 Proceedings of the joint 9th Asia-Pacific web and 8th international conference on web-age information management conference on Advances in data and web management
Burst detection from multiple data streams: a network-based approach
IEEE Transactions on Systems, Man, and Cybernetics, Part C: Applications and Reviews
Identification, Modelling and Prediction of Non-periodic Bursts in Workloads
CCGRID '10 Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing
Context comparison of bursty events in web search and online media
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Context modeling for ranking and tagging bursty features in text streams
CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Using cohesive subgroups for analyzing the evolution of the friend view mobile social network
UIC'10 Proceedings of the 7th international conference on Ubiquitous intelligence and computing
Extracting hot spots of topics from time-stamped documents
Data & Knowledge Engineering
The detection of scene features in Flickr
ICWL'10 Proceedings of the 2010 international conference on New horizons in web-based learning
Evaluating a temporal pattern detection method for finding research keys in bibliographical data
Transactions on rough sets XIV
A time-dependent topic model for multiple text streams
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Learning the funding momentum of research projects
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part II
Finding critical thresholds for defining bursts
DaWaK'11 Proceedings of the 13th international conference on Data warehousing and knowledge discovery
Analyzing word frequencies in large text corpora using inter-arrival times and bootstrapping
ECML PKDD'11 Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part II
ETree: Effective and Efficient Event Modeling for Real-Time Online Social Media Networks
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Robustness of change detection algorithms
IDA'11 Proceedings of the 10th international conference on Advances in intelligent data analysis X
Proceedings of the ACM 2012 conference on Computer Supported Cooperative Work
State aggregation in higher order markov chains for finding online communities
IDEAL'06 Proceedings of the 7th international conference on Intelligent Data Engineering and Automated Learning
Using query profiles for clarification
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Emotional reactions to real-world events in social networks
PAKDD'11 Proceedings of the 15th international conference on New Frontiers in Applied Data Mining
Time-aware visualization of document collections
Proceedings of the 27th Annual ACM Symposium on Applied Computing
Real-World behavior analysis through a social media lens
SBP'12 Proceedings of the 5th international conference on Social Computing, Behavioral-Cultural Modeling and Prediction
Coevolution of network structure and content
Proceedings of the 3rd Annual ACM Web Science Conference
A novel burst-based text representation model for scalable event detection
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
Identifying event-related bursts via social media activities
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Generating event storylines from microblogs
Proceedings of the 21st ACM international conference on Information and knowledge management
Temporal web dynamics and its application to information retrieval
Proceedings of the sixth ACM international conference on Web search and data mining
Exploiting user comments for audio-visual content indexing and retrieval
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Detecting real-time burst topics in microblog streams: how sentiment can help
Proceedings of the 22nd international conference on World Wide Web companion
Proceedings of the 22nd international conference on World Wide Web
TopicFlow: visualizing topic alignment of Twitter data over time
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Evolution of communities on Twitter and the role of their leaders during emergencies
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Proceedings of the ACM SIGKDD Workshop on Outlier Detection and Description
Methods for extracting place semantics from Flickr tags
ACM Transactions on the Web (TWEB)
Understanding in-video dropouts and interaction peaks inonline lecture videos
Proceedings of the first ACM conference on Learning @ scale conference
Investigating query bursts in a web search engine
Web Intelligence and Agent Systems
Mapping ICT knowledge infrastructure in South Asia
Scientometrics
Story graphs: Tracking document set evolution using dynamic graphs
Intelligent Data Analysis - Dynamic Networks and Knowledge Discovery
Hi-index | 0.00 |
A fundamental problem in text data mining is to extract meaningful structure from document streams that arrive continuously over time. E-mail and news articles are two natural examples of such streams, each characterized by topics that appear, grow in intensity for a period of time, and then fade away. The published literature in a particular research field can be seen to exhibit similar phenomena over a much longer time scale. Underlying much of the text mining work in this area is the following intuitive premise—that the appearance of a topic in a document stream is signaled by a “burst of activity,” with certain features rising sharply in frequency as the topic emerges.The goal of the present work is to develop a formal approach for modeling such “bursts,” in such a way that they can be robustly and efficiently identified, and can provide an organizational framework for analyzing the underlying content. The approach is based on modeling the stream using an infinite-state automaton, in which bursts appear naturally as state transitions; it can be viewed as drawing an analogy with models from queueing theory for bursty network traffic. The resulting algorithms are highly efficient, and yield a nested representation of the set of bursts that imposes a hierarchical structure on the overall stream. Experiments with e-mail and research paper archives suggest that the resulting structures have a natural meaning in terms of the content that gave rise to them.