Attention, intentions, and the structure of discourse
Computational Linguistics
Term-weighting approaches in automatic text retrieval
Information Processing and Management: an International Journal
A self-organizing semantic map for information retrieval
SIGIR '91 Proceedings of the 14th annual international ACM SIGIR conference on Research and development in information retrieval
Information retrieval
Galaxy of news: an approach to visualizing and understanding expansive news landscapes
UIST '94 Proceedings of the 7th annual ACM symposium on User interface software and technology
Threading electronic mail: a preliminary study
Information Processing and Management: an International Journal - Special issue: methods and tools for the automatic construction of hypertext
Metadata visualization for digital libraries: interactive timeline editing and review
Proceedings of the third ACM conference on Digital libraries
A study of retrospective and on-line event detection
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
On-line new event detection and tracking
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Extracting significant time varying features from text
Proceedings of the eighth international conference on Information and knowledge management
Automatic generation of overview timelines
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Improving text categorization methods for event tracking
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Intelligent information triage
Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Detecting and Browsing Events in Unstructured text
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Learning Approaches for Detecting and Tracking News Events
IEEE Intelligent Systems
Using Stem Rules to Refine Document Retrieval Queries
FQAS '98 Proceedings of the Third International Conference on Flexible Query Answering Systems
Bursty and hierarchical structure in streams
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Visualizing the non-visual: spatial analysis and interaction with information from text documents
INFOVIS '95 Proceedings of the 1995 IEEE Symposium on Information Visualization
A System for new event detection
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
On-Line New Event Detection using Single Pass Clustering TITLE2:
On-Line New Event Detection using Single Pass Clustering TITLE2:
An empirical study of smoothing techniques for language modeling
ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Identifying similarities, periodicities and bursts for online search queries
SIGMOD '04 Proceedings of the 2004 ACM SIGMOD international conference on Management of data
Tracking dynamics of topic trends using a finite mixture model
Proceedings of the tenth ACM SIGKDD international conference on Knowledge discovery and data mining
A probabilistic model for retrospective news event detection
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Discovering evolutionary theme patterns from text: an exploration of temporal text mining
Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining
Parameter free bursty events detection in text streams
VLDB '05 Proceedings of the 31st international conference on Very large data bases
Text Classification without Negative Examples Revisit
IEEE Transactions on Knowledge and Data Engineering
Topics over time: a non-Markov continuous-time model of topical trends
Proceedings of the 12th ACM SIGKDD international conference on Knowledge discovery and data mining
Automatic online news issue construction in web environment
Proceedings of the 17th international conference on World Wide Web
Automatic online news topic ranking using media focus and user attention based on aging theory
Proceedings of the 17th ACM conference on Information and knowledge management
Event detection with common user interests
Proceedings of the 10th ACM workshop on Web information and data management
Extracting Key Entities and Significant Events from Online Daily News
IDEAL '08 Proceedings of the 9th International Conference on Intelligent Data Engineering and Automated Learning
An Automatic Online News Topic Keyphrase Extraction System
WI-IAT '08 Proceedings of the 2008 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
A sentence level probabilistic model for evolutionary theme pattern mining from news corpora
Proceedings of the 2009 ACM symposium on Applied Computing
Early online identification of attention gathering items in social media
Proceedings of the third ACM international conference on Web search and data mining
Emerging topic detection on Twitter based on temporal and social terms evaluation
Proceedings of the Tenth International Workshop on Multimedia Data Mining
Identifying, attributing and describing spatial bursts
Proceedings of the VLDB Endowment
Evolutionary timeline summarization: a balanced optimization framework via iterative substitution
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
ETree: Effective and Efficient Event Modeling for Real-Time Online Social Media Networks
WI-IAT '11 Proceedings of the 2011 IEEE/WIC/ACM International Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Keyword-propagation-based information enriching and noise removal for web news videos
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
EventSearch: a system for event discovery and retrieval on multi-type historical data
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Comparative document summarization via discriminative sentence selection
ACM Transactions on Knowledge Discovery from Data (TKDD)
A novel burst-based text representation model for scalable event detection
ACL '12 Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics: Short Papers - Volume 2
Identifying event-related bursts via social media activities
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
An effective multi-clue fusion approach for web video topic detection
Proceedings of the 20th ACM international conference on Multimedia
Twevent: segment-based event detection from tweets
Proceedings of the 21st ACM international conference on Information and knowledge management
Comparative Document Summarization via Discriminative Sentence Selection
ACM Transactions on Knowledge Discovery from Data (TKDD)
Hi-index | 0.00 |
In this paper, an algorithm called Time Driven Documents-partition (TDD) is proposed to construct an event hierarchy in a text corpus based on a given query. Specifically, assume that a query contains only one feature - Election. Election is directly related to the events such as 2006 US Midterm Elections Campaign, 2004 US Presidential Election Campaign and 2004 Taiwan Presidential Election Campaign, where these events may further be divided into several smaller events (e.g. the 2006 US Midterm Elections Campaign can be broken down into events such as campaign for vote, election results and the resignation of Donald H. Rumsfeld). As such, an event hierarchy is resulted. Our proposed algorithm, TDD, tackles the problem by three major steps: (1)Identify the features that are related to the query according to both the timestamps and the contents of the documents. The features identified are regarded as bursty features; (2) Extract the documents that are highly related to the bursty features based on time; (3) Partition the extracted documents to form events and organize them in a hierarchicalstructure. To the best of our knowledge, there is little works targeting for constructing a feature-based event hierarchy for a text corpus. Practically, event hierarchies can assist us to efficiently locate our target information in a text corpus easily. Again, assume that Election is used for a query. Without an event hierarchy, it is very difficult to identify what are the major events related to it, when do these events happened, as well as the features and the news articles that are related to each of these events. We have archived two-year news articles to evaluate the feasibility of TDD. The encouraging results indicated that TDD is practically sound and highly effective.