Event tracking based on domain dependency
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Summarization as feature selection for text categorization
Proceedings of the tenth international conference on Information and knowledge management
Topic Detection and Tracking: Event-Based Information Organization
Topic Detection and Tracking: Event-Based Information Organization
A critical examination of TDT's cost function
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Topic detection and tracking evaluation overview
Topic detection and tracking
A summarization system for Chinese news from multiple sources
Journal of the American Society for Information Science and Technology
Entity-based cross-document coreferencing using the Vector Space Model
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Web-page classification through summarization
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Query based event extraction along a timeline
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Using coreference chains for text summarization
CorefApp '99 Proceedings of the Workshop on Coreference and its Applications
Using coreference for question answering
CorefApp '99 Proceedings of the Workshop on Coreference and its Applications
Hi-index | 0.00 |
Unification of the terminology usages which captures more term semantics is useful for event clustering. This paper proposes a metric of normalized chain edit distance to mine controlled vocabulary from cross-document co-reference chains incrementally. A novel threshold model that incorporates time decay function and spanning window utilizes the controlled vocabulary for event clustering on streaming news. The experimental results show that the proposed system has 16% performance increase compared to the baseline system and 6% performance increase compared to the system without introducing controlled vocabulary.