PAT-tree-based keyword extraction for Chinese information retrieval
Proceedings of the 20th annual international ACM SIGIR conference on Research and development in information retrieval
Foundations of statistical natural language processing
Foundations of statistical natural language processing
Event tracking based on domain dependency
SIGIR '00 Proceedings of the 23rd annual international ACM SIGIR conference on Research and development in information retrieval
Summarization as feature selection for text categorization
Proceedings of the tenth international conference on Information and knowledge management
A critical examination of TDT's cost function
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Building a Chinese-English wordnet for translingual applications
ACM Transactions on Asian Language Information Processing (TALIP)
Topic detection and tracking: event-based information organization
Topic detection and tracking: event-based information organization
Topic detection and tracking evaluation overview
Topic detection and tracking
An NLP & IR approach to topic detection
Topic detection and tracking
A summarization system for Chinese news from multiple sources
Journal of the American Society for Information Science and Technology
Entity-based cross-document coreferencing using the Vector Space Model
COLING '98 Proceedings of the 17th international conference on Computational linguistics - Volume 1
Identification and classification of proper nouns in Chinese texts
COLING '96 Proceedings of the 16th conference on Computational linguistics - Volume 1
Web-page classification through summarization
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Query based event extraction along a timeline
Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Using coreference chains for text summarization
CorefApp '99 Proceedings of the Workshop on Coreference and its Applications
Using coreference for question answering
CorefApp '99 Proceedings of the Workshop on Coreference and its Applications
Multidocument Summary Generation: Using Informative and Event Words
ACM Transactions on Asian Language Information Processing (TALIP)
Clustering of document collection - A weighting approach
Expert Systems with Applications: An International Journal
A cascaded classification approach to disambiguating polysemous mentions with social chains
Expert Systems with Applications: An International Journal
Event detection with spatial latent Dirichlet allocation
Proceedings of the 11th annual international ACM/IEEE joint conference on Digital libraries
Hi-index | 0.00 |
Unifying terminology usages which captures more term semantics is useful for event clustering. This paper proposes a metric of normalized chain edit distance to mine, incrementally, controlled vocabulary from cross-document coreference chains. Controlled vocabulary is employed to unify terms among different co-reference chains. A novel threshold model that incorporates both time decay function and spanning window uses the controlled vocabulary for event clustering on streaming news. Under correct co-reference chains, the proposed system has a 15.97% performance increase compared to the baseline system, and a 5.93% performance increase compared to the system without introducing controlled vocabulary. Furthermore, a Chinese co-reference resolution system with a chain filtering mechanism is used to experiment on the robustness of the proposed event clustering system. The clustering system using noisy co-reference chains still achieves a 10.55% performance increase compared to the baseline system. The above shows that our approach is promising.