This paper presents a novel approach to story link detection, where the goal is to determine whether two news stories are linked, i.e., discuss the same event. The work departs from prior work in that we measure similarity at two distinct levels of textual organization, the document and its collection, and combine the scores from both levels to determine how strongly the stories are linked. Experiments on the TDT-5 corpus show that this approach, which we call a 'two-tier similarity model,' comfortably outperforms conventional approaches such as Clarity-enhanced KL divergence while performing robustly across diverse languages.
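
For readers unfamiliar with the KL-divergence family of baselines mentioned above, the following is a minimal sketch of plain symmetric KL divergence between two smoothed unigram language models. It is an illustration only: it is not the Clarity-enhanced variant and not the two-tier similarity model, neither of which is specified in this abstract. The function names, the add-alpha smoothing, and the whitespace tokenization are all assumptions made for the example.

```python
import math
from collections import Counter


def unigram_model(tokens, vocab, alpha=1.0):
    """Add-alpha smoothed unigram distribution over a shared vocabulary.

    The smoothing scheme is an assumption for this sketch; it keeps every
    probability non-zero so the KL divergence below is always defined.
    """
    counts = Counter(tokens)
    total = len(tokens) + alpha * len(vocab)
    return {w: (counts[w] + alpha) / total for w in vocab}


def kl_divergence(p, q):
    """KL(p || q) for two distributions defined over the same keys."""
    return sum(p[w] * math.log(p[w] / q[w]) for w in p)


def symmetric_kl(tokens_a, tokens_b, alpha=1.0):
    """Symmetric KL divergence between two stories (lower = more similar)."""
    vocab = set(tokens_a) | set(tokens_b)
    p = unigram_model(tokens_a, vocab, alpha)
    q = unigram_model(tokens_b, vocab, alpha)
    return kl_divergence(p, q) + kl_divergence(q, p)


if __name__ == "__main__":
    # Hypothetical toy stories, tokenized on whitespace.
    story_1 = "earthquake strikes coastal city overnight".split()
    story_2 = "coastal city hit by strong earthquake".split()
    story_3 = "parliament votes on new budget".split()
    print(symmetric_kl(story_1, story_2))  # overlapping vocabulary
    print(symmetric_kl(story_1, story_3))  # disjoint vocabulary
```

In a link detection setting, a score like this would typically be compared against a threshold to decide whether two stories are linked; how the threshold is chosen, and how a collection-level score would be combined with it, is left open here because the abstract does not say.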