Relevance models for topic detection and tracking

Authors:
Victor Lavrenko;James Allan;Edward DeGuzman;Daniel LaFlamme;Veera Pollard;Stephen Thomas
Affiliations:
University of Massachusetts, Amherst, MA;University of Massachusetts, Amherst, MA;University of Massachusetts, Amherst, MA;University of Massachusetts, Amherst, MA;University of Massachusetts, Amherst, MA;University of Massachusetts, Amherst, MA
Venue:
HLT '02 Proceedings of the second international conference on Human Language Technology Research
Year:
2002

Citing 14
Cited 42

Improving the effectiveness of information retrieval with local context analysis

ACM Transactions on Information Systems (TOIS)
First story detection in TDT is hard

Proceedings of the ninth international conference on Information and knowledge management
Relevance based language models

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
A study of smoothing methods for language models applied to Ad Hoc information retrieval

Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval
Topic Detection and Tracking: Event-Based Information Organization

Topic Detection and Tracking: Event-Based Information Organization
Text Segmentation by Topic

ECDL '97 Proceedings of the First European Conference on Research and Advanced Technology for Digital Libraries
Topic detection and tracking evaluation overview

Topic detection and tracking
Corpora for topic detection and tracking

Topic detection and tracking
Probabilistic approaches to topic detection and tracking

Topic detection and tracking
Statistical models of topical content

Topic detection and tracking
Explorations within topic tracking and detection

Topic detection and tracking
Towards a "Universal dictionary" for multi-language information retrieval applications

Topic detection and tracking
An empirical study of smoothing techniques for language modeling

ACL '96 Proceedings of the 34th annual meeting on Association for Computational Linguistics
Inquery system overview

TIPSTER '93 Proceedings of a workshop on held at Fredericksburg, Virginia: September 19-23, 1993

Capturing term dependencies using a language model based on sentence trees

Proceedings of the eleventh international conference on Information and knowledge management
A System for new event detection

Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
Simple Semantics in Topic Detection and Tracking

Information Retrieval
Language-specific models in multilingual topic tracking

Proceedings of the 27th annual international ACM SIGIR conference on Research and development in information retrieval
Optimizing story link detection is not equivalent to optimizing new event detection

ACL '03 Proceedings of the 41st Annual Meeting on Association for Computational Linguistics - Volume 1
Better than the real thing?: iterative pseudo-query processing using cluster-based language models

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
PageRank without hyperlinks: structural re-ranking using links induced by language models

Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
On relevance distributions: Brief Communication

Journal of the American Society for Information Science and Technology
Tracking dragon-hunters with language models

CIKM '06 Proceedings of the 15th ACM international conference on Information and knowledge management
An information-theoretic approach to automatic evaluation of summaries

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Language model-based document clustering using random walks

HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
Dependency structure language model for topic detection and tracking

Information Processing and Management: an International Journal
Robust techniques for organizing and retrieving spoken documents

EURASIP Journal on Applied Signal Processing
Mining categories for emails via clustering and pattern discovery

Journal of Intelligent Information Systems
Web video topic discovery and tracking via bipartite graph reinforcement model

Proceedings of the 17th international conference on World Wide Web
The opposite of smoothing: a language model approach to ranking query-specific document clusters

Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
A latent variable model for query expansion using the hidden markov model

Proceedings of the 17th ACM conference on Information and knowledge management
Statistical Language Models for Information Retrieval A Critical Review

Foundations and Trends in Information Retrieval
Clusters, language models, and ad hoc information retrieval

ACM Transactions on Information Systems (TOIS)
A relevance model for a data warehouse contextualized with documents

Information Processing and Management: an International Journal
Re-ranking search results using language models of query-specific clusters

Information Retrieval
Towards Automatic Detection of Potentially Important International Events/Phenomena from News Articles at Mostly Domestic News Sites

Proceedings of the 2007 conference on Information Modelling and Knowledge Bases XVIII
Personalized text snippet extraction using statistical language models

Pattern Recognition
Predicting Neighbor Goodness in Collaborative Filtering

FQAS '09 Proceedings of the 8th International Conference on Flexible Query Answering Systems
Topic detection and tracking with spatio-temporal evidence

ECIR'03 Proceedings of the 25th European conference on IR research
Use of topicality and information measures to improve document representation for story link detection

ECIR'07 Proceedings of the 29th European conference on IR research
Improved IR in cohesion model for link detection system

ICDM'07 Proceedings of the 7th industrial conference on Advances in data mining: theoretical aspects and applications
Story link detection based on event model with uneven SVM

AIRS'08 Proceedings of the 4th Asia information retrieval conference on Information retrieval technology
PageRank without hyperlinks: Structural reranking using links induced by language models

ACM Transactions on Information Systems (TOIS)
Two-tier similarity model for story link detection

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Utilizing inter-passage and inter-document similarities for reranking search results

ACM Transactions on Information Systems (TOIS)
Story link detection based on event words

CICLing'11 Proceedings of the 12th international conference on Computational linguistics and intelligent text processing - Volume Part II
The opposite of smoothing: a language model approach to ranking query-specific document clusters

Journal of Artificial Intelligence Research
A study of the integration of passage-, document-, and cluster-based information for re-ranking search results

Information Retrieval
Semi-automatic hot event detection

ADMA'06 Proceedings of the Second international conference on Advanced Data Mining and Applications
A performance prediction approach to enhance collaborative filtering performance

ECIR'2010 Proceedings of the 32nd European conference on Advances in Information Retrieval
Fine-grained topic detection in news search results

Proceedings of the 27th Annual ACM Symposium on Applied Computing
Unsupervised and supervised learning to evaluate event relatedness based on content mining from social-media streams

Expert Systems with Applications: An International Journal
Keyphrase extraction through query performance prediction

Journal of Information Science
Generating event storylines from microblogs

Proceedings of the 21st ACM international conference on Information and knowledge management
Probabilistic co-relevance for query-sensitive similarity measurement in information retrieval

Information Processing and Management: an International Journal
Folktale classification using learning to rank

ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval

Quantified Score

Hi-index	0.00

Visualization

Abstract

We extend relevance modeling to the link detection task of Topic Detection and Tracking (TDT) and show that it substantially improves performance. Relevance modeling, a statistical language modeling technique related to query expansion, is used to enhance the topic model estimate associated with a news story, boosting the probability of words that are associated with the story even when they do not appear in the story. To apply relevance modeling to TDT, it had to be extended to work with stories rather than short queries, and the similarity comparison had to be changed to a modified form of Kullback-Leibler. We demonstrate that relevance models result in very substantial improvements over the language modeling baseline. We also show how the use of relevance modeling makes it possible to choose a single parameter for within- and cross-mode comparisons of stories.