Novelty detection: the TREC experience

Authors:
Ian Soboroff;Donna Harman
Affiliations:
National Institute of Standards and Technology, Gaithersburg, MD;National Institute of Standards and Technology, Gaithersburg, MD
Venue:
HLT '05 Proceedings of the conference on Human Language Technology and Empirical Methods in Natural Language Processing
Year:
2005

Citing 7
Cited 14

Information filtering and information retrieval: two sides of the same coin?

Communications of the ACM - Special issue on information filtering
Overview of the second text retrieval conference (TREC-2)

TREC-2 Proceedings of the second conference on Text retrieval conference
Variations in relevance judgments and the measurement of retrieval effectiveness

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
The use of MMR, diversity-based reranking for reordering documents and producing summaries

Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
First story detection in TDT is hard

Proceedings of the ninth international conference on Information and knowledge management
Information Retrieval

Information Retrieval
Introduction to the Special Issue: Overview of the TREC Routing and Filtering Tasks

Information Retrieval

Event detection with common user interests

Proceedings of the 10th ACM workshop on Web information and data management
Syntactic Query Models for Restatement Retrieval

SPIRE '09 Proceedings of the 16th International Symposium on String Processing and Information Retrieval
Contrastive summarization: an experiment with consumer reviews

NAACL-Short '09 Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Short Papers
Automatically evaluating content selection in summarization without human models

EMNLP '09 Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing: Volume 1 - Volume 1
Tell me more, not just "more of the same"

Proceedings of the 15th international conference on Intelligent user interfaces
From bursty patterns to bursty facts: The effectiveness of temporal text mining for news

Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
On the relationship between novelty and popularity of user-generated content

CIKM '10 Proceedings of the 19th ACM international conference on Information and knowledge management
Estimating importance features for fact mining: with a case study in biography mining

Large Scale Semantic Access to Content (Text, Image, Video, and Sound)
Repeatable and reliable search system evaluation using crowdsourcing

Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Information Retrieval in the Commentsphere

ACM Transactions on Intelligent Systems and Technology (TIST)
On the Relationship between Novelty and Popularity of User-Generated Content

ACM Transactions on Intelligent Systems and Technology (TIST)
DualSum: a topic-model based approach for update summarization

EACL '12 Proceedings of the 13th Conference of the European Chapter of the Association for Computational Linguistics
An online system with end-user services: mining novelty concepts from tv broadcast subtitles

Proceedings of the 19th ACM SIGKDD international conference on Knowledge discovery and data mining
Story graphs: Tracking document set evolution using dynamic graphs

Intelligent Data Analysis - Dynamic Networks and Knowledge Discovery

Quantified Score

Hi-index	0.00

Visualization

Abstract

A challenge for search systems is to detect not only when an item is relevant to the user's information need, but also when it contains something new which the user has not seen before. In the TREC novelty track, the task was to highlight sentences containing relevant and new information in a short, topical document stream. This is analogous to highlighting key parts of a document for another person to read, and this kind of output can be useful as input to a summarization system. Search topics involved both news events and reported opinions on hot-button subjects. When people performed this task, they tended to select small blocks of consecutive sentences, whereas current systems identified many relevant and novel passages. We also found that opinions are much harder to track than events.