Novelty and redundancy detection in adaptive filtering
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Retrieval and novelty detection at the sentence level
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
The relationship between Precision-Recall and ROC curves
ICML '06 Proceedings of the 23rd international conference on Machine learning
The nature of novelty detection
Information Retrieval
An information-pattern-based approach to novelty detection
Information Processing and Management: an International Journal
Combining named entities and tags for novel sentence detection
Proceedings of the WSDM '09 Workshop on Exploiting Semantic Annotations in Information Retrieval
Expert Systems with Applications: An International Journal
An intelligent system for sentence retrieval and novelty mining
International Journal of Knowledge Engineering and Data Mining
Design of an intelligent novelty detection application
International Journal of Innovative Computing and Applications
Dimensionality reduction for blog tag mining
International Journal of Web Engineering and Technology
Chinese categorization and novelty mining
PAKDD'11 Proceedings of the 15th Pacific-Asia conference on Advances in knowledge discovery and data mining - Volume Part II
Probabilistic Models for Social Media Mining
International Journal of Information Technology and Web Engineering
Adaptable Services for Novelty Mining
International Journal of Systems and Service-Oriented Engineering
Hi-index | 12.05 |
With the abundance of raw text documents available on the internet, many articles contain redundant information. Novel sentence mining can discover novel, yet relevant, sentences given a specific topic defined by a user. In real-time novelty mining, an important issue is to how to select a suitable novelty metric that quantitatively measures the novelty of a particular sentence. To utilize the merits of different metrics, a blended metric is proposed by combining both cosine similarity and new word count metrics. The blended metric has been tested on TREC 2003 and TREC 2004 Novelty Track data. The experimental results show that the blended metric can perform generally better on topics with different ratios of novelty, which is useful for real-time novelty mining in topics with varying degrees of novelty.