An algorithm for suffix stripping
Readings in information retrieval
On-line new event detection and tracking
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Novelty and redundancy detection in adaptive filtering
SIGIR '02 Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval
Natural language processing in support of decision-making: phrases and part-of-speech tagging
Information Processing and Management: an International Journal
Topic-conditioned novelty detection
Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining
Retrieval and novelty detection at the sentence level
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
A System for new event detection
Proceedings of the 26th annual international ACM SIGIR conference on Research and development in informaion retrieval
First story detection using a composite document representation
HLT '01 Proceedings of the first international conference on Human language technology research
Novelty detection based on sentence level patterns
Proceedings of the 14th ACM international conference on Information and knowledge management
Chinese Word Segmentation and Named Entity Recognition: A Pragmatic Approach
Computational Linguistics
Chinese Named Entity Recognition combining a statistical model with human knowledge
MultiNER '03 Proceedings of the ACL 2003 workshop on Multilingual and mixed-language named entity recognition - Volume 15
The nature of novelty detection
Information Retrieval
A fast, accurate deterministic parser for Chinese
ACL-44 Proceedings of the 21st International Conference on Computational Linguistics and the 44th annual meeting of the Association for Computational Linguistics
An information-pattern-based approach to novelty detection
Information Processing and Management: an International Journal
Machine learning techniques for business blog search and mining
Expert Systems with Applications: An International Journal
Combining named entities and tags for novel sentence detection
Proceedings of the WSDM '09 Workshop on Exploiting Semantic Annotations in Information Retrieval
Sentence-Level Novelty Detection in English and Malay
PAKDD '09 Proceedings of the 13th Pacific-Asia Conference on Advances in Knowledge Discovery and Data Mining
Design and development of a mobile peer-to-peer social networking application
Expert Systems with Applications: An International Journal
Evaluation of novelty metrics for sentence-level novelty mining
Information Sciences: an International Journal
Database optimization for novelty detection
ICICS'09 Proceedings of the 7th international conference on Information, communications and signal processing
Detecting novel business blogs
ICICS'09 Proceedings of the 7th international conference on Information, communications and signal processing
Multilingual novelty detection
Expert Systems with Applications: An International Journal
An intelligent system for sentence retrieval and novelty mining
International Journal of Knowledge Engineering and Data Mining
Design of an intelligent novelty detection application
International Journal of Innovative Computing and Applications
Database optimization for novelty mining of business blogs
Expert Systems with Applications: An International Journal
Multilingual sentence categorization and novelty mining
Information Processing and Management: an International Journal
International Journal of Advanced Pervasive and Ubiquitous Computing
Probabilistic Models for Social Media Mining
International Journal of Information Technology and Web Engineering
Adaptable Services for Novelty Mining
International Journal of Systems and Service-Oriented Engineering
Hi-index | 0.00 |
Automated mining of novel documents or sentences from chronologically ordered documents or sentences is an open challenge in text mining. In this paper, we describe the preprocessing techniques for detecting novel Chinese text and discuss the influence of different Part of Speech (POS) filtering rules on the detection performance. Experimental results on APWSJ and TREC 2004 Novelty Track data show that the Chinese novelty mining performance is quite different when choosing two dissimilar POS filtering rules. Thus, the selection of words to represent Chinese text is of vital importance to the success of the Chinese novelty mining. Moreover, we compare the Chinese novelty mining performance with that of English and investigate the impact of preprocessing steps on detecting novel Chinese text, which will be very helpful for developing a Chinese novelty mining system.