Automated tracking of events from chronologically ordered document streams is a new challenge for statistical text classification. Existing learning techniques must be adapted or improved in order to effectively handle difficult situations where the number of positive training instances per event is extremely small, the majority of training documents are unlabelled, and most of the events have a short duration in time. We adapted several supervised text categorization methods, specifically several new variants of the k-Nearest Neighbor (kNN) algorithm and a Rocchio approach, to track events. All of these methods showed significant improvement (up to 71% reduction in weighted error rates) over the performance of the original kNN algorithm on TDT benchmark collections, making kNN among the top-performing systems in the recent TDT3 official evaluation. Furthermore, by combining these methods, we significantly reduced the variance in performance of our event tracking system over different data collections, suggesting a robust solution for parameter optimization.
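To make the setting above concrete, here is a minimal sketch of scoring an incoming story against one event with a kNN-style scorer and a Rocchio-style scorer when only one positive training story is available. It is an illustration under stated assumptions, not the variants evaluated in the paper: the raw term-frequency vectors, the signed-similarity kNN score, the `gamma` weight on negative examples, and the simple averaging in `combined_score` are all hypothetical choices made for this example.

```python
import math
from collections import Counter


def tf_vector(text):
    """Bag-of-words term-frequency vector (no IDF or stemming; an assumption)."""
    return dict(Counter(text.lower().split()))


def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b[t] for t, w in a.items() if t in b)
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_score(doc, training, k=3):
    """Sum the similarities of the k nearest training stories,
    counting positives as +sim and negatives as -sim (hypothetical scheme)."""
    ranked = sorted(
        ((cosine(doc, vec), is_pos) for vec, is_pos in training),
        key=lambda pair: pair[0],
        reverse=True,
    )[:k]
    return sum(sim if is_pos else -sim for sim, is_pos in ranked)


def rocchio_score(doc, training, gamma=0.25):
    """Cosine similarity to a prototype: positive centroid minus
    gamma times the negative centroid (gamma is an assumed value)."""
    positives = [vec for vec, is_pos in training if is_pos]
    negatives = [vec for vec, is_pos in training if not is_pos]
    prototype = {}
    for vec in positives:
        for term, weight in vec.items():
            prototype[term] = prototype.get(term, 0.0) + weight / len(positives)
    for vec in negatives:
        for term, weight in vec.items():
            prototype[term] = prototype.get(term, 0.0) - gamma * weight / len(negatives)
    return cosine(doc, prototype)


def combined_score(doc, training):
    """Average the two scores; one simple way to combine methods (an assumption)."""
    return 0.5 * (knn_score(doc, training) + rocchio_score(doc, training))


# One positive story for the event, two off-topic negatives.
training = [
    (tf_vector("earthquake hits city rescue teams"), True),
    (tf_vector("stock market rises today"), False),
    (tf_vector("football team wins final"), False),
]
on_topic = tf_vector("rescue teams search after earthquake")
off_topic = tf_vector("stock market falls")

print(combined_score(on_topic, training) > combined_score(off_topic, training))
```

A story would be flagged as on-topic when its score exceeds a threshold; choosing that threshold is a separate tuning step that this sketch leaves out.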