Topic tracking is complicated when the stories in the stream occur in multiple languages. Typically, researchers have trained only English topic models because the training stories have been provided in English. In tracking, non-English test stories are then machine-translated into English so that they can be compared with the topic models. We propose a native language hypothesis, which states that comparisons are more effective in the original language of the story. We first test and support the hypothesis for story link detection. For topic tracking, the hypothesis implies that it is preferable to build a separate topic model for each language in the stream. We compare different methods of incrementally building such native language topic models.
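
As a rough illustration of the idea of native language topic models, the following Python sketch keeps one bag-of-words model per language and compares each incoming story with the model in its own language when one exists. Everything in it is an assumption made for illustration and is not taken from the paper: the `NativeLanguageTracker` class, the cosine similarity over raw term counts, the fixed threshold, and the seeding rule (a native model is started from the first story whose English translation matches the English model) are hypothetical choices, and the paper compares several incremental methods that are not reproduced here.

```python
import math
from collections import Counter


def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(weight * b[term] for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class NativeLanguageTracker:
    """Hypothetical tracker with one topic model per language (illustrative only)."""

    def __init__(self, threshold: float = 0.3):
        self.threshold = threshold   # assumed on-topic decision threshold
        self.models = {}             # language code -> Counter of term counts

    def train(self, lang, tokens):
        """Add training-story tokens to the topic model for `lang`."""
        self.models.setdefault(lang, Counter()).update(tokens)

    def track(self, lang, native_tokens, english_tokens):
        """Score one story; return (on_topic, score, language_compared_in)."""
        if self.models.get(lang):
            # A native model exists: compare in the story's original language.
            compared_in = lang
            score = cosine(Counter(native_tokens), self.models[lang])
        else:
            # No native model yet: fall back to the English model and the
            # machine-translated tokens (the conventional baseline).
            compared_in = "en"
            score = cosine(Counter(english_tokens), self.models.get("en", Counter()))
        on_topic = score >= self.threshold
        if on_topic:
            # Assumed adaptation rule: grow (or seed) the native model
            # with the original-language tokens of the on-topic story.
            self.train(lang, native_tokens)
        return on_topic, score, compared_in
```

A short usage example, with invented token lists standing in for tokenized stories and their translations:

```python
tracker = NativeLanguageTracker(threshold=0.3)
tracker.train("en", ["earthquake", "rescue", "earthquake", "aid"])

# First Spanish story: no Spanish model yet, so the translation is compared.
first = tracker.track("es", ["terremoto", "rescate", "ayuda"],
                      ["earthquake", "rescue", "aid"])

# Second Spanish story: now compared against the seeded Spanish model.
second = tracker.track("es", ["terremoto", "ayuda", "víctimas"],
                       ["quake", "help", "victims"])
```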