The vocabulary problem in human-system communication
Communications of the ACM
Combining the evidence of multiple query representations for information retrieval
TREC-2 Proceedings of the second conference on Text retrieval conference
Authoritative sources in a hyperlinked environment
Journal of the ACM (JACM)
Relevance score normalization for metasearch
Proceedings of the tenth international conference on Information and knowledge management
Topic Detection and Tracking: Event-Based Information Organization
Topic Detection and Tracking: Event-Based Information Organization
System Fusion for Improving Performance in Information Retrieval Systems
ITCC '01 Proceedings of the International Conference on Information Technology: Coding and Computing
Information diffusion through blogspace
Proceedings of the 13th international conference on World Wide Web
Fusion of effective retrieval strategies in the same information retrieval system
Journal of the American Society for Information Science and Technology
Structure and evolution of blogspace
Communications of the ACM - The Blogosphere
Combining the language model and inference network approaches to retrieval
Information Processing and Management: an International Journal - Special issue: Bayesian networks and information retrieval
Linear discriminant model for information retrieval
Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval
Similarity measures for tracking information flow
Proceedings of the 14th ACM international conference on Information and knowledge management
Early versus late fusion in semantic video analysis
Proceedings of the 13th annual ACM international conference on Multimedia
Early versus late fusion in semantic video analysis
Proceedings of the 13th annual ACM international conference on Multimedia
Visualization of News Distribution in Blog Space
WI-IATW '06 Proceedings of the 2006 IEEE/WIC/ACM international conference on Web Intelligence and Intelligent Agent Technology
Wikify!: linking documents to encyclopedic knowledge
Proceedings of the sixteenth ACM conference on Conference on information and knowledge management
Why we twitter: understanding microblogging usage and communities
Proceedings of the 9th WebKDD and 1st SNA-KDD 2007 workshop on Web mining and social network analysis
Learning to link with wikipedia
Proceedings of the 17th ACM conference on Information and knowledge management
Meme-tracking and the dynamics of the news cycle
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Studying the effects of noisy text on text mining applications
Proceedings of The Third Workshop on Analytics for Noisy Unstructured Text Data
Why are they excited?: identifying and explaining spikes in blog mood levels
EACL '06 Proceedings of the Eleventh Conference of the European Chapter of the Association for Computational Linguistics: Posters & Demonstrations
Using twitter to recommend real-time topical news
Proceedings of the third ACM conference on Recommender systems
Predicting the volume of comments on online news stories
Proceedings of the 18th ACM conference on Information and knowledge management
A generative blog post retrieval model that uses query expansion based on external collections
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Learning similarity metrics for event identification in social media
Proceedings of the third ACM international conference on Web search and data mining
Early online identification of attention gathering items in social media
Proceedings of the third ACM international conference on Web search and data mining
What is Twitter, a social network or a news media?
Proceedings of the 19th international conference on World wide web
News article ranking: leveraging the wisdom of bloggers
RIAO '10 Adaptivity, Personalization and Fusion of Heterogeneous Information
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Hypergeometric language models for republished article finding
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Linking archives using document enrichment and term selection
TPDL'11 Proceedings of the 15th international conference on Theory and practice of digital libraries: research and advanced technology for digital libraries
Towards high-quality semantic entity detection over online forums
SocInfo'11 Proceedings of the Third international conference on Social informatics
Adding semantics to microblog posts
Proceedings of the fifth ACM international conference on Web search and data mining
Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
Language intent models for inferring user browsing behavior
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Exploiting temporal topic models in social media retrieval
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Is news sharing on Twitter ideologically biased?
Proceedings of the 2013 conference on Computer supported cooperative work
Late data fusion for microblog search
ECIR'13 Proceedings of the 35th European conference on Advances in Information Retrieval
Challenges and opportunities of local journalism: a case study of the 2012 Korean general election
Proceedings of the 5th Annual ACM Web Science Conference
Discovering filter keywords for company name disambiguation in twitter
Expert Systems with Applications: An International Journal
Proceedings of the 22nd international conference on World Wide Web companion
Journal of Network and Computer Applications
Hi-index | 0.00 |
Much of what is discussed in social media is inspired by events in the news and, vice versa, social media provide us with a handle on the impact of news events. We address the following linking task: given a news article, find social media utterances that implicitly reference it. We follow a three-step approach: we derive multiple query models from a given source news article, which are then used to retrieve utterances from a target social media index, resulting in multiple ranked lists that we then merge using data fusion techniques. Query models are created by exploiting the structure of the source article and by using explicitly linked social media utterances that discuss the source article. To combat query drift resulting from the large volume of text, either in the source news article itself or in social media utterances explicitly linked to it, we introduce a graph-based method for selecting discriminative terms. For our experimental evaluation, we use data from Twitter, Digg, Delicious, the New York Times Community, Wikipedia, and the blogosphere to generate query models. We show that different query models, based on different data sources, provide complementary information and manage to retrieve different social media utterances from our target index. As a consequence, data fusion methods manage to significantly boost retrieval performance over individual approaches. Our graph-based term selection method is shown to help improve both effectiveness and efficiency.