Similarity Search in High Dimensions via Hashing
VLDB '99 Proceedings of the 25th International Conference on Very Large Data Bases
Learning to paraphrase: an unsupervised approach using multiple-sequence alignment
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
What the geeks know: hypertext and the problem of literacy
Proceedings of the sixteenth ACM conference on Hypertext and hypermedia
MapReduce: simplified data processing on large clusters
OSDI'04 Proceedings of the 6th conference on Symposium on Opearting Systems Design & Implementation - Volume 6
Exploring a digital library through key ideas
Proceedings of the 8th ACM/IEEE-CS joint conference on Digital libraries
Generating links by mining quotations
Proceedings of the nineteenth ACM conference on Hypertext and hypermedia
SpotSigs: robust and efficient near duplicate detection in large web collections
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval
Controlled experiments on the web: survey and practical guide
Data Mining and Knowledge Discovery
Meme-tracking and the dynamics of the news cycle
Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining
Proceedings of the 20th ACM conference on Hypertext and hypermedia
Brute force and indexed approaches to pairwise document similarity comparisons with MapReduce
Proceedings of the 32nd international ACM SIGIR conference on Research and development in information retrieval
Highlighting disputed claims on the web
Proceedings of the 19th international conference on World wide web
A novel traffic analysis for identifying search fields in the long tail of web sites
Proceedings of the 19th international conference on World wide web
Reading Hypertext
Data-Intensive Text Processing with MapReduce
Data-Intensive Text Processing with MapReduce
HeidelTime: High quality rule-based extraction and normalization of temporal expressions
SemEval '10 Proceedings of the 5th International Workshop on Semantic Evaluation
Using Crowdsourcing and Active Learning to Track Sentiment in Online Media
Proceedings of the 2010 conference on ECAI 2010: 19th European Conference on Artificial Intelligence
Hadoop: The Definitive Guide
Using gaze patterns to study and predict reading struggles due to distraction
CHI '11 Extended Abstracts on Human Factors in Computing Systems
Emerging trends in search user interfaces
Proceedings of the 22nd ACM conference on Hypertext and hypermedia
Intrinsic plagiarism detection
ECIR'06 Proceedings of the 28th European conference on Advances in Information Retrieval
Hi-index | 0.00 |
Online critical literacy challenges readers to recognize and question how online textual information has been shaped by its greater context. While comparing information from multiple sources provides a foundation for such awareness, keeping pace with everything being written is a daunting proposition, especially for the casual reader. We propose a new form of technological assistance for critical literacy which automatically discovers and displays underlying memes: ideas represented by similar phrases which occur across diýerent information sources. By surfacing these memes to users, we create a rich hypertext representation in which underlying memes can be explored in context. Given the vast scale of social media, we describe a highly-scalable system architecture designed for MapReduce distributed computing. To validate our approach, we report on use of our system to discover and browse memes in a 1.5 TB collection of crawled social media. Our primary contributions include: 1) a novel technological approach and hypertext browsing design for supporting critical literacy; and 2) a highly-scalable system architecture for meme discovery, providing a solid foundation for further system extensions and refinements.