An efficient algorithm for the “optimal” stable marriage
Journal of the ACM (JACM)
Snowball: extracting relations from large plain-text collections
DL '00 Proceedings of the fifth ACM conference on Digital libraries
Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Automatic acquisition of domain knowledge for Information Extraction
COLING '00 Proceedings of the 18th conference on Computational linguistics - Volume 2
Discovering relations among named entities from large corpora
ACL '04 Proceedings of the 42nd Annual Meeting on Association for Computational Linguistics
Multi-field information extraction and cross-document fusion
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
Preemptive information extraction using unrestricted relation discovery
HLT-NAACL '06 Proceedings of the main conference on Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics
StatSnowball: a statistical approach to extracting entity relationships
Proceedings of the 18th international conference on World wide web
Distant supervision for relation extraction without labeled data
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Unsupervised modeling of Twitter conversations
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Collective cross-document relation extraction without labelled data
EMNLP '10 Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing
Identifying content for planned events across social media sites
Proceedings of the fifth ACM international conference on Web search and data mining
Adding semantics to microblog posts
Proceedings of the fifth ACM international conference on Web search and data mining
Named entity recognition in tweets: an experimental study
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Social event detection and retrieval in collaborative photo collections
Proceedings of the 2nd ACM International Conference on Multimedia Retrieval
Open domain event extraction from twitter
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Social event detection on twitter
ICWE'12 Proceedings of the 12th international conference on Web Engineering
Extracting social events based on timeline and sentiment analysis in twitter corpus
NLDB'12 Proceedings of the 17th international conference on Applications of Natural Language Processing and Information Systems
Minimum-risk training of approximate CRF-based NLP systems
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Automatically constructing a normalisation dictionary for microblogs
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Lexical normalization for social media text
ACM Transactions on Intelligent Systems and Technology (TIST) - Special section on twitter and microblogging services, social recommender systems, and CAMRa2010: Movie recommendation in context
Jointly exploiting visual and non-visual information for event-related social media retrieval
Proceedings of the 3rd ACM conference on International conference on multimedia retrieval
Evolution of communities on Twitter and the role of their leaders during emergencies
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
How the live web feels about events
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Hi-index | 0.00 |
We present a novel method for record extraction from social streams such as Twitter. Unlike typical extraction setups, these environments are characterized by short, one sentence messages with heavily colloquial speech. To further complicate matters, individual messages may not express the full relation to be uncovered, as is often assumed in extraction tasks. We develop a graphical model that addresses these problems by learning a latent set of records and a record-message alignment simultaneously; the output of our model is a set of canonical records, the values of which are consistent with aligned messages. We demonstrate that our approach is able to accurately induce event records from Twitter messages, evaluated against events from a local city guide. Our method achieves significant error reduction over baseline methods.