Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data
ICML '01 Proceedings of the Eighteenth International Conference on Machine Learning
Building a large annotated corpus of English: the penn treebank
Computational Linguistics - Special issue on using large corpora: II
Feature-rich part-of-speech tagging with a cyclic dependency network
NAACL '03 Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology - Volume 1
Unsupervised modeling of Twitter conversations
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Summarizing microblogs automatically
HLT '10 Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics
Word representations: a simple and general method for semi-supervised learning
ACL '10 Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics
Annotating named entities in Twitter data with crowdsourcing
CSLDAMT '10 Proceedings of the NAACL HLT 2010 Workshop on Creating Speech and Language Data with Amazon's Mechanical Turk
Predicting the Future with Social Media
WI-IAT '10 Proceedings of the 2010 IEEE/WIC/ACM International Conference on Web Intelligence and Intelligent Agent Technology - Volume 01
Robust sentiment detection on Twitter from biased and noisy data
COLING '10 Proceedings of the 23rd International Conference on Computational Linguistics: Posters
Journal of the American Society for Information Science and Technology
Adapting a WSJ trained part-of-speech tagger to noisy text: preliminary results
Proceedings of the 2011 Joint Workshop on Multilingual OCR and Analytics for Noisy Unstructured Text Data
Extracting semantic annotations from twitter
Proceedings of the fourth workshop on Exploiting semantic annotations in information retrieval
Online conversation mining for author characterization and topic identification
Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management
On the generation of rich content metadata from social media
Proceedings of the 3rd international workshop on Search and mining user-generated contents
Mining the interests of Chinese microbloggers via keyword extraction
Frontiers of Computer Science in China
Named entity recognition in tweets: an experimental study
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
Towards building large-scale distributed systems for twitter sentiment analysis
Proceedings of the 27th Annual ACM Symposium on Applied Computing
Sentiment analysis on twitter data for portuguese language
PROPOR'12 Proceedings of the 10th international conference on Computational Processing of the Portuguese Language
Entity-centric topic-oriented opinion summarization in twitter
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
Open domain event extraction from twitter
Proceedings of the 18th ACM SIGKDD international conference on Knowledge discovery and data mining
TwiNER: named entity recognition in targeted twitter stream
SIGIR '12 Proceedings of the 35th international ACM SIGIR conference on Research and development in information retrieval
Towards an advanced system for real-time event detection in high-volume data streams
Proceedings of the 5th Ph.D. workshop on Information and knowledge
Proceedings of the Workshop on Semantic Analysis in Social Media
Automatically constructing a normalisation dictionary for microblogs
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Type-supervised hidden Markov models for part-of-speech tagging with incomplete tag dictionaries
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Community-based classification of noun phrases in twitter
Proceedings of the 21st ACM international conference on Information and knowledge management
Lexical normalization for social media text
ACM Transactions on Intelligent Systems and Technology (TIST) - Special section on twitter and microblogging services, social recommender systems, and CAMRa2010: Movie recommendation in context
Mining Divergent Opinion Trust Networks through Latent Dirichlet Allocation
ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
Fusing Text and Frienships for Location Inference in Online Social Networks
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
The utility of social and topical factors in anticipating repliers in Twitter conversations
Proceedings of the 5th Annual ACM Web Science Conference
TV program detection in tweets
Proceedings of the 11th european conference on Interactive TV and video
Exploiting hybrid contexts for Tweet segmentation
Proceedings of the 36th international ACM SIGIR conference on Research and development in information retrieval
Event identification for local areas using social media streaming data
Proceedings of the ACM SIGMOD Workshop on Databases and Social Networks
Using topic models for Twitter hashtag recommendation
Proceedings of the 22nd international conference on World Wide Web companion
FS-NER: a lightweight filter-stream approach to named entity recognition on twitter data
Proceedings of the 22nd international conference on World Wide Web companion
Towards automatic assessment of the social media impact of news content
Proceedings of the 22nd international conference on World Wide Web companion
Practical extraction of disaster-relevant information from social media
Proceedings of the 22nd international conference on World Wide Web companion
Supervised polarity classification of Spanish tweets based on linguistic knowledge
Proceedings of the 2013 ACM symposium on Document engineering
Identifying purpose behind electoral tweets
Proceedings of the Second International Workshop on Issues of Sentiment Discovery and Opinion Mining
RAProp: ranking tweets by exploiting the tweet/user/web ecosystem and inter-tweet agreement
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Tracing the German centennial flood in the stream of tweets: first lessons learned
Proceedings of the Second ACM SIGSPATIAL International Workshop on Crowdsourced and Volunteered Geographic Information
Automatic Domain-Specific Sentiment Lexicon Generation with Label Propagation
Proceedings of International Conference on Information Integration and Web-based Applications & Services
Scalable topic-specific influence analysis on microblogs
Proceedings of the 7th ACM international conference on Web search and data mining
Semantic stability in social tagging streams
Proceedings of the 23rd international conference on World wide web
Twitter n-gram corpus with demographic metadata
Language Resources and Evaluation
Hi-index | 0.00 |
We address the problem of part-of-speech tagging for English data from the popular micro-blogging service Twitter. We develop a tagset, annotate data, develop features, and report tagging results nearing 90% accuracy. The data and tools have been made available to the research community with the goal of enabling richer text analysis of Twitter and related social media data sets.