Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms
Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms
A quantitative analysis of lexical differences between genders in telephone conversations
ACL '05 Proceedings of the 43rd Annual Meeting on Association for Computational Linguistics
TSD '08 Proceedings of the 11th international conference on Text, Speech and Dialogue
Get out the vote: determining support or opposition from congressional floor-debate transcripts
EMNLP '06 Proceedings of the 2006 Conference on Empirical Methods in Natural Language Processing
Modeling latent biographic attributes in conversational genres
ACL '09 Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP: Volume 2 - Volume 2
Democrats, republicans and starbucks afficionados: user classification in twitter
Proceedings of the 17th ACM SIGKDD international conference on Knowledge discovery and data mining
Deriving knowledge profiles from twitter
EC-TEL'11 Proceedings of the 6th European conference on Technology enhanced learning: towards ubiquitous learning
On the generation of rich content metadata from social media
Proceedings of the 3rd international workshop on Search and mining user-generated contents
Author age prediction from text using linear regression
LaTeCH '11 Proceedings of the 5th ACL-HLT Workshop on Language Technology for Cultural Heritage, Social Sciences, and Humanities
Discriminating gender on Twitter
EMNLP '11 Proceedings of the Conference on Empirical Methods in Natural Language Processing
1st international workshop on user modeling from social media
Proceedings of the 2012 ACM international conference on Intelligent User Interfaces
Identifying communicator roles in twitter
Proceedings of the 21st international conference companion on World Wide Web
BlurMe: inferring and obfuscating user gender based on ratings
Proceedings of the sixth ACM conference on Recommender systems
Mixing methods and theory to explore web activity
Proceedings of the 3rd Annual ACM Web Science Conference
Automatic humor classification on Twitter
NAACL HLT '12 Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies: Student Research Workshop
Streaming analysis of discourse participants
EMNLP-CoNLL '12 Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning
Proceedings of the 21st ACM international conference on Information and knowledge management
Tweets reveal more than you know: a learning style analysis on twitter
EC-TEL'12 Proceedings of the 7th European conference on Technology Enhanced Learning
Do online social network friends still threaten my privacy?
Proceedings of the third ACM conference on Data and application security and privacy
Recommending targeted strangers from whom to solicit information on social media
Proceedings of the 2013 international conference on Intelligent user interfaces
Personal User or Organizational User? Behavior on Microblog can Tell
ASONAM '12 Proceedings of the 2012 International Conference on Advances in Social Networks Analysis and Mining (ASONAM 2012)
Determining language variant in microblog messages
Proceedings of the 28th Annual ACM Symposium on Applied Computing
Using explicit linguistic expressions of preference in social media to predict voting behavior
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Steeler nation, 12th man, and boo birds: classifying Twitter user interests using time series
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
What is he/she like?: estimating Twitter user attributes from contents and social neighbors
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Identifying user attributes through non-i.i.d. multi-instance learning
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Language independent gender classification on Twitter
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Proceedings of the 19th ACM Symposium on Virtual Reality Software and Technology
Proceedings of the 22nd ACM international conference on Conference on information & knowledge management
Account Reachability: A Measure of Privacy Risk for Exposure of a User's Multiple SNS Accounts
Proceedings of International Conference on Information Integration and Web-based Applications & Services
Recognition of understanding level and language skill using measurements of reading behavior
Proceedings of the 19th international conference on Intelligent User Interfaces
User profiling in an ego network: co-profiling attributes and relationships
Proceedings of the 23rd international conference on World wide web
Hi-index | 0.00 |
Social media outlets such as Twitter have become an important forum for peer interaction. Thus the ability to classify latent user attributes, including gender, age, regional origin, and political orientation solely from Twitter user language or similar highly informal content has important applications in advertising, personalization, and recommendation. This paper includes a novel investigation of stacked-SVM-based classification algorithms over a rich set of original features, applied to classifying these four user attributes. It also includes extensive analysis of features and approaches that are effective and not effective in classifying user attributes in Twitter-style informal written genres as distinct from the other primarily spoken genres previously studied in the user-property classification literature. Our models, singly and in ensemble, significantly outperform baseline models in all cases. A detailed analysis of model components and features provides an often entertaining insight into distinctive language-usage variation across gender, age, regional origin and political orientation in modern informal communication.