Software forensics: can we track code to its authors?
Computers and Security
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
Mining e-mail content for author identification forensics
ACM SIGMOD Record
Automatic Text Categorization and Its Application to Text Retrieval
IEEE Transactions on Knowledge and Data Engineering
Text Categorization with Suport Vector Machines: Learning with Many Relevant Features
ECML '98 Proceedings of the 10th European Conference on Machine Learning
A Comparative Study on Feature Selection in Text Categorization
ICML '97 Proceedings of the Fourteenth International Conference on Machine Learning
Style mining of electronic messages for multiple authorship discrimination: first results
Proceedings of the ninth ACM SIGKDD international conference on Knowledge discovery and data mining
Automatic text categorization in terms of genre and author
Computational Linguistics
Automatic detection of text genre
ACL '98 Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics and Eighth Conference of the European Chapter of the Association for Computational Linguistics
Recognizing text genres with simple metrics using discriminant analysis
COLING '94 Proceedings of the 15th conference on Computational linguistics - Volume 2
HICSS '05 Proceedings of the Proceedings of the 38th Annual Hawaii International Conference on System Sciences (HICSS'05) - Track 1 - Volume 01
Segmenting documents by stylistic character
Natural Language Engineering
Journal of the American Society for Information Science and Technology
Journal of the American Society for Information Science and Technology
The Multilingual Internet: Language, Culture and Communication Online
The Multilingual Internet: Language, Culture and Communication Online
Chat mining for gender prediction
ADVIS'06 Proceedings of the 4th international conference on Advances in Information Systems
Online conversation mining for author characterization and topic identification
Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management
A Semantic e-Collaboration Approach to Enable Awareness in Globally Distributed Organizations
International Journal of e-Collaboration
A unified data mining solution for authorship analysis in anonymous textual communications
Information Sciences: an International Journal
Mining Criminal Networks from Chat Log
WI-IAT '12 Proceedings of the The 2012 IEEE/WIC/ACM International Joint Conferences on Web Intelligence and Intelligent Agent Technology - Volume 01
Character usage in Chinese short message service SMS: a real-world study in Mainland China
International Journal of Mobile Communications
Hi-index | 0.00 |
The focus of this paper is to investigate the possibility of predicting several user and message attributes in text-based, real-time, online messaging services. For this purpose, a large collection of chat messages is examined. The applicability of various supervised classification techniques for extracting information from the chat messages is evaluated. Two competing models are used for defining the chat mining problem. A term-based approach is used to investigate the user and message attributes in the context of vocabulary use while a style-based approach is used to examine the chat messages according to the variations in the authors' writing styles. Among 100 authors, the identity of an author is correctly predicted with 99.7% accuracy. Moreover, the reverse problem is exploited, and the effect of author attributes on computer-mediated communications is discussed.