Spelling checkers,spelling correctors and the misspellings of poor spellers
Information Processing and Management: an International Journal
Exponentiated gradient versus gradient descent for linear predictors
Information and Computation
An introduction to support Vector Machines: and other kernel-based learning methods
An introduction to support Vector Machines: and other kernel-based learning methods
Machine learning in automated text categorization
ACM Computing Surveys (CSUR)
Automatic text categorization in terms of genre and author
Computational Linguistics
Exploring the use of linguistic features in domain and genre classification
EACL '99 Proceedings of the ninth conference on European chapter of the Association for Computational Linguistics
Music artist style identification by semi-supervised learning from both lyrics and content
Proceedings of the 12th annual ACM international conference on Multimedia
On combining multiple clusterings
Proceedings of the thirteenth ACM international conference on Information and knowledge management
Whose thumb is it anyway?: classifying author personality from weblog text
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
ACM Transactions on Information Systems (TOIS)
Author identification: Using text sampling to handle the class imbalance problem
Information Processing and Management: an International Journal
Author identification using writer-dependent and writer-independent strategies
Proceedings of the 2008 ACM symposium on Applied computing
Chat mining: Predicting user and message attributes in computer-mediated communication
Information Processing and Management: an International Journal
Tensor Space Models for Authorship Identification
SETN '08 Proceedings of the 5th Hellenic conference on Artificial Intelligence: Theories, Models and Applications
Stylometric Identification in Electronic Markets: Scalability and Robustness
Journal of Management Information Systems
A survey of modern authorship attribution methods
Journal of the American Society for Information Science and Technology
Authorship attribution and verification with many authors and limited data
COLING '08 Proceedings of the 22nd International Conference on Computational Linguistics - Volume 1
Forensic Authorship Attribution Using Compression Distances to Prototypes
IWCF '09 Proceedings of the 3rd International Workshop on Computational Forensics
Particle Swarm Model Selection for Authorship Verification
CIARP '09 Proceedings of the 14th Iberoamerican Conference on Pattern Recognition: Progress in Pattern Recognition, Image Analysis, Computer Vision, and Applications
Compression and stylometry for author identification
IJCNN'09 Proceedings of the 2009 international joint conference on Neural Networks
Authorship attribution via combination of evidence
ECIR'07 Proceedings of the 29th European conference on IR research
Text-based video content classification for online video-sharing sites
Journal of the American Society for Information Science and Technology
Authorship classification: a syntactic tree mining approach
Proceedings of the ACM SIGKDD Workshop on Useful Patterns
Improving mood classification in music digital libraries by combining lyrics and audio
Proceedings of the 10th annual joint conference on Digital libraries
On combining multiple clusterings: an overview and a new perspective
Applied Intelligence
Language Resources and Evaluation
Authorship classification: a discriminative syntactic tree mining approach
Proceedings of the 34th international ACM SIGIR conference on Research and development in Information Retrieval
Automatic natural language style classification and transformation
IRSG'08 Proceedings of the 2008 BCS-IRSG conference on Corpus Profiling
Implicit group membership detection in online text: analysis and applications
SBP'12 Proceedings of the 5th international conference on Social Computing, Behavioral-Cultural Modeling and Prediction
A unified data mining solution for authorship analysis in anonymous textual communications
Information Sciences: an International Journal
Web search query privacy: Evaluating query obfuscation and anonymizing networks
Journal of Computer Security
Hi-index | 0.00 |
This paper considers the use of computational stylistics for performing authorship attribution of electronic messages, addressing categorization problems with as many as 20 different classes (authors). Effective stylistic characterization of text is potentially useful for a variety of tasks, as language style contains cues regarding the authorship, purpose, and mood of the text, all of which would be useful adjuncts to information retrieval or knowledge-management tasks. We focus here on the problem of determining the author of an anonymous message, based only on the message text. Several multiclass variants of the Winnow algorithm were applied to a vector representation of the message texts to learn models for discriminating different authors. We present results comparing the classification accuracy of the different approaches. The results show that stylistic models can be accurately learned to determine an author's identity.