Envisioning information
A statistical approach to machine translation
Computational Linguistics
Introduction to the theory of neural computation
Introduction to the theory of neural computation
Learning internal representations by error propagation
Parallel distributed processing: explorations in the microstructure of cognition, vol. 1
C4.5: programs for machine learning
C4.5: programs for machine learning
A corpus-based approach to language learning
A corpus-based approach to language learning
The nature of statistical learning theory
The nature of statistical learning theory
An introduction to Kolmogorov complexity and its applications (2nd ed.)
An introduction to Kolmogorov complexity and its applications (2nd ed.)
On the entropy of DNA: algorithms and measurements based on memory and rapid convergence
Proceedings of the sixth annual ACM-SIAM symposium on Discrete algorithms
Numerical Recipes in C++: the art of scientific computing
Numerical Recipes in C++: the art of scientific computing
The Code Book: The Evolution of Secrecy from Mary, Queen of Scots, to Quantum Cryptography
The Code Book: The Evolution of Secrecy from Mary, Queen of Scots, to Quantum Cryptography
Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms
Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms
A Tutorial on Support Vector Machines for Pattern Recognition
Data Mining and Knowledge Discovery
Using Literal and Grammatical Statistics for Authorship Attribution
Problems of Information Transmission
Gender-Preferential Text Mining of E-mail Discourse
ACSAC '02 Proceedings of the 18th Annual Computer Security Applications Conference
A practical part-of-speech tagger
ANLC '92 Proceedings of the third conference on Applied natural language processing
Language independent authorship attribution using character level language models
EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Applying Authorship Analysis to Extremist-Group Web Forum Messages
IEEE Intelligent Systems
Segmenting documents by stylistic character
Natural Language Engineering
Journal of the American Society for Information Science and Technology
Journal of the American Society for Information Science and Technology
Introduction to Automata Theory, Languages, and Computation (3rd Edition)
Introduction to Automata Theory, Languages, and Computation (3rd Edition)
Author verification by linguistic profiling: An exploration of the parameter space
ACM Transactions on Speech and Language Processing (TSLP)
Obfuscating document stylometry to preserve author anonymity
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Whose thumb is it anyway?: classifying author personality from weblog text
COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
An attempt to use weighted cusums to identify sublanguages
NeMLaP3/CoNLL '98 Proceedings of the Joint Conferences on New Methods in Language Processing and Computational Natural Language Learning
Cross-entropy and linguistic typology
NeMLaP3/CoNLL '98 Proceedings of the Joint Conferences on New Methods in Language Processing and Computational Natural Language Learning
Visualizing authorship for identification
ISI'06 Proceedings of the 4th IEEE international conference on Intelligence and Security Informatics
Chat mining for gender prediction
ADVIS'06 Proceedings of the 4th international conference on Advances in Information Systems
Automatically profiling the author of an anonymous text
Communications of the ACM - Inspiring Women in Computing
Empirical evaluation of authorship obfuscation using JGAAP
Proceedings of the 3rd ACM workshop on Artificial intelligence and security
Language Resources and Evaluation
Plagiarism and authorship analysis: introduction to the special issue
Language Resources and Evaluation
Authorship attribution in the wild
Language Resources and Evaluation
Developing a corpus of plagiarised short answers
Language Resources and Evaluation
Lost in translation: authorship attribution using frame semantics
HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Authorship attribution with latent Dirichlet allocation
CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
HotSec'11 Proceedings of the 6th USENIX conference on Hot topics in security
Who wrote this code? identifying the authors of program binaries
ESORICS'11 Proceedings of the 16th European conference on Research in computer security
Online conversation mining for author characterization and topic identification
Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management
On the generation of rich content metadata from social media
Proceedings of the 3rd international workshop on Search and mining user-generated contents
Utilising user texts to improve recommendations
UMAP'10 Proceedings of the 18th international conference on User Modeling, Adaptation, and Personalization
A weighted profile intersection measure for profile-based authorship attribution
MICAI'11 Proceedings of the 10th Mexican international conference on Advances in Artificial Intelligence - Volume Part I
Authorship Attribution Based on Specific Vocabulary
ACM Transactions on Information Systems (TOIS)
Using psycholinguistic features for profiling first language of authors
Journal of the American Society for Information Science and Technology
Implicit group membership detection in online text: analysis and applications
SBP'12 Proceedings of the 5th international conference on Social Computing, Behavioral-Cultural Modeling and Prediction
Use fewer instances of the letter "i": toward writing style anonymization
PETS'12 Proceedings of the 12th international conference on Privacy Enhancing Technologies
Adversarial stylometry: Circumventing authorship recognition to preserve privacy and anonymity
ACM Transactions on Information and System Security (TISSEC)
EACL 2012 Proceedings of the Workshop on Computational Approaches to Deception Detection
On the role of poetic versus nonpoetic features in “kindred” and diachronic poetry attribution
Journal of the American Society for Information Science and Technology
Expert Systems with Applications: An International Journal
Explanation in computational stylometry
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
Syntactic dependency-based n-grams: more evidence of usefulness in classification
CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Syntactic dependency-based n-grams as classification features
MICAI'12 Proceedings of the 11th Mexican international conference on Advances in Computational Intelligence - Volume Part II
Detecting multiple aliases in social media
Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Proceedings of the 1st International Workshop on Collaborative Annotations in Shared Environment: metadata, vocabularies and techniques in the Digital Humanities
Syntactic N-grams as machine learning features for natural language processing
Expert Systems with Applications: An International Journal
Hi-index | 0.00 |
Authorship attribution, the science of inferring characteristics of the author from the characteristics of documents written by that author, is a problem with a long history and a wide range of application. Recent work in "non-traditional" authorship attribution demonstrates the practicality of automatically analyzing documents based on authorial style, but the state of the art is confusing. Analyses are difficult to apply, little is known about type or rate of errors, and few "best practices" are available. In part because of this confusion, the field has perhaps had less uptake and general acceptance than is its due. This review surveys the history and present state of the discipline, presenting some comparative results when available. It shows, first, that the discipline is quite successful, even in difficult cases involving small documents in unfamiliar and less studied languages; it further analyzes the types of analysis and features used and tries to determine characteristics of well-performing systems, finally formulating these in a set of recommendations for best practices.