Authorship attribution

Authors:
Patrick Juola
Affiliations:
Department of Mathematics and Computer Science, Duquesne University, Pittsburgh, PA
Venue:
Foundations and Trends in Information Retrieval
Year:
2006

Citing 29
Cited 29

Envisioning information

Envisioning information
A statistical approach to machine translation

Computational Linguistics
Introduction to the theory of neural computation

Introduction to the theory of neural computation
Learning internal representations by error propagation

Parallel distributed processing: explorations in the microstructure of cognition, vol. 1
C4.5: programs for machine learning

C4.5: programs for machine learning
A corpus-based approach to language learning

A corpus-based approach to language learning
The nature of statistical learning theory

The nature of statistical learning theory
An introduction to Kolmogorov complexity and its applications (2nd ed.)

An introduction to Kolmogorov complexity and its applications (2nd ed.)
On the entropy of DNA: algorithms and measurements based on memory and rapid convergence

Proceedings of the sixth annual ACM-SIAM symposium on Discrete algorithms
Numerical Recipes in C++: the art of scientific computing

Numerical Recipes in C++: the art of scientific computing
The Code Book: The Evolution of Secrecy from Mary, Queen of Scots, to Quantum Cryptography

The Code Book: The Evolution of Secrecy from Mary, Queen of Scots, to Quantum Cryptography
Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms

Learning to Classify Text Using Support Vector Machines: Methods, Theory and Algorithms
A Tutorial on Support Vector Machines for Pattern Recognition

Data Mining and Knowledge Discovery
Using Literal and Grammatical Statistics for Authorship Attribution

Problems of Information Transmission
Gender-Preferential Text Mining of E-mail Discourse

ACSAC '02 Proceedings of the 18th Annual Computer Security Applications Conference
A practical part-of-speech tagger

ANLC '92 Proceedings of the third conference on Applied natural language processing
Language independent authorship attribution using character level language models

EACL '03 Proceedings of the tenth conference on European chapter of the Association for Computational Linguistics - Volume 1
Applying Authorship Analysis to Extremist-Group Web Forum Messages

IEEE Intelligent Systems
Segmenting documents by stylistic character

Natural Language Engineering
A framework for authorship identification of online messages: Writing-style features and classification techniques

Journal of the American Society for Information Science and Technology
Feature instability as a criterion for selecting potential style markers: Special Topic Section on Computational Analysis of Style

Journal of the American Society for Information Science and Technology
Introduction to Automata Theory, Languages, and Computation (3rd Edition)

Introduction to Automata Theory, Languages, and Computation (3rd Edition)
Author verification by linguistic profiling: An exploration of the parameter space

ACM Transactions on Speech and Language Processing (TSLP)
Obfuscating document stylometry to preserve author anonymity

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
Whose thumb is it anyway?: classifying author personality from weblog text

COLING-ACL '06 Proceedings of the COLING/ACL on Main conference poster sessions
An attempt to use weighted cusums to identify sublanguages

NeMLaP3/CoNLL '98 Proceedings of the Joint Conferences on New Methods in Language Processing and Computational Natural Language Learning
Cross-entropy and linguistic typology

NeMLaP3/CoNLL '98 Proceedings of the Joint Conferences on New Methods in Language Processing and Computational Natural Language Learning
Visualizing authorship for identification

ISI'06 Proceedings of the 4th IEEE international conference on Intelligence and Security Informatics
Chat mining for gender prediction

ADVIS'06 Proceedings of the 4th international conference on Advances in Information Systems

Automatically profiling the author of an anonymous text

Communications of the ACM - Inspiring Women in Computing
Empirical evaluation of authorship obfuscation using JGAAP

Proceedings of the 3rd ACM workshop on Artificial intelligence and security
Intrinsic plagiarism analysis

Language Resources and Evaluation
Plagiarism and authorship analysis: introduction to the special issue

Language Resources and Evaluation
Authorship attribution in the wild

Language Resources and Evaluation
Developing a corpus of plagiarised short answers

Language Resources and Evaluation
Lost in translation: authorship attribution using frame semantics

HLT '11 Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies: short papers - Volume 2
Authorship attribution with latent Dirichlet allocation

CoNLL '11 Proceedings of the Fifteenth Conference on Computational Natural Language Learning
Herbert west: deanonymizer

HotSec'11 Proceedings of the 6th USENIX conference on Hot topics in security
Who wrote this code? identifying the authors of program binaries

ESORICS'11 Proceedings of the 16th European conference on Research in computer security
Online conversation mining for author characterization and topic identification

Proceedings of the 4th workshop on Workshop for Ph.D. students in information & knowledge management
On the generation of rich content metadata from social media

Proceedings of the 3rd international workshop on Search and mining user-generated contents
Utilising user texts to improve recommendations

UMAP'10 Proceedings of the 18th international conference on User Modeling, Adaptation, and Personalization
A weighted profile intersection measure for profile-based authorship attribution

MICAI'11 Proceedings of the 10th Mexican international conference on Advances in Artificial Intelligence - Volume Part I
Authorship Attribution Based on Specific Vocabulary

ACM Transactions on Information Systems (TOIS)
Using psycholinguistic features for profiling first language of authors

Journal of the American Society for Information Science and Technology
Implicit group membership detection in online text: analysis and applications

SBP'12 Proceedings of the 5th international conference on Social Computing, Behavioral-Cultural Modeling and Prediction
Use fewer instances of the letter "i": toward writing style anonymization

PETS'12 Proceedings of the 12th international conference on Privacy Enhancing Technologies
Adversarial stylometry: Circumventing authorship recognition to preserve privacy and anonymity

ACM Transactions on Information and System Security (TISSEC)
Detecting stylistic deception

EACL 2012 Proceedings of the Workshop on Computational Approaches to Deception Detection
On the role of poetic versus nonpoetic features in “kindred” and diachronic poetry attribution

Journal of the American Society for Information Science and Technology
Text mining applied to plagiarism detection: The use of words for detecting deviations in the writing style

Expert Systems with Applications: An International Journal
Explanation in computational stylometry

CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume 2
Syntactic dependency-based n-grams: more evidence of usefulness in classification

CICLing'13 Proceedings of the 14th international conference on Computational Linguistics and Intelligent Text Processing - Volume Part I
Syntactic dependency-based n-grams as classification features

MICAI'12 Proceedings of the 11th Mexican international conference on Advances in Computational Intelligence - Volume Part II
Detecting multiple aliases in social media

Proceedings of the 2013 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining
Towards a taxonomy of suspected forgery in authorship attribution field: a case: Montale's Diario postumo

Proceedings of the 1st International Workshop on Collaborative Annotations in Shared Environment: metadata, vocabularies and techniques in the Digital Humanities
Probabilistic neural network with homogeneity testing in recognition of discrete patterns set

Neural Networks
Syntactic N-grams as machine learning features for natural language processing

Expert Systems with Applications: An International Journal

Quantified Score

Hi-index	0.00

Visualization

Abstract

Authorship attribution, the science of inferring characteristics of the author from the characteristics of documents written by that author, is a problem with a long history and a wide range of application. Recent work in "non-traditional" authorship attribution demonstrates the practicality of automatically analyzing documents based on authorial style, but the state of the art is confusing. Analyses are difficult to apply, little is known about type or rate of errors, and few "best practices" are available. In part because of this confusion, the field has perhaps had less uptake and general acceptance than is its due. This review surveys the history and present state of the discipline, presenting some comparative results when available. It shows, first, that the discipline is quite successful, even in difficult cases involving small documents in unfamiliar and less studied languages; it further analyzes the types of analysis and features used and tries to determine characteristics of well-performing systems, finally formulating these in a set of recommendations for best practices.