Syntactic Identifier Conciseness and Consistency

Authors:
Dawn Lawrie;Henry Feild;David Binkley
Affiliations:
Loyola College, USA;Loyola College, USA;Loyola College, USA
Venue:
SCAM '06 Proceedings of the Sixth IEEE International Workshop on Source Code Analysis and Manipulation
Year:
2006

Citing 0
Cited 4

Increasing diversity: Natural language measures for software fault prediction
Journal of Systems and Software

AURA: a hybrid approach to identify framework evolution
Proceedings of the 32nd ACM/IEEE International Conference on Software Engineering - Volume 1

Canonical method names for Java: using implementation semantics to identify synonymous verbs
SLE'10 Proceedings of the Third international conference on Software language engineering

An exploratory study of identifier renamings
Proceedings of the 8th Working Conference on Mining Software Repositories

Abstract

Readers of programs have two main sources of domain information: identifier names and comments. It is therefore important for the identifier names (as well as comments) to communicate clearly the concepts that they are meant to represent. Deißenböck and Pizka recently introduced rules for concise and consistent variable naming. One requirement of their approach is an expert-provided mapping from identifiers to concepts. An approach for the concise and consistent naming of variables that does not require any additional information (e.g., a mapping) is presented. Using a pool of 48 million lines of code, experiments with the resulting syntactic rules for concise and consistent naming illustrate that violations of the syntactic pattern exist. Two case studies show that three quarters of the violations uncovered are "real"; that is, they would be identified using a concept mapping. Techniques for reducing the number of false positives are also presented. Finally, two related studies show that evolution does not introduce rule violations and that programmers tend to use a rather limited vocabulary.