Syntactic Identifier Conciseness and Consistency

  • Authors:
  • Dawn Lawrie;Henry Feild;David Binkley

  • Affiliations:
  • Loyola College, USA;Loyola College, USA;Loyola College, USA

  • Venue:
  • SCAM '06 Proceedings of the Sixth IEEE International Workshop on Source Code Analysis and Manipulation
  • Year:
  • 2006

Quantified Score

Hi-index 0.00

Visualization

Abstract

Readers of programs have two main sources of domain information: identifier names and comments. It is therefore important for the identifier names (as well as comments) to communicate clearly the concepts that they are meant to represent. Deiβenb篓ock and Pizka recently introduced rules for concise and consistent variable naming. One requirement of their approach is an expert provided mapping from identifiers to concepts. An approach for the concise and consistent naming of variables that does not require any additional information (e.g., a mapping) is presented. Using a pool of 48 million lines of code, experiments with the resulting syntactic rules for concise and consistent naming illustrate that violations of the syntactic pattern exist. Two case studies show that three quarters of the violations uncovered are "real". That is they would be identified using a concept mapping. Techniques for reducing the number of false positives are also presented. Finally, two related studies show that evolution does not introduce rule violations and that programmers tend to use a rather limited vocabulary.