Distinguishing copies from originals in software clones
Proceedings of the 4th International Workshop on Software Clones
Proceedings of the 19th ACM SIGSOFT symposium and the 13th European conference on Foundations of software engineering
WikiWho: precise and efficient attribution of authorship of revisioned content
Proceedings of the 23rd international conference on World wide web
Hi-index | 0.00 |
Source code metadata on a file-level granularity is too coarse for certain applications. But fine-grained metadata (e.g. line-by-line authorship) easily gets lost due to changes like merging, moving or copying code. Enabling metadata to survive code evolution provides valuable insights into program source code. This helps developers to understand the sources and opens up opportunities for advanced tools. We present a concept that utilizes different search heuristics to identify probable ancestors of source documents, and pair this with clone detection to locate origins of inserted code. Arbitrary kinds of metadata can then be linked to code sections and be preserved automatically while code evolves. We evaluate our approach using code from the Hydra and FreeCol projects, and sketch prospective applications.