On the Use of Automated Text Summarization Techniques for Summarizing Source Code

Authors:
Sonia Haiduc;Jairo Aponte;Laura Moreno;Andrian Marcus
Affiliations:
-;-;-;-
Venue:
WCRE '10 Proceedings of the 2010 17th Working Conference on Reverse Engineering
Year:
2010

Citing 0
Cited 3

Improving traceability link recovery methods through software artifact summarization

Proceedings of the 6th International Workshop on Traceability in Emerging Forms of Software Engineering
Concept location using formal concept analysis and information retrieval

ACM Transactions on Software Engineering and Methodology (TOSEM)
Why did this code change?

Proceedings of the 2013 International Conference on Software Engineering

Quantified Score

Hi-index	0.00

Visualization

Abstract

During maintenance developers cannot read the entire code of large systems. They need a way to get a quick understanding of source code entities (such as, classes, methods, packages, etc.), so they can efficiently identify and then focus on the ones related to their task at hand. Sometimes reading just a method header or a class name does not tell enough about its purpose and meaning, while reading the entire implementation takes too long. We study a solution which mitigates the two approaches, i.e., short and accurate textual descriptions that illustrate the software entities without having to read the details of the implementation. We create such descriptions using techniques from automatic text summarization. The paper presents a study that investigates the suitability of various such techniques for generating source code summaries. The results indicate that a combination of text summarization techniques is most appropriate for source code summarization and that developers generally agree with the summaries produced.