An XML-Based Lightweight C++ Fact Extractor

  • Venue: IWPC '03 Proceedings of the 11th IEEE International Workshop on Program Comprehension
  • Year: 2003

Abstract

A lightweight fact extractor is presented that uses XML tools, such as XPath and XSLT, to extract static information from C++ source code. The source code is first converted into an XML representation, srcML, to facilitate the use of a wide variety of XML tools. The method is deemed lightweight because the source is only partially parsed. Additionally, the technique is quite robust and can be applied to incomplete and non-compilable source code. The trade-off of this approach is that queries on some low-level details cannot be directly addressed. The approach is applied to a fact extractor benchmark for comparison with other, albeit heavier-weight, fact extractors. Fact extractors are widely used to support understanding tasks associated with maintenance, reverse engineering, and various other software engineering tasks.
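
To make the idea of querying an XML view of source code concrete, the following is a minimal sketch and not the authors' extractor. It assumes a hand-written, heavily simplified fragment in the style of srcML (the element names `class`, `super`, `block`, and `function_decl`, and the namespace URI, are assumptions for illustration; real srcML output carries more nesting and detail). The helper `extract_class_facts` is hypothetical. It uses the limited XPath subset in Python's standard library to pull out class names, base classes, and declared member functions.

```python
import xml.etree.ElementTree as ET

# Namespace prefix map for the queries (the URI is assumed for this sketch).
SRC_NS = {"src": "http://www.srcML.org/srcML/src"}

# Hand-written, simplified srcML-style markup for two C++ class declarations.
SRCML_FRAGMENT = """<unit xmlns="http://www.srcML.org/srcML/src" language="C++">
<class>class <name>Shape</name> <block>{
<function_decl><type><name>double</name></type> <name>area</name><parameter_list>()</parameter_list>;</function_decl>
}</block>;</class>
<class>class <name>Circle</name> <super>: <name>Shape</name></super> <block>{
<function_decl><type><name>double</name></type> <name>area</name><parameter_list>()</parameter_list>;</function_decl>
<function_decl><type><name>double</name></type> <name>radius</name><parameter_list>()</parameter_list>;</function_decl>
}</block>;</class>
</unit>"""


def extract_class_facts(xml_text):
    """Return (class name, base class names, member function names) per class."""
    root = ET.fromstring(xml_text)
    facts = []
    for cls in root.findall(".//src:class", SRC_NS):
        # The class's own name is a direct child <name> of <class>.
        name = cls.findtext("src:name", namespaces=SRC_NS)
        # Base classes appear as <name> elements under <super>.
        bases = [n.text for n in cls.findall(".//src:super/src:name", SRC_NS)]
        # A function's name is the direct child <name> of <function_decl>;
        # the return type's <name> is nested under <type> and is skipped.
        methods = [
            n.text for n in cls.findall(".//src:function_decl/src:name", SRC_NS)
        ]
        facts.append((name, bases, methods))
    return facts


if __name__ == "__main__":
    for name, bases, methods in extract_class_facts(SRCML_FRAGMENT):
        print(name, bases, methods)
```

Because the queries match only the elements they name, markup for surrounding code that is incomplete or would not compile does not prevent these facts from being found, which reflects the robustness described in the abstract. The same path expressions could in principle be evaluated by a full XPath or XSLT processor instead of the standard-library subset used here.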