Mining for paths in flow graphs

  • Authors:
  • Adam Jocksch;José Nelson Amaral;Marcel Mitran

  • Affiliations:
  • Research in Motion, Waterloo, Canada;Department of Computing Science, University of Alberta, Edmonton, Canada;IBM Toronto Software Laboratory, Toronto, Canada

  • Venue:
  • ICDM'10 Proceedings of the 10th industrial conference on Advances in data mining: applications and theoretical aspects
  • Year:
  • 2010

Quantified Score

Hi-index 0.00

Visualization

Abstract

This paper presents FlowGSP, a data-mining algorithm that discovers frequent sequences of attributes in subpaths of a flow graph. FlowGSP was evaluated using flow graphs derived from the execution of transactions in the IBM® WebSphere® Application Server, a large real-world enterprise application server. The vertices of this flow graph may represent single instructions, bytecodes, basic blocks, regions, or entire methods. These vertices are annotated with attributes that correspond to run-time characteristics of the execution of the program. FlowGSP successfully identified a number of existing characteristics of the Web-Sphere Application Server which had previously been discovered only through extensive manual examination. In addition, a multithreaded implementation of FlowGSP demonstrates the algorithm's suitability for exploiting the resources of modern multi-core computers.