Autograph: automatically extracting workflow file signatures

  • Authors:
  • Anna Povzner;Kimberly Keeton;Arif Merchant;Charles B. Morrey, III;Mustafa Uysal;Marcos K. Aguilera

  • Affiliations:
  • UC Santa Cruz;HP Labs;HP Labs;HP Labs;HP Labs;Microsoft Research

  • Venue:
  • ACM SIGOPS Operating Systems Review
  • Year:
  • 2009

Quantified Score

Hi-index 0.00

Visualization

Abstract

Storage management activities, such as reporting, file placement, migration and archiving, require the ability to discover files that belong to an application workflow by relying only on information from the file server. Some classes of application workflows, such as rendering an animated sequence from its graphics models or building an application from its source files, often exhibit a high degree of repeatability. We describe a system called Autograph that exploits this repeatability to discover files that belong to an application workflow. Our approach examines traces of file accesses, finds repeated and correlated accesses, and infers which files likely belong to the same workflow. Our solution targets server workflows and uses file server traces, which contain less process and file information than the local machine traces used in prior work. We show that Autograph successfully extracts workflow file signatures, even if the workflows are concurrent or share files.