Simple fast algorithms for the editing distance between trees and related problems
SIAM Journal on Computing
A graph distance metric based on the maximal common subgraph
Pattern Recognition Letters
Web document clustering: a feasibility demonstration
Proceedings of the 21st annual international ACM SIGIR conference on Research and development in information retrieval
Introduction to Modern Information Retrieval
Introduction to Modern Information Retrieval
WW-FLOW: Web-Based Workflow Management with Runtime Encapsulation
IEEE Internet Computing
Organizing Business Knowledge: The MIT Process Handbook
Organizing Business Knowledge: The MIT Process Handbook
An Efficient and Scalable Algorithm for Clustering XML Documents by Structure
IEEE Transactions on Knowledge and Data Engineering
Workflow mining: a survey of issues and approaches
Data & Knowledge Engineering
Business Process Choreography for B2B Collaboration
IEEE Internet Computing
Process mining: a research agenda
Computers in Industry - Special issue: Process/workflow mining
Automatic Control of Workflow Processes Using ECA Rules
IEEE Transactions on Knowledge and Data Engineering
Efficient Phrase-Based Document Indexing for Web Document Clustering
IEEE Transactions on Knowledge and Data Engineering
State-Space Optimization of ETL Workflows
IEEE Transactions on Knowledge and Data Engineering
Development of process execution rules for workload balancing on agents
Data & Knowledge Engineering - Special issue: Business process management
Athena: text mining based discovery of scientific workflows in disperse repositories
RED'10 Proceedings of the Third international conference on Resource Discovery
Approximate clone detection in repositories of business process models
BPM'12 Proceedings of the 10th international conference on Business Process Management
Investigating clinical care pathways correlated with outcomes
BPM'13 Proceedings of the 11th international conference on Business Process Management
Hi-index | 0.02 |
Process-centric information systems have been accumulating a mount of process models. Process designers continue to create new process models and they long for process analysis tools in various viewpoints. This paper proposes a novel approach of process analysis. Workflow clustering facilitates to analyze accumulated workflow process models and classify them into characteristic groups. The framework consists of two phases: domain classification and pattern analysis. Domain classification exploits an activity similarity measure, while pattern analysis does a transition similarity measure. Process models are represented as weighted complete dependency graphs, and then similarities among their graph vectors are estimated in consideration of relative frequency of each activity and transition. Finally, the models are clustered based on the similarities by a hierarchical clustering algorithm. We implemented the methodology and experimented sets of synthetic processes. Workflow clustering is adaptable to various process analyses, such as workflow recommendation, workflow mining, and process patterns analysis.