Discovering process models from event multiset

  • Authors:
  • Dongyi Wang;Jidong Ge;Hao Hu;Bin Luo;Liguo Huang

  • Affiliations:
  • State Key Laboratory for Novel Software Technology, Nanjing University, China and Software Institute, Nanjing University, Nanjing 210093, China;State Key Laboratory for Novel Software Technology, Nanjing University, China and Software Institute, Nanjing University, Nanjing 210093, China;State Key Laboratory for Novel Software Technology, Nanjing University, China;State Key Laboratory for Novel Software Technology, Nanjing University, China and Software Institute, Nanjing University, Nanjing 210093, China;Dept. of Computer Science and Engineering, Southern Methodist University, Dallas, TX 75275-0122, USA

  • Venue:
  • Expert Systems with Applications: An International Journal
  • Year:
  • 2012

Quantified Score

Hi-index 12.05

Visualization

Abstract

The aim of process mining is to discover the process model from the event log which is recorded by the information system. Typical steps of process mining algorithm can be described as: (1) generating event traces from event log, (2) analyzing event traces and obtaining ordering relations of tasks, (3) generating process model with ordering relations of tasks. The first two steps could be very time consuming involving millions of events and thousands of event traces. This paper presents a novel algorithm (@l-algorithm) which almost eliminates these two steps in generating event traces from event log and analyzing event traces so as to reduce the performance of process mining algorithm. Firstly, we retrieve the event multiset (input data of algorithm marked as MS) which records the frequency of each event but ignores their orders when extracted from event logs. The event in event multiset contains the information of post-activities. Secondly, we obtain ordering relations from event multiset. The ordering relations contain causal dependency, potential parallelism and non-potential parallelism. Finally, we discover a process models with ordering relations. The complexity of @l-algorithm is only bound up with the event classes (the set of events in event logs) that has significantly improved the performance of existing process mining algorithms and is expected to be more practical in real-world process mining based on event logs, as well as being able to detect SWF-nets, short-loops and most of implicit dependency (generated by non-free choice constructions).