Andrea Burattin, Ph.D.

Assistant Professor

Online Discovery of Declarative Process Models from Event Streams

Abstract

Journal cover

Today's business processes are often controlled and supported by information systems. These systems record real-time information about business processes during their executions. This enables the analysis at runtime of the process behavior. However, many modern systems produce "big data", i.e., collections of data sets so large and complex that it becomes impossible to store and process all of them. Moreover, few processes are in steady-state but, due to changing circumstances, they evolve and systems need to adapt continuously. In this paper, we present a novel framework for the discovery of LTL-based declarative process models from streaming event data in settings where it is impossible to store all events over an extended period of time or where processes evolve while being analyzed. The framework continuously updates a set of valid business constraints based on the events occurred in the event stream. In addition, our approach is able to provide meaningful information about the most significant concept drifts, i.e., changes occurring in a process during its execution. We report about experimental results obtained using synthetic logs and a real-life event log pertaining to the treatment of patients diagnosed with cancer in a large Dutch academic hospital.

Paper Information and Files

Published in IEEE Transactions on Services Computing, vol. 8 (2015), no. 6, pp. 833-846.

  • Download text
  • Benchmark dataset

DOI: 10.1109/TSC.2015.2459703
ISSN: 1939-1374

Errata corrige

  • As reported on the description text, line 7 of Algorithm 1 should be $b_\text{curr}\gets \left\lceil\frac{N}{w} \right\rceil$ and not $b_\text{curr}\gets \left\lceil\frac{N}{\epsilon} \right\rceil$. Thanks to Robin Stenzel for pointing out the error.