Abstract
In the field of process mining, it is worth noting that process mining techniques assume that the resulting event logs can not only continuously record the occurrence of events but also contain all event data. However, like in IoT systems, data transmission may fail due to weak signal or resource competition, which causes the company’s information system to be unable to keep a complete event log. Based on a incomplete event log, the process model obtained by using existing process mining technologies is deviated from actual business process to a certain degree. In this paper, we propose a method for repairing missing activities based on succession relation of activities from event logs. We use an activity relation matrix to represent the event log and cluster it. The number of traces in the cluster is used as a measure of similarity calculation between incomplete traces and cluster results. Parallel activities in selecting pre-occurrence and post-occurrence activities of missing activities from incomplete traces are considered. Experimental results on real-life event logs show that our approach performs better than previous method in repairing missing activities.








Similar content being viewed by others
Notes
References
Van der Aalst W, Weijters T, Maruster L (2004) Workflow mining: discovering process models from event logs. IEEE Trans Knowl Data Eng 16(9):1128–1142
van der Aalst WM (2018) Process discovery from event data: relating models and logs through abstractions. Wiley Interdiscip Rev Data Min Knowl Discov 8(3):e1244
Aalst WMPVD (2011) Process mining: discovery. Springer, Berlin
van der Aalst (2012) Process mining manifesto. n: Daniel F, Barkaoui K, Dustdar S (eds) Business process management workshops. Springer, Berlin, pp 169–194
Augusto A, Conforti R, Dumas M, La Rosa M, Bruno G (2016) Automated discovery of structured process models: discover structured vs. discover and structure. In: Comyn-Wattiau I, Tanaka K, Song IY, Yamamoto S, Saeki M (eds) Conceptual modeling. Springer, Cham, pp 313–329
Augusto A, Conforti R, Dumas M, La Rosa M, Polyvyanyy A (2019) Split miner: automated discovery of accurate and simple business process models from event logs. Knowl Inf Syst 59(2):251–284. https://doi.org/10.1007/s10115-018-1214-x
Brown ML, Kros JF (2003) Data mining and the impact of missing data. Ind Manag Data Syst 103(8):611–621
De Medeiros AKA, Van Dongen BF, Van der Aalst WMP, Weijters AJMM (2004) Process mining: extending the \(\alpha \)-algorithm to mine short loops, BETA Working Paper Series, WP 113, Eindhoven University of Technology, Eindhoven
De Weerdt J, Vanden Broucke S, Vanthienen J, Baesens B (2013) Active trace clustering for improved process discovery. IEEE Trans Knowl Data Eng 25(12):2708–2720
Delias P, Doumpos M, Grigoroudis E, Matsatsinis N (2019) A non-compensatory approach for trace clustering. Int Trans Oper Res 26(5):1828–1846
Di Francescomarino C, Dumas M, Federici M, Ghidini C, Maggi FM, Rizzi W, Simonetto L (2018) Genetic algorithms for hyperparameter optimization in predictive business process monitoring. Inf Syst 74:67–83
Effendi YA, Sarno R (2017) Discovering process model from event logs by considering overlapping rules. In: 2017 4th International conference on electrical engineering, computer science and informatics (EECSI) pp 1–6
Fahland D, van der Aalst WMP (2012) Repairing process models to reflect reality. In: Barros A, Gal A, Kindler E (eds) Business process management. Springer, Berlin, pp 229–245
Greco G, Guzzo A, Pontieri L, Saccà D (2004) Mining expressive process models by clustering workflow traces. In: Dai H, Srikant R, Zhang C (eds) Advances in knowledge discovery and data mining. Springer, Berlin, pp 52–62
Gu CQ, Chang HY, Yi Y (2008) Workflow mining: extending the \(\alpha \)algorithm to mine duplicate tasks. In: 2008 International conference on machine learning and cybernetics, IEEE, vol 1, pp 361–368
Günther CW, Van Der Aalst WMP (2007) Fuzzy mining - adaptive process simplification based on multi-perspective metrics. In: Alonso G, Dadam P, Rosemann M (eds) Business process management, BPM 2007. Lecture notes in computer science, vol 4714. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-75183-0_24
Ha QT, Bui HN, Nguyen TT (2016) A trace clustering solution based on using the distance graph model. In: Nguyen NT, Iliadis L, Manolopoulos Y, Trawiński B (eds) Computational collective intelligence. Springer, Cham, pp 313–322
Jaeger D, Jung R (eds) (2015) Self-organizing maps. Springer, New York, pp 2655–2655. https://doi.org/10.1007/978-1-4614-6675-8_100525
Lamma E, Mello P, Riguzzi F, Storari S (2008) Applying inductive logic programming to process mining. In: Blockeel H, Ramon J, Shavlik J, Tadepalli P (eds) Inductive logic programming. Springer, Berlin, pp 132–146
Lee C, Choy KL, Ho GT, Lam CH (2016) A slippery genetic algorithm-based process mining system for achieving better quality assurance in the garment industry. Expert Syst Appl 46:236–248
Leemans SJJ, Fahland D, van der Aalst WMP (2013) Discovering block-structured process models from event logs - a constructive approach. In: Colom JM, Desel J (eds) Application and theory of petri nets and concurrency. Springer, Berlin, pp 311–329
Leemans SJJ, Fahland D, van der Aalst WMP (2014) Discovering block-structured process models from incomplete event logs. In: Ciardo G, Kindler E (eds) Application and theory of petri nets and concurrency. Springer, Cham, pp 91–110
de Leoni M, van der Aalst WMP (2013) Aligning event logs and process models for multi-perspective conformance checking: An approach based on integer linear programming. In: Daniel F, Wang J, Weber B (eds) Business process management. Springer, Berlin, pp 113–129
Lu X, Fahland D, van der Aalst WMP (2015) Conformance checking based on partially ordered event data. In: Fournier F, Mendling J (eds) Business process management workshops. Springer, Cham, pp 75–88
Mannhardt F, de Leoni M, Reijers HA, van der Aalst WMP (2016) Balanced multi-perspective checking of process conformance. Computing 98(4):407–437. https://doi.org/10.1007/s00607-015-0441-1
Mannhardt F, de Leoni M, Reijers HA, van der Aalst WM, Toussaint PJ (2018) Guided process discovery-a pattern-based approach. Inf Syst 76:1–18
de Medeiros AKA, Weijters AJMM, van der Aalst WMP (2007) Genetic process mining: an experimental evaluation. Data Min Knowl Discov 14(2):245–304. https://doi.org/10.1007/s10618-006-0061-7
Rozinat A, Van der Aalst WM (2008) Conformance checking of processes based on monitoring real behavior. Inf Syst 33(1):64–95
Song M, Günther CW, van der Aalst WMP (2009) Trace clustering in process mining. In: Ardagna D, Mecella M, Yang J (eds) Business process management workshops. Springer, Berlin, pp 109–120
Sun Y, Bauer B, Weidlich M (2017) Compound trace clustering to generate accurate and simple sub-process models. In: Maximilien M, Vallecillo A, Wang J, Oriol M (eds) Service-oriented computing. Springer, Cham, pp 175–190
Van Der Aalst WMP (2013) Business process management: a comprehensive survey. ISRN Softw Eng 2013:37. https://doi.org/10.1155/2013/507984
Wang P, Tan W, Tang A, Hu K (2018) A novel trace clustering technique based on constrained trace alignment. In: Zu Q, Hu B (eds) Human centered computing. Springer, Cham, pp 53–63
Weijters A, Ribeiro J (2011) Flexible heuristics miner (FHM). In: 2011 IEEE symposium on computational intelligence and data mining (CIDM), IEEE, pp 310–317
Wen L, van der Aalst WMP, Wang J, Sun J (2007) Mining process models with non-free-choice constructs. Data Min Knowl Discov 15(2):145–180. https://doi.org/10.1007/s10618-007-0065-y
Xu J, Liu J (2019) A profile clustering based event logs repairing approach for process mining. IEEE Access 7:17872–17881
Zakarija I, Skopljanac-Macina F, Blaskovic B (2015) Discovering process model from incomplete log using process mining. In: 2015 57th International symposium ELMAR (ELMAR) pp 117–120
ZarehFarkhady R, Aali SH, Branch B (2012) A two phase approach for process mining in incomplete and noisy Logs. Int J Comput Sci Issues, 9(1)
van Zelst SJ, van Dongen BF, van der Aalst WMP (2018) Event stream-based process discovery using abstract representations. Knowl Inf Syst 54(2):407–435. https://doi.org/10.1007/s10115-017-1060-2
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Liu, J., Xu, J., Zhang, R. et al. A repairing missing activities approach with succession relation for event logs. Knowl Inf Syst 63, 477–495 (2021). https://doi.org/10.1007/s10115-020-01524-6
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10115-020-01524-6