Celonis Report 1276
Celonis Report 1276
BACHELOR OF TECHNOLOGY
In
Information Technology
by
M Bindu
Regd.No.:22501A1276
Under Supervision of
Dr. G. Reshma
internship coordinator
(Duration: Jan-Mar 2024)
i. . v
DEPARTMENT OF INFORMATION TECHNOLOGY
Prasad V Potluri Siddhartha Institute of Technology
CERTIFICATE
ii. . v
iii. . v
ACKNOWLEDGEMENT
First I would like to thanks AICTE EDUSKILLS FOUNDATION for giving me the
opportunity to do an internship virtually.
I also would like all the people that worked along with me, with their patience and
openness they created an enjoyable working environment.
It is indeed with a great sense of pleasure and immense sense of gratitude that I
acknowledge the help of these individuals.
I am highly indebted to Principal Dr. K. SIVAJI BABU, for the facilities provided to
accomplish this internship.
I would like to thank my Head of the Department Dr. B.V. SUBBA RAO for his
constructive criticism throughout my internship.
I am extremely great full to my department staff members and friends who helped me
in successful completion of this internship.
M BINDU
(22501A1276)
iv. . v
ABSTRACT
Celonis Process Mining is a technology platform that helps organizations visualize, analyze, and
optimize their business processes in real-time. It provides insights into how processes like order-to-
cash, purchase-to-pay, and supply chain operations are actually performing based on data extracted
from enterprise systems like ERP (e.g., SAP, Oracle) or CRM systems (e.g., Salesforce).
Key Components:
Data Extraction: Celonis pulls raw data (logs, timestamps, events) from an organization’s
transactional systems.
Process Discovery: It reconstructs actual processes by mapping event logs and showing
how tasks or events flow in the real world, including deviations from intended processes.
Process Visualization: The platform creates visual representations (process maps) of
workflows, making it easy to see bottlenecks, inefficiencies, or compliance issues.
Performance Analysis: Celonis provides detailed metrics on process performance, such as
lead times, throughput, rework rates, and compliance with business rules. It can benchmark
process performance and identify opportunities for optimization.
Actionable Insights: Using AI and machine learning, Celonis suggests improvements or
optimizations for processes. These could be eliminating unnecessary steps, reducing wait
times, or improving resource allocation.
Automation: Celonis can also help automate parts of the processes to increase efficiency
and reduce human intervention, integrating with Robotic Process Automation (RPA) tools
to drive automation.
v. . v
Key parts of the report:
Under each division we further provide specific industry solutions on focused domains with
cutting edge technologies
Identification of Bottlenecks and Inefficiencies: The report will highlight critical points
of delay, rework, or process deviations, providing clear areas where the company can focus
its optimization efforts.
Process Standardization: The report can reveal variations in the way processes are
executed, allowing the organization to standardize best practices and reduce inconsistencies.
Real-Time Process Monitoring: The report might propose solutions to enhance real-time
monitoring of key business processes, allowing for quicker detection and resolution of
issues.
Reduction in Operational Costs: By pinpointing areas where time and resources are
wasted, the report can help identify cost-saving opportunities, such as reducing rework,
cutting down on unnecessary steps, or improving procurement processes.
Process Automation Opportunities: The analysis might reveal repetitive manual tasks that
could be automated through Robotic Process Automation (RPA), leading to significant time
and cost saving
vi. . v
TABLE OF CONTENTS
Declaration.................................................................................................................................
Certificate ..................................................................................................................................
Internship completed certificate...............................................................................................
Acknowledgement .....................................................................................................................
Abstract ......................................................................................................................................
Table of contents........................................................................................................................
List of figures..............................................................................................................................
Index............................................................................................................................................
Learning objectives/internship objectives................................................................................
Weekly overview of internship…..............................................................................................
1. Introduction...........................................................................................................................3
1.1 Introduction
1.2 History of company
1.3 Products and Services
1.4 About the course
2. Process Mining………………………………………………………………………………5
2.1 Process Mining Fundamentals
2.2 Process Mining Implementation
2.3 Data Analysis and Prepocessing
2.4 Creation of the Celonis Data Model
2.4.1 Creation of the Activity and case table
2.4.2 Creation of Data Model
2.4.3 PQL introduction and JSON import
2.5 Creation of the Celonis Dashboards
2.5.1 Standby Crew Utilization KP
2.5.2 Notification Automation Rate KPI
3. Modules ...................................................................................................................................12
3.1 Process Mining
3.2 Process Mining (cloud)
3.3 Process Mining transparency
3.4 Process Mining is the MRI for Processes
3.5 Mining Algorithms
3.6 Starting a project in mining
3.7 Industrial usage of mining
3.8 Process Mining Software’s
3.9 Software Key Functions
3.10 Process Mining Software Providers
4.Technology……………………………………………………………………………………19
4.1 App templates
4.2 Extracting and loading data
4.3 Editing data transformation
4.4 Customizing process apps
4.5 Root cause analysis
4.6 Managing access control for process apps
5. Applications...............................................................................................................................21
6. Learning outcomes……………………………………………………………………………22
7. Conclusion…………………………………………………………………………………….23
vii. . v
Learning Objectives/Internship Objectives
Develop skills in using the Celonis platform for process visualization and analysis
efficiency
Understand the role of process mining in digital transformation and operational excellence
viii. . v
WEEKLY OVERVIEW OF INTERNSHIP ACTIVITIES
1
DATE DAY NAME OF THE TOPIC/MODULE COMPLETED
18-02-2024 SUNDAY APP TEMPLATES
4th WEEK
CHAPTER-1
2
INTRODUCTION
1.1 Introduction of Celonis
Celonis is a German data processing company that offers software as a service (SaaS) to
improve business processes. It is headquartered in Munich, Germany. Celonis is the global
leader in execution management. The Celonis Execution Management System (EMS)
provides companies a modern way to run their business processes entirely on data and
intelligence.
CHAPTER-2
PROCESS MINING
4
Types of Process Mining
Process mining typically includes three types of analysis:
Discovery: Automatically creating process models from event logs without any pre-
existing models. It helps reveal the true nature of processes.
Conformance Checking: Comparing the actual process (as derived from event logs) with
a pre-defined model to identify deviations or compliance issues.
Enhancement: Extending or improving existing process models based on the event data
to optimize performance, remove bottlenecks, or improve efficiency.
5
transparency it is possible to analyze bottlenecks and counteract against the identified problems.
Celonis software allows to intuitively explore the business process, checks for conforming cases,
and has additional functionalities, like a machine learning workbench or an action engine that
proactively suggests process improvements. This work will make use of Celonis process mining
to analyze crew management data and to derive action potentials for process improvements. A
Celonis on-premise 4.6 version was used to build every process mining analysis
6
accordance with CLH experts so assure categories correctly represented the data. We also added
tables not included in our Netline database to make our data more comprehensible (e.g. adding
airport coordinates to map airports). In order to understand the data, SQL queries have been
performed to understand the data structure and data type of the underlying data. As an example, a
part of the content of the raw event table from Netline is illustrated below. This table was one of
our main sources of information. As visible in figure 4, the Netline event table contains many
columns. Of biggest importance to us was TLC, begin date and begin time as well as the column
type. While the TLC allowed us to identify a crew member, the begin date and begin time helped
us to chronologize events. The column types were used to specify the type of event. The types are
very general and contain 8 categories, including Leg, Absence or Simulator. Within CLH a Leg
refers to a completed flight. Also visible from figure 4 are the differences in data types. In the
Netline table data points are stored as integers, varchars and datetime formats.
In order to get the data into the right shape for process mining it was essential to perform extract,
transform and load (ETL) tasks on the data. Not all tables could be used in their raw format,
therefore we had to create a set of new tables based on the raw tables. For this, we extracted the
necessary columns, thereby also reducing the volume of our set of tables. Next, we performed
transformations to standardize the tables to achieve a consistent data set. First, inconsistencies in
the format of dates in the tables would have lead to inconsistent Case Keys. Case Keys are the
unique IDs to trace employees across their duty days. We create one case key for every duty day
of every employee. To prevent inconsistencies in our Case Keys, we standardized the date format
with one of the SQL-supported formats: yyyy-mm-dd. After standardizing the date format, we
concatenated it with the time. This allows for a more detailed process flow in the data model,
allowing us to track each employee across a given day, with a chronological order of all events
due to the precise timestamp. The time was stored as a normal integer value such as ”245”
representing 02:45 am or ”2309” symbolizing 11:09 pm. The dates and the times were
concatenated in the final datetime format yyyy-mm-dd HH:mm:ss. Furthermore, it was essential
for all tables to have one column in common to be able to join the tables together and infer their
relationship to one another. This was accomplished by adding the Case Key column to most of
the tables. The Case Key was a concatenation of the TLC, which is an integer value, with the
begin date. Thus, the Case Key has the format TLC yyyy-mm-dd. Tables such as the
Cancellations table, which contains cancelled flights, did not need to be joined via the Case Key,
but rather through the flight number. This resulted in a consistent data set across eight tables
containing all important information for every event in the crew management process.
7
2.4.1Creation of the Activity and Case Table
Based on the tables we extracted from the Netline database, we were able to create the cases table
and the activity table; the two most important tables in a process mining analysis. The activity
table contains every activity executed. Our activity table contains over 3.4 million distinct
activities. Based on the mapping provided to us by CLH, the initial activity table contained the
following columns: ID, activity name, timestamp, flight, departure and arrival. A template for the
activity table, which was used in the past for another process mining project in one of CLHs
subsidaries, contained numerous restrictions: i.e. only specific types of flights, duties or captains
with a specific rank. This served as the basis for our first analysis of the crew management
process. However, after receiving feedback from experts from CLH, especially regarding the
internal codes used at CLH, we wanted to add more details to enhance our data model. It was an
iterative process to add more activities as information was made available to us over time. We
wanted to present the crew management process in as much detail as possible in our process
mining analysis. In the final analysis, many activities from the template were removed and
replaced by more precise ones which better represent the process flow. Therefore, the final
activity table contained additional columns such as longitude and latitude coordinates of all
European airports targeted by CLH and a duty code category, which describes the duty activity in
more detail such as proceeding flight, vacation, sick days etc. This greatly increased the amount
of detailed information we were able to provide in our analyses, as we were able to target specific
events through the duty code categories.
The cases table contains every case we examine in the process mining model. Our cases table
consists of over 1.3 million distinct cases. Every case contains all activities one crew member
completes during one duty day. Thus, the cases table has one entry for each crew member on
every duty day. Therefore, each case can be identified via its distinct Case Key, meaning the
cases table creates a list of all distinct case keys. An inner join of the cases table with the activity
table has been performed for each of the activities to achieve a consistent data set. This means we
keep only the activities related to cases in our cases table. Thereby we created a 1:N relationship
between the cases table and the activities table. The cases table contains further information, such
as the home base of a TLC, the TLC itself as well as cancelled flights. This enabled our team to
build relationships between the cases table and different tables. These relationships were
necessary for a detailed analysis of the crew management process.
8
to obtain the final data model. The data model with all relationships between tables is depicted
below
Celonis only supports 1:1 or 1:N relationships among the tables. The relationships among the
tables we inferred were verified by the CLH IT experts in a separate meeting. This ensured a
proper setup of the data model, which was imperative to achieve a meaningful analysis. Due to
the connections we realized among the tables it was possible to retrieve more detailed
information about the activities, such as the name of a hotel where a crew member stayed or from
which city a crew member took a proceeding flight to start their duty.
This data model, with all its internal relationships, is used within Celonis to visualize the process
flow and to create the process analyses. Hence, the data model is a crucial part of process mining.
During data model creation, we had to manually reload the model in Celonis whenever we
implemented changes in the SQL server. However, once the data model was stable and any
inconsistencies had been removed, we were able to automate data loads and ensure our analyses
were always up-to-date.
9
specified, one can make use of the KPI and visualize it on the Celonis front end. Therefore, the
analyses serve as dashboards to see the historic changes of that respective KPI over the recorded
time inside the data model or as a single digit on a daily basis - it depends on the setup by the data
engineer who designs the analysis. Celonis allows for arbitrarily many KPIs and analysis sheets.
Due to the similar work of the subsidiary of Lufthansa, it was possible to transfer the analysis
sheets from the subsidiary as a template and adapt it to our use cases. However, the analysis was
hosted in the cloud version of Celonis, and CLH has an on-premise Celonis version in their entire
IT landscape. Nevertheless, it is possible to transfer analyses across different versions of Celonis.
Hence, we exported a JSON file containing all the analysis descriptions from the cloud system
and imported it into our on-premise version. This also allowed us to replace column
names,column values, and color schema to fit CLH design inside the JSON file to reduce the
amount of necessary changes. From that point on, additional analysis sheets were created with
PQL statements in order to adapt the analysis sheets to the needs of our project.
10
utilization across the recorded time frame. Furthermore, it is also possible to conduct a root cause
analysis, to better understand why the activations took place, i.e. sickness of a colleague or long
delay of another flight. This creates transparency and can aid in increasing efficiency and
effectivity in the long-term.
11
CHAPTER-3
MODULES
3.1 Process Mining
Process mining applies data science to discover, validate and improve workflows. By combining
data mining and process analytics, organizations can mine log data from their information
systems to understand the performance of their processes, revealing bottlenecks and other areas
of improvement. Process mining leverages a data-driven approach to process optimization,
allowing managers to remain objective in their decision-making around resource allocation for
existing processes. Process mining focuses on different perspectives, such as control-flow,
organizational. case, and time. While much of the work around process mining focuses on the
sequence of activities---control-flow---the other perspectives also provide valuable information
for management teams.
In this module we learned how to:
Data transformation
Data analysis
Continuous monitoring
12
3.3 Processes Mining transparency
Process mining is a process management technique. It aims to discover, monitor and improve
process flows by extracting readily available knowledge from information systems event logs.
Process mining provides companies with complete visibility into how processes really work.
With these insights, companies can then identify opportunities for process optimization. Process
mining involves several steps.
The automated process discovery-extraction of process models from an event log.
&
The conformity check-monitoring deviations by comparing model and protocol.
In this module we learned how to:
Automation Process Discovery
Conformity Check
Organization Mining
13
It takes an event log as input and computes the order relation of the events contained in
the log.
14
the goal : When a company has multiple factories in different regions, there are usually
differences between the reliability of deliveries. It is fairly easy to see that they exist, but
it is more difficult to understand exactly where or why they are happening. Process
mining can be used to compare the performance of different locations, down to individual
process steps, including duration, cost, and the person performing the step. All event data
available in the systems is suitable for use. In this way, facts can be generated.
Banking and finance: In the financial sector, it is important to comply with rules and
regulations and to be able to provide evidence of this. By using the event data from the
systems. individual cases can also be visualized as a process flow.
In this module we learned how to:
Production
Financing
Telecom
15
Identify bottlenecks & process optimization opportunities Provide insights into failed process
steps Ensure end-to-end view of the entire process Monitor performance indicators in real time
Perform data eleansing Compliance analysis & gap analysis Provide continuous business process
monitoring in real time improve process model.
In this module we learned how to:
Identifications
Optimization
17
Functionality: Provides a virtual environment to model various changes in the process
and evaluate their impact on performance metrics.
Key Use: Experimenting with changes to improve processes without the risk of disrupting
current operations.
9. Process Benchmarking
Purpose: Compare the performance of specific processes against industry standards or
internal benchmarks.
Functionality: Assesses process performance in relation to competitors or internal goals,
helping identify areas where the organization is underperforming.
Key Use: Driving continuous improvement by measuring processes against best practices
or competitors' performance.
10. Process Visualization
Purpose: Create interactive, easy-to-understand visual representations of process flows.
Functionality: Generates process maps, diagrams, and dashboards to help users
intuitively explore and understand their business processes.
Key Use: Facilitating communication and understanding of complex processes among
stakeholders.
18
CHAPTER-4
TECHNOLOGY
Process Mining Technologies
Process mining applies data science to discover, validate and improve workflows. By combining
data mining and process analytics, organizations can mine log data from their information
systems to understand the performance of their processes, revealing bottlenecks and other
areas of improvement. Process mining technologies are tools and techniques used to analyze,
monitor, and improve business processes by extracting knowledge from event logs available in a
system. These technologies bridge the gap between data science and business process
management (BPM) by allowing organizations to use data from their enterprise systems (such as
ERP, CRM, and workflow systems) to visualize how processes actually run.
20
Apromore: An open-source process mining tool providing features for process discovery,
conformance checking, and process enhancement.
CHAPTER-5
APPLICATIONS
Process mining applies data science to discover, validate and improve workflows. By combining
data mining and process analytics, organizations can mine log data from their information
systems to understand the performance of their processes, revealing bottlenecks and other areas
of improvement. Process mining is beneficial for many situations in large organizations. Areas
where process mining can be actively applied include the following
Automation for RPA Success: Understanding actual processes, variations, and
opportunities is essential to ensure success in Robotic Process Automation (RPA)
projects.
Comprehensive Process Reporting: Providing complete process KPIs and dashboards to
monitor and analyze the performance of a given process.
Digital Transformation: Gaining insights into the "big picture" to understand how
businesses operate, prioritize key areas, and implement effective digital transformations.
Scaling Optimization Across Operations:Expanding optimization efforts across
multiple business units and locations, supporting process control through the analysis of
process data.
Effortless Process Capture: Capturing processes across the enterprise with minimal
human effort, ensuring comprehensive process visibility.
Bottleneck and Inefficiency Identification: Detecting bottlenecks, deviations, and
inefficient processes, which can then be improved or automated.
Continuous Improvement Monitoring: Continuously monitoring and measuring the
impact of process improvements to ensure sustained efficiency.
Simplified Compliance and Auditing: Ensuring compliance by maintaining complete
audit trails, simplifying the auditing process.
End-to-End Process Perspective: Delivering a full context and end-to-end view of
processes to support effective process improvements.
Automation Opportunity Identification: Identifying the most valuable and effective
processes suitable for automation, ensuring optimal returns from automation investments.
Business Process Discovery: Automatically generating real-time process models from
21
event logs to visualize how processes operate.
Process Optimization: Identifying inefficiencies, bottlenecks, and redundant steps to
streamline workflows and improve performance.
Compliance Monitoring: Ensuring business processes adhere to regulatory standards and
company policies by comparing actual performance with predefined models.
Learning outcomes
Gain an overall understanding of basic Process Mining concepts.
Become familiar with Mining core services and tools
Learn the architectural principles of the process Mining.
Understand and be able to explain Process Mining and compliance measures.
Understand the Process Mining budget and pricing philosophy.
22
Engage in hands-on practice to hone key skills. Learn the knowledge and skills required
to take the Process Mining Virtual internship Certified.
Conclusion
The Celonis Process Mining Virtual Internship has been an invaluable experience, providing
hands-on exposure to process mining, data analysis, and business process optimization.
Throughout the internship, I gained a deep understanding of how Celonis' powerful tools can be
used to identify inefficiencies and optimize business workflows. I learned to work with process
data, apply analytical techniques, and use Celonis' Action Engine and dashboards to recommend
data-driven improvements.
This experience honed my skills in data analytics, problem-solving, and critical thinking, as I
worked on real-life business cases, applying theoretical concepts to practical scenarios. I also
developed my ability to communicate insights clearly to stakeholders, which is essential for
driving meaningful change in business operations.
By doing this internship I learnt
The importance of Process Mining.
Tools that helps us to optimize our service costs.
Software Production and Estimation
Processing the Huge data.
And other different Services that are provided in Mining.
Overall, this internship has equipped me with valuable skills and knowledge, fostering a strong
foundation in process mining and preparing me for a career in data analysis, process
improvement, and business intelligence. The experience of working with cutting-edge technology
in a virtual setting has also enhanced my adaptability and self-management skills, which will be
instrumental in my future endeavors.
23