Process Mining: Data Science in Action
Process Mining as a Tool to Find Out What People and Organizations Really Do
prof.dr.ir. Wil van der Aalst 
www.processmining.org
How many students will take my class next year?
What is the real curriculum?
When, where and why do students deviate?
When, where and why do students drop out?
Will I graduate or drop out?
When will I graduate?
Should I take this course if I want to graduate in two years from now?
It's the process, stupid!
(not the data, not the system, and not a particular decision)
The data scientist … 
Data Science Center Eindhoven (DSC/e) 
28 research groups are involved
Data Science Center Eindhoven (DSC/e) 
7 research programs
Process Mining
www.olifantenpaadjes.nl 
Process discovery
acbaaaeabbcfbcccbcbedddetefcfcbbcdd 
Conformance checking
aadccdbbfeaeeafcfaebbbbbdcdbcdd 
Let's play
Play-Out
Play Out: A possible scenario 
a b d e g 
Case | Activity | Timestamp | Resource
432 | register travel request (a) | 18-3-2014:9.15 | John
432 | get support from local manager (b) | 18-3-2014:9.25 | Mary
432 | check budget by finance (d) | 19-3-2014:8.55 | John
432 | decide (e) | 19-3-2014:9.36 | Sue
432 | accept request (g) | 19-3-2014:9.48 | Mary
Play Out: Another scenario 
a d c e f b d e h
Play Out: Process model allows for many more scenarios
adcefcdefbdefbdeg 
adcefcdefbdefbdeg 
abdeg 
acdefcdefabcdbeehfbdeg 
abdeg 
adceahdbeh abdeg 
abdeg 
abcefbdeh 
adceg adbeh 
acbefbdeh 
acdefcdefbdeh 
adceh 
adcefcdefbdefbdeg 
adbeh 
acdefcdefabcdbeehfbdeg
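Play-Out can be sketched in a few lines of Python. The model used here is an assumption reconstructed from the scenarios above, because the actual model only appears as a picture on the slides: after a, exactly one of b or c runs in either order with d, then e follows, and f restarts the examination while g or h ends the case. The name play_out and the max_loops bound are hypothetical.

```python
import itertools

def play_out(max_loops=1):
    """Enumerate traces of the ASSUMED model with at most max_loops 'f' loops."""
    # One examination block: (b or c) in either order with d.
    blocks = ["bd", "db", "cd", "dc"]
    traces = set()
    for loops in range(max_loops + 1):
        # loops + 1 blocks are executed, joined by "decide, then restart" (ef).
        for chosen in itertools.product(blocks, repeat=loops + 1):
            body = "ef".join(chosen)
            for ending in "gh":  # the case ends with g or h
                traces.add("a" + body + "e" + ending)
    return sorted(traces)

scenarios = play_out(max_loops=1)
```

Every extra loop multiplies the number of possible scenarios by four under this assumed model, which is why even a small model allows for many traces.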
Play-In
Play In: Simple process allowing for 4 traces
Example traces: abdeg, adbeg, abdeh, adbeh
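As a minimal sketch of what Play-In starts from, the four traces of the simple process can be turned into a directly-follows relation and split into causal and parallel pairs, in the spirit of the α algorithm's first step. The function name footprint is hypothetical, and this is only the relation-building step, not a complete discovery algorithm.

```python
def footprint(log):
    """Classify activity pairs of a log by their directly-follows behaviour."""
    # (x, y) is in follows if y occurs immediately after x in some trace.
    follows = {(t[i], t[i + 1]) for t in log for i in range(len(t) - 1)}
    activities = sorted({a for t in log for a in t})
    # Causal: x is followed by y, but y is never followed by x.
    causal = {(x, y) for (x, y) in follows if (y, x) not in follows}
    # Parallel: both orders have been observed.
    parallel = {(x, y) for (x, y) in follows if (y, x) in follows}
    return activities, causal, parallel

log = ["abdeg", "adbeg", "abdeh", "adbeh"]
activities, causal, parallel = footprint(log)
```

For this log, b and d come out as parallel, while a precedes both and e follows both.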
Play In: Process allowing for more traces
adcefcdefbdefbdeg 
adcefcdefbdefbdeg 
abdeg 
acbefbdeg adbeh 
adceahdbeh acdefcdefabcdbeehfbdeg 
abdeg 
abcefbdeh 
adcegadbeh 
adcefcdefbdefbdeg acbefbadcedhefcdefbdeh abdeg 
abdeg 
adceh 
acdefcdefbdeh
No modeling needed! 
Example Process Discovery 
(Dutch housing agency, 208 cases, 5987 events) 
Example process discovery for hospital 
(627 gynecological oncology patients, 24331 events) 
Process discovery algorithms 
(small selection) 
heuristic mining 
α algorithm 
α# algorithm 
α++ algorithm 
distributed genetic mining 
language-based regions 
genetic mining
state-based regions
neural networks 
hidden Markov models 
automata-based learning 
stochastic task graphs 
conformal process graph 
mining block structures 
multi-phase mining 
partial-order based mining 
fuzzy mining 
LTL mining 
ILP mining 
ETM genetic algorithm 
Inductive Miner (infrequent)
Language-based regions
Region R = (X, Y, c) corresponding to place pR:
X = {a1, a2, c1} = transitions producing a token for pR
Y = {b1, b2, c1} = transitions consuming a token from pR
c = the initial marking of pR
Basic idea: enough tokens should be present when consuming.
A place is feasible if it can be added without disabling any of the traces in the event log.
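The feasibility condition can be checked by replaying each trace and counting tokens in the candidate place. The sketch below assumes traces are strings of single-letter activities and that a transition in both X and Y consumes before it produces; the name is_feasible and the example regions are hypothetical.

```python
def is_feasible(log, X, Y, c):
    """True if place (X, Y, c) never blocks a trace of the log."""
    for trace in log:
        tokens = c  # initial marking of the candidate place
        for activity in trace:
            if activity in Y:
                tokens -= 1       # the activity consumes a token
                if tokens < 0:    # not enough tokens: the trace is disabled
                    return False
            if activity in X:
                tokens += 1       # the activity produces a token
    return True

log = ["abdeg", "adbeg", "abdeh", "adbeh"]
ok_place = is_feasible(log, X={"a"}, Y={"b"}, c=0)        # a feeds b
bad_place = is_feasible(log, X={"a"}, Y={"b", "d"}, c=0)  # b and d compete
```

The second place is rejected because a produces only one token while both b and d need one in every trace.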
Replay
a c d e g
Replay 
a c e g
check budget (d) is missing!
Alignments: Relating reality and model 
observed trace: a c » e g
model path:     a c d e g
check budget (d) did not happen but should have according to the model
Replay 
a c h d e g
reject request (h) is impossible
Alignments: Relating reality and model 
observed trace: a c h d e g
model path:     a c » d e g
reject request (h) happened but could not happen according to the model
Any trace in reality can be related to a path in the model
a c e f d d b c e h
Any trace in reality can be related to a path in the model
observed trace: a c » e f d d b c e h
model path:     a c d e f d » b » e h
Deviations: check is missing; one check too many; cannot both be done.
Finding the alignment is an optimization problem using a cost function.
Moves relating the process model and the event log:
synchronous move
move on model only
move on log only
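The three kinds of moves and the cost function can be illustrated with a small dynamic program. This is a simplified sketch, not the replay technique used in the tools: it aligns an observed trace with one given model path, assumes unit cost for every non-synchronous move, and leaves out the search over all paths of the model. The names align and SKIP are hypothetical.

```python
SKIP = "»"

def align(trace, path):
    """Return (cost, moves) of a cheapest alignment of trace with one model path."""
    n, m = len(trace), len(path)
    # cost[i][j]: minimal cost of aligning trace[:i] with path[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i  # only moves on log
    for j in range(1, m + 1):
        cost[0][j] = j  # only moves on model
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            best = min(cost[i - 1][j], cost[i][j - 1]) + 1
            if trace[i - 1] == path[j - 1]:
                best = min(best, cost[i - 1][j - 1])  # synchronous move is free
            cost[i][j] = best
    # Walk back from the end to recover the moves as (log, model) pairs.
    moves, i, j = [], n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and trace[i - 1] == path[j - 1] and cost[i][j] == cost[i - 1][j - 1]:
            moves.append((trace[i - 1], path[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            moves.append((trace[i - 1], SKIP))  # move on log only
            i -= 1
        else:
            moves.append((SKIP, path[j - 1]))   # move on model only
            j -= 1
    moves.reverse()
    return cost[n][m], moves

missing_check = align("aceg", "acdeg")
impossible_reject = align("achdeg", "acdeg")
```

A full alignment would take the minimum of this cost over all paths the model allows, typically with a shortest-path search over the model's state space.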
Conformance Checking 
(WOZ objections Dutch municipality, 745 objections, 9583 events, f = 0.988)
Replay with timestamps 
a (9.15)  c (9.20)  d (9.35)  e (10.15)  g (11.30)
Elapsed time between connected activities (minutes): a to c 5, c to e 55, a to d 20, d to e 40, e to g 75
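The numbers on this slide follow directly from the timestamps once it is known which activities are connected in the model. The sketch below assumes that c and d both follow a and both precede e, which is inferred from the five durations rather than stated on the slide; the function name elapsed_minutes is hypothetical.

```python
from datetime import datetime

def elapsed_minutes(events, pairs):
    """Minutes between each (earlier, later) activity pair of one trace."""
    # Parse "H.MM" clock times as written on the slide (same day assumed).
    clock = {activity: datetime.strptime(stamp, "%H.%M") for activity, stamp in events}
    return {
        (x, y): int((clock[y] - clock[x]).total_seconds() // 60)
        for x, y in pairs
    }

events = [("a", "9.15"), ("c", "9.20"), ("d", "9.35"), ("e", "10.15"), ("g", "11.30")]
# Assumed connections in the model: a before c and d, c and d before e, e before g.
pairs = [("a", "c"), ("c", "e"), ("a", "d"), ("d", "e"), ("e", "g")]
delays = elapsed_minutes(events, pairs)
```

Aggregating such delays over many replayed traces gives the frequencies and waiting times projected onto the model.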
Replay with timestamps for many traces 
• frequencies of activities
• frequencies of paths
• durations of activities
• waiting times and other delays between activities
Alignments are essential! 
• conformance checking to diagnose deviations 
• squeezing reality into the model to do model-based analysis
Example: BPI Challenge 2012 
(Dutch financial institute, doi:10.4121/uuid:3926db30-f712-4394-aebc-75976070e91f) 
Work of Arya Adriansyah (Replay project)
Auditor's toolbox
Business analyst's toolbox
Software
600+ plug-ins available covering the whole process mining spectrum
 Process Mining: Data Science in Action - Wil van der Aalst, TU/e, DSC/e, HSE
Disco 
Overview: Role of process models 
Play-Out 
Play-In 
Replay
decomposed/distributed process mining
data Big data
http://www.multivu.com/assets/58095/photos/Data-is-the-new-oil-infographic-Nigel-Holmes-2012-from-The-Human-Face-of-Big-Data-original.jpg
What if?
• there are more than 1000 different activities?
• there are more than 1,000,000 cases?
• there are more than 100,000,000 events?
Decompose event log!
vertical (sets of cases) or horizontal (sets of activities)
Vertical distribution: Split cases 
sets of cases
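A vertical split can be sketched as a partition of the case identifiers, so that every case ends up with all of its events in exactly one sublog. The round-robin assignment, the function name split_cases, and the case ids 433 and 434 are assumptions for illustration; any partition of the cases would do.

```python
def split_cases(log, k):
    """Partition a log (case id -> trace) into k sublogs with disjoint cases."""
    parts = [dict() for _ in range(k)]
    # Round-robin over sorted case ids keeps the split deterministic.
    for index, case_id in enumerate(sorted(log)):
        parts[index % k][case_id] = log[case_id]
    return parts

log = {"432": "abdeg", "433": "adbeh", "434": "abdeh"}
parts = split_cases(log, 2)
```

Each sublog can then be analysed on a separate machine, since a case never spans two sublogs.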
Horizontal distribution 
sets of activities: {a,b,e,f,g} and {b,c,d,e}
Horizontal distribution: The key idea 
Each trace is projected on {a,b,e,f,g} and projected on {b,c,d,e}
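Projection simply keeps the events whose activity belongs to the chosen set and preserves their order. A minimal sketch, applied to the observed trace a,b,c,d,e,c,d,g,f that is used later in the deck; the function name project is hypothetical.

```python
def project(trace, activities):
    """Keep only the events of a trace whose activity is in the given set."""
    return [activity for activity in trace if activity in activities]

trace = ["a", "b", "c", "d", "e", "c", "d", "g", "f"]
left = project(trace, {"a", "b", "e", "f", "g"})
right = project(trace, {"b", "c", "d", "e"})
```

The two activity sets overlap in b and e, so those events appear in both projected traces.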
Two foundational ways of splitting event data: horizontal or vertical
Decomposing Conformance Checking 
See "divide and conquer" framework by Eric Verbeek.
Example of a valid decomposition 
Log can be split in the same way! 
Example of an alignment for observed trace a,b,c,d,e,c,d,g,f
Conformance checking can be decomposed!
• General result for any valid decomposition: any event log or trace perfectly fits the overall model if and only if it also fits all the individual fragments
• for any Petri net (not just WF-nets)
• for any valid decomposition
• also for data Petri nets
Wil van der Aalst, Decomposing Petri nets for process mining: A generic approach. Distributed and Parallel Databases, Volume 31, Issue 4, pp 471-507, 2013
Example 
(work with Jorge Munoz-Gama and Josep Carmona) 
prFm6 
Decomposing Process Discovery 
See "divide and conquer" framework by Eric Verbeek.
Conclusion
Don't just check the temperature … 
X-ray your processes! 
www.processmining.org 
www.win.tue.nl/ieeetfpm/
Process Mining: Data Science in Action 
https://www.coursera.org/course/procmin
First Massive Open Online Course (MOOC) on Process Mining
