SlideShare a Scribd company logo
Copyright © 2015 Splunk Inc.
Data Informed
Healthcare Delivery
Process Improvement
2
Agenda
Problem Background and Motivation
Methodology
Capabilities: Data Integration and Feature Engineering
Capabilities: Statistics, Machine Learning, and Visualization
Capabilities: Operational Integration
Product Demonstration
3
3
1. Get Ready
2. Travel by Car
3. Conference
Starts
4. Join Reception
5. Have Dinner
6. Go Home
1. Get Ready
2. Travel by Car
3. Conference
Starts
4. Give a Talk
5. Join Reception
6. Have Dinner
7. Go Home
4
4
More Cases
1. Get Ready
2. Travel by Car
3. Conference Starts
4. Join Reception
5. Have Dinner
6. Go Home
7. Travel by Car
1. Get Ready
2. Travel by Car
3. Conference Starts
4. Give a Talk
5. Join Reception
6. Have Dinner
7. Go Home
8. Travel by Car
1. Get Ready
2. Travel by Air
3. Conference Starts
4. Give a Talk
5. Join Reception
6. Have Dinner
7. Go Home
8. Pay Parking
9. Travel by Car
1. Get Ready
2. Travel byTrain
3. Conference Starts
4. Join Reception
5. Have Dinner
6. Go Home
7. Pay Parking
8. Travel by Car
5
6
Generalized Information Flow for Chronic Care
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3002133/
7
Healthcare Delivery Workflow Examples #1
Healthcare Delivery Workflow Examples #2
9
Health Events
10
Domains of Data Diversity in Health Data
1
Subjects
Persons, Sensors,
Actuators, Mobile
Devices
Information
Users
Clinical, Family, Patient
System and
Locations
Home, Hospital, ER,
Nursing Homes
Ownership and
Management
11
Virtual
Physical
Cloud
Healthcare Data is Time Oriented and Diverse
1
EHR
Systems
Web
Services
Developers
App
Support
Telecoms
Networking
Desktops
Servers
Security
Devices
Storage
Messaging
Claims
Clickstream
HIE
Patient
Portals
Healthcare Apps IT Systems and Med Devices Patient-Facing Data
Medical
Devices
CDR
Medical
Records
PHI Access
Audit Logs
HL7
Messaging
Billing
Departmental
and
Homegrown
Applications
12
Example of Events: Healthcare Services
Entity ID Event ID Properties
Timestamp Activity Resource
12345678 4798669 02/06/2015 14:00 Primary Care Visit Pete
4798670 04/06/2015 11:00 Surgery Rose
4798671 04/06/2015 12:00 Primary Care Visit Pete
4798672 04/06/2015 10:00 Chemotherapy John
4798673 04/06/2015 15:00 Evaluation Pete
98765432 5798670 03/06/2015 14:00 Primary Care Visit Pete
1
13
Example of Events: Resources (Devices/Beds)
Entity ID Event ID Properties
Timestamp
(creation)
Patient identifier Begin time End time
D1 4798669 02/06/2015
14:00
p1 14:00 15:00
4798670 04/06/2015
11:00
p2 15:15 16:30
4798671 04/06/2015
12:00
p3 16:45 17:00
4798672 04/06/2015
10:00
p4 17:15 18:00
4798673 04/06/2015
15:00
p5 18:15 19:00
D2 5798670 03/06/2015
14:00
p6 15:00 17:00
1
14
Example of Events: Medications
Entity ID Event ID Properties
Timestamp NDCNUM Days Supply
12345678 4798669 02/06/2015 14:00 378214605 30
4798670 04/06/2015 11:00 378024301 60
4798671 04/06/2015 12:00 378024301 90
4798672 04/06/2015 10:00 378024301 90
4798673 04/06/2015 15:00 228202996 90
98765432 5798670 03/06/2015 14:00 378024301 60
1
15
Example of Events: Lab
Entity ID Event ID Properties
Timestamp Key Value
12345678 4798669 02/06/2015 14:00 HbA1C 8%
4798670 04/06/2015 11:00 LDL 100 mg/dl
4798671 04/06/2015 12:00 HDL 50 mg/dl
4798672 04/06/2015 10:00 Systolic 110
4798673 04/06/2015 15:00 Diastolic 75
97865432 5798670 03/06/2015 14:00 HbA1C 9%
1
16
Process Analytics
Event
Log
Mining
Techniques
Mined
Model
1. Start
2. Get Ready
3. Travel by Train
4. Beta Event Starts
5. Visit Brewery
6. Have Dinner
7. Go Home
8. Travel by Train
1. Start
2. Get Ready
3. Travel by Train
4. Beta Event Starts
5. Give a Talk
6. Visit Brewery
7. Have Dinner
8. Go Home
9. Travel by Train
1. Start
2. Get Ready
3. Travel by Car
4. Beta Event Starts
5. Give a Talk
6. Visit Brewery
7. Have Dinner
8. Go Home
9. Pay Parking
10. Travel by Car
1. Start
2. Get Ready
3. Travel by Car
4. Conference Starts
5. Join Reception
6. Have Dinner
7. Go Home
8. Pay Parking
9. Travel by Car
10. End
Start
Get Ready
Travel by CarTravel by Train
BETA PhD Day Starts
Visit Brewery
Have Dinner
Go Home
Travel by Train Pay for Parking
Travel by Car
End
Give a Talk
Start
Get Ready
Travel by
Air
Travel by
Car
Conference Starts
Give a Talk
Join Reception
Have Dinner
Go Home
Travel by
Train
Travel by
Car
Pay
Parking
End
What is Process Analytics?
1
Analyze Observed Behavior from event data and
metadata to discover patterns, monitor compliance,
and optimize workflow.
Performance Analysis Auditing/Compliance Detect Bottlenecks, Deviations in Flow
18
Process Analytics Use Cases
ACTION ORIENTED
Redesign Care Process/Plan
Adjust Parameters
Intervene (ad-hoc problem solving)
Detect deviations and bottlenecks
Plan, Predict, Recommend
GOAL ORIENTED
Improve Process KPIs related to
Timing
Improve Process KPIs related to
Frequency
Improve KPIs related to Outcome
Improve KPIs related to Cost
Improve KPIs related to Risk
19
Traditional Process Analysis Vs. Splunk Approach
Splunk Approach
Discover actual behavior of people,
organization, and machines and relate
to modeled behavior.
Correlate millions of ad-hoc events
showing how reality is different from
perceptions, opinions, and beliefs.
Provide clue for standardization, reduce
unwarranted variations, and better
prepare to handle ad-hoc events.
Traditional Approach
Based on the opinion of the
expert.
Assemble an appropriate team
and to organize modeling
sessions.
The knowledge of the team
members is used to build an
adequate process model.
20
Monitoring and Tracking Flow of “X”
21
Listen to Flow: Patient Scheduling
22
Listen to Flow: Clinician Staffing
23
Listen to Flow: Waiting Time
24
Listen to Flow: Asset Capacity
25
Listen to Flow: Surgical Checklist
26
Listen to Flow: Care Coordination
27
Care Coordination Analytics
Integrating Patient Generated Data and Enterprise Data for Outcome
and Cost Measurements, Proactive Health Interventions, and finding
gaps, conflicts, and interaction issues
2
28
Selective Improvement Opportunities (Michael Porter)
2
http://www.isc.hbs.edu/health-care/vbhcd/Pages/default.aspx
29
Agenda
Problem Background and Motivation
Capabilities: Methodology
Capabilities: Data Integration and Feature Engineering
Capabilities: Statistics, Machine Learning, and Visualization
Operational Integration
Product Demonstration
3
Improvement Method
3
3
Systems Engineering and Healthcare Delivery
3
Call for efficiency, consistency, and safety
3
Towards a Learning Healthcare System
3
A Connected Healthcare System
36
36
Discovery
Discover,
Diagnose,
Create
Knowledge
1
Compliance
Detect
Monitor
Compare
2 3
Enhancement
Forecast
Predict
Recommend
Analytics Methods
37 3
Real World Business
Questions, Improvement
Opportunities
Data Collection Data Preparation
Exploratory Visualization, Statistics and Machine
Learning
Communication, Visualization
Reports, Findings
Evaluation
Post Mortem: Knowledge Discovery Workflow
Decision Support Product
Think of the Process/Walk
in the Patients’ shoes
Analytics Engine
Pre Mortem: Monitoring, Detection , and Predictions
Health
Management
System
Gap,
Anomaly, and
Conflict
detections
Dashboards
/Alerts
Predictive
Modeling/M
odel
Maintenance
Data Warehouse
Patient Clinical
Situations/Curren
t Events
Standard
Reports/Que
ries
Data Archival
Rules System
39
Agenda
Problem Background and Motivation
Capabilities: Methodology
Capabilities: Data Integration and Feature Engineering
Capabilities: Statistics, Machine Learning, and Visualization
Operational Integration
Product Demonstration
40
Required Capabilities
4
Schema-less
approach/ late
binding to schema
Dynamic
“normalization” of
data
Agile analytics
and reporting
Scalable search
and analytics
Seamless
operational
integration
Process Data Mining Core Engine
41
Computational Framework
Integrate Untapped Data: Any Source, Type, Volume, Velocity
Healthcare
Apps Data/HL7
Event Logs
Healthcare Apps Audit Logs
Medical Device (PACS)/RFID
Metadata (logs)
Patient Generated Data
Hadoop Clusters Relational Database No SQL Data StoreSplunk Clusters
Explore Visualize Dashboard ShareAnalyze Monitor
and alert
External
Applications
Integration
(SDK, REST API)
42
Application Development Platform
4
43
Getting Data In
4
Universal and
Heavy
Forwarders
Modular Input
Stream, HTTP
Event Collector
RDBMS, Hadoop
44
Data Integration
Data Types Example Vendors
Application Log, Audit Logs from E.H.R. Cerner, Epic, McKesson, GE
Enterprise Health Data Cerner, Epic, McKesson, GE, HL7 interface
engines (InterSystem, Cloverleaf, Orion),
Health Information Exchanges (DB
Motion, Aetna, Optum)
Device, Mobile, Sensors Qualcomm, Medtronic, Philips, GE, Roche,
Apple HealthKit, Google Fit
45
Data Integration: Ingest any text data
4
MSH|^~&|EPIC|MGH||MGH|20150324190937|OHEDSCRIBE|ADT^A08|725
467|T|2.3|||||||||
………
PID|1||12345^^^EPI^MR||LUCUS^STEPHANEY||19751225|M|||^^^^^US^P
|||||||6100215419|999-99-9999|||||||||||N||
........
<recordTarget>
<patientRole>
<id extension="12345" root="PlaceholderOrganization" />
<addr use="HP”>
<streetAddressLine>180 Fake Road</streetAddressLine>
<city>Providence</city>
<state>RI</state>
<postalCode>02912</postalCode>
<country>US</country>
</addr>
<telecom use="WP" value="tel:+1-401-867-7949" />
<patient>
<name>
<given>Stephaney</given>
<family>Lucus</family>
</name>
<administrativeGenderCode code="F" codeSystem="2.16.840.1.113883.3.560.100.2"
displayName="Male" />
{
"resourceType": "Patient",
"identifier": [
{
"system": "urn:oid:1.2.36.146.595.217.0.1",
"value": "12345",
"period": {
"start": "2001-05-06"
}
}
],
"name": [
{
"use": "official",
"family": [”Lucus"],
"given": [”Stephaney”]
},
],
"gender": {
"coding": [
{
"system": "http://hl7.org/fhir/v3/AdministrativeGender",
"code": "M",
"display": "Male"
}
]
},
"birthDate": "1974-12-25",
"address": [
{
"use": "home",
"line": ["534 Erewhon St"],
"city": "PleasantVille",
"state": "Vic",
"zip": "3999"
}
]
}
Patient
identifier
name
telecom
gender
birthDate
deceased
address
maritalStatus
….
active
46
Tagging for “Normalization”
4
Patient
identifier
name
telecom
gender
birthDate
deceased
address
maritalStatus
….
active
47
 Search events with tag in any field
 Search events with tag in a specific field
 Search events with tag using wildcards
Adding Metadata Knowledge: Search with Tags
4
Tag=GLYCEMIC, ASTHMA
tag::DX=diabetes type 2
Tag=diabetes*
1
2
3
Aliases
4
 Normalize field labels to simplify search and correlation
 Apply multiple aliases to a single field
 Example: Username | cs_username | User  user
 Example: pid | patient | patient_id  PATIENTID
 Aliases appear alongside original fields
Event Tagging
4
 Classify and group common events
 Capture and share knowledge
 Based on search
 Use in combination with fields and tags to define
event topography
1) Regular Expression
2) Natural Language Processing using SDK and REST
API
5
Feature Extraction from Texts
51
 Defines least common denominator for a
data domain
 Standard method to parse, categorize,
normalize data
 Set of field names and tags by domain
 Packaged as a Data Models in a Splunk App
Common Information Model (CIM)
5
52
Agenda
Problem Background and Motivation
Capabilities: Methodology
Capabilities: Data Integration and Feature Engineering
Capabilities: Statistics, Machine Learning, and Visualization
Operational Integration
Product Demonstration
Sparkline: Visualize frequency distributions
Identify co-occurring spikes
Sankey Diagram: Visualize flow and frequency
Graphs: Visualize Network and Relations
Parallel Coordinates: Visualize Multivariates
Tree-Map
Timechart Swimlanes
HL7 Interface Process Tracking
60 6
Process Flow: Patient
61
Process Flow: Asset
6
Improving Healthcare Operations Using Process Data Mining
63
Machine Learning Overview
6
ML Tools & Process Support
ML Libraries (3rd Party, open source)
ML Libraries (Core) +
Metafor AD
Customer AppsITSI, ES, SBA, etc.
ML Toolkit &
Showcase
ML SPL
Core
App
Supervised & Unsupervised Machine Learning
•SupervisedLearning:generalizingfromlabeleddata
– Classification
– Prediction
– Estimation
– Regression
•UnsupervisedLearning:generalizingfromunlabeleddata
– Clustering
– Association-RuleLearning
– Summarization
65
MLTS: ML SPL
ML-SPL
– Modular interface to ML process and libraries
Miniconda
– Open source distribution of (almost) entire python data science ecosystem
– Distributing on Splunkbase as Python for Scientific Computing
6
| transform <script> <params>
| fit <script> as modelname by <feature> <params>
| apply modelname
ML-SPL
ML Libraries (In-Core,
Custom, or 3rd Party)
ITSI, ES, UBA, Core,
Customer Apps, etc.
66
Trending Anomaly Detection
6
Trending AD algorithm - Detects anomalies based on deviation from past behavior
67
Cohesive Anomaly Detection
6
Cohesive AD algorithm - Detects anomalies based on deviation from peer behavior
Clustering
Association Rule
6
70
Agenda
Problem Background and Motivation
Capabilities: Methodology
Capabilities: Data Integration and Feature Engineering
Capabilities: Statistics, Machine Learning, and Visualization
Operational Integration
Product Demonstration
KPI Management
7
72
Actionable Alerting
73
73
Business Value of Process Analytics
Save lives,
reduce error,
optimize time
1
Reduce cost,
increase
efficiency
2 3
Improve patient
outcome,
experience, and
engagement
74
Agenda
Problem Background and Motivation
Capabilities: Methodology
Capabilities: Data Integration and Feature Engineering
Capabilities: Statistics, Machine Learning, and Visualization
Operational Integration
Product Demonstration
75
Data Science Resources & more information
• Machine Learning Primer:
https://homes.cs.washington.edu/~pedrod/papers/cacm12.pdf
• Process Mining Research: http://www.processmining.org/
• Doing Data Science: http://www.amazon.com/Doing-Data-Science-Straight-
Frontline/dp/1449358659/ref=sr_1_1?ie=UTF8&qid=1436898138&sr=8-
1&keywords=doing+data+science
• Data Visualization: http://shop.oreilly.com/product/0636920026938.do
• Data Science Resources: http://www.partiallyderivative.com/resources/
76
Splunk Resources
• Search Tutorial:
http://docs.splunk.com/Documentation/Splunk/latest/SearchTutorial/Welcom
etotheSearchTutorial
• Training Videos: http://www.splunk.com/view/educa3on-videos/SP-CAAAGB6
• Splunk Docs: http://docs.splunk.com/Documentation/
• Splunkbase Apps & Answers: http://apps.splunk.com/
http://answers.splunk.com/
• Splunk Wiki: http://wiki.splunk.com/
• Developers: http://dev.splunk.com/
• Exploring Splunk Book: http://www.splunk.com/goto/book
77
 Interactive, cut/paste examples from popular source repositories:
D3, GitHub, jQuery
 Splunk 6.x Dashboard Examples App
https://splunkbase.splunk.com/app/1603
 Custom SimpleXML Extensions App
https://splunkbase.splunk.com/app/1772
 Splunk Web Framework Toolkit App
https://splunkbase.splunk.com/app/1613
 Machine Learning Toolkit:
https://splunkbase.splunk.com/app/2890/
Example Advanced Visualizations and
Machine Learning Toolkit
7
78
 http://dev.splunk.com/view/python-sdk/SP-
CAAAEU2
 http://dev.splunk.com/sdks
 http://dev.splunk.com/restapi
REST API, SDKs, and Custom Search Command
7
Thank You
Adrish Sannyasi
Healthcare Solution Architect
Splunk, asannyasi@splunk.com

More Related Content

PPTX
Improving Healthcare Operations Using Process Data Mining
PPTX
Develop Your Analysts and They'll Pay for Themselves
PPTX
How To Avoid The 3 Most Common Healthcare Analytics Pitfalls And Related Inef...
PPTX
There Is A 90% Probability That Your Son Is Pregnant: Predicting the Future ...
PPTX
Three Approaches to Predictive Analytics in Healthcare
PPTX
Predictive analytics in health insurance
PPTX
Predictive Analytics Usage and Implications in Healthcare
PPTX
Data Mining in Healthcare: How Health Systems Can Improve Quality and Reduce...
Improving Healthcare Operations Using Process Data Mining
Develop Your Analysts and They'll Pay for Themselves
How To Avoid The 3 Most Common Healthcare Analytics Pitfalls And Related Inef...
There Is A 90% Probability That Your Son Is Pregnant: Predicting the Future ...
Three Approaches to Predictive Analytics in Healthcare
Predictive analytics in health insurance
Predictive Analytics Usage and Implications in Healthcare
Data Mining in Healthcare: How Health Systems Can Improve Quality and Reduce...

What's hot (20)

PDF
Overcoming Big Data Bottlenecks in Healthcare - a Predictive Analytics Case S...
PPTX
Healthcare 2.0: The Age of Analytics
PPTX
The Case for Healthcare Data Literacy: It's Not About Big Data
PPTX
Data Driven Healthcare That Work: A Physician Group Perspective
PPTX
Healthcare Interoperability: New Tactics and Technology
PPTX
Machine Learning Misconceptions
PPTX
The Top Six Early Detection and Action Must-Haves for Improving Outcomes
PPTX
Turn Research Into Care Delivery Improvements Using the Research Analytics Ad...
PPTX
How to Choose the Best Healthcare Analytics Software Solution in a Crowded Ma...
PPTX
The Changing Role of Healthcare Data Analysts
PPTX
Two-Midnight Rule: Ready For the Clock to Strike 12?
PPTX
The Why And How Of Machine Learning And AI: An Implementation Guide For Healt...
PPTX
Getting The Most Out of Your Data Analyst - HAS Session 9
PPT
Predictive Analytics in Healthcare
PPTX
Cloud-Based Open-Platform Data Solutions: The Best Way to Meet Today’s Growin...
PPTX
Looking Back on Clinical Decision Support and Data Warehousing
PPTX
How to Use Text Analytics in Healthcare to Improve Outcomes: Why You Need Mor...
PDF
Secondary Use of Healthcare Data for Translational Research
PPTX
Why Most Analytic Applications Will Never Be Able to Significantly Improve He...
PPTX
From Installed to Stalled: Why Sustaining Outcomes Improvement Requires More ...
Overcoming Big Data Bottlenecks in Healthcare - a Predictive Analytics Case S...
Healthcare 2.0: The Age of Analytics
The Case for Healthcare Data Literacy: It's Not About Big Data
Data Driven Healthcare That Work: A Physician Group Perspective
Healthcare Interoperability: New Tactics and Technology
Machine Learning Misconceptions
The Top Six Early Detection and Action Must-Haves for Improving Outcomes
Turn Research Into Care Delivery Improvements Using the Research Analytics Ad...
How to Choose the Best Healthcare Analytics Software Solution in a Crowded Ma...
The Changing Role of Healthcare Data Analysts
Two-Midnight Rule: Ready For the Clock to Strike 12?
The Why And How Of Machine Learning And AI: An Implementation Guide For Healt...
Getting The Most Out of Your Data Analyst - HAS Session 9
Predictive Analytics in Healthcare
Cloud-Based Open-Platform Data Solutions: The Best Way to Meet Today’s Growin...
Looking Back on Clinical Decision Support and Data Warehousing
How to Use Text Analytics in Healthcare to Improve Outcomes: Why You Need Mor...
Secondary Use of Healthcare Data for Translational Research
Why Most Analytic Applications Will Never Be Able to Significantly Improve He...
From Installed to Stalled: Why Sustaining Outcomes Improvement Requires More ...
Ad

Viewers also liked (20)

PDF
MINING HEALTH EXAMINATION RECORDS A GRAPH-BASED APPROACH
PDF
Clinic Workflow Diagram
PPTX
Splunking HL7 Healthcare Data for Business Value
PPTX
PMI Presentation - Integration Projects.1
PDF
Osaoptimoinnista kokonaisuuksiin: tiedonhallinta ja tietojärjestelmät
PDF
Agile methodologies, Startup Technology Partner,Lean,Minimum Viable Product
PPTX
Digital Menu App
PDF
Using splunk6.2 labs
PPTX
Power of SPL
PPTX
Getting started with Splunk
PDF
SplunkLive! Washington DC May 2013 - Search Language Beginner
DOCX
Security Hands-On - Splunklive! Houston
PDF
CVPR2010: Semi-supervised Learning in Vision: Part 3: Algorithms and Applicat...
PPTX
Data Mining A Healthcare Database
PDF
Getting Started with Splunk Hands-on
PPTX
Customer Presentation
PDF
Field Extractions: Making Regex Your Buddy
PPT
Semi-supervised Learning
PDF
Machine Data 101
PDF
Splunk Enterprise for IT Troubleshooting Hands-On
MINING HEALTH EXAMINATION RECORDS A GRAPH-BASED APPROACH
Clinic Workflow Diagram
Splunking HL7 Healthcare Data for Business Value
PMI Presentation - Integration Projects.1
Osaoptimoinnista kokonaisuuksiin: tiedonhallinta ja tietojärjestelmät
Agile methodologies, Startup Technology Partner,Lean,Minimum Viable Product
Digital Menu App
Using splunk6.2 labs
Power of SPL
Getting started with Splunk
SplunkLive! Washington DC May 2013 - Search Language Beginner
Security Hands-On - Splunklive! Houston
CVPR2010: Semi-supervised Learning in Vision: Part 3: Algorithms and Applicat...
Data Mining A Healthcare Database
Getting Started with Splunk Hands-on
Customer Presentation
Field Extractions: Making Regex Your Buddy
Semi-supervised Learning
Machine Data 101
Splunk Enterprise for IT Troubleshooting Hands-On
Ad

Similar to Improving Healthcare Operations Using Process Data Mining (20)

PPTX
Analytics from data to better decision
PPTX
Healthcare Delivery Reimagined: Patient Flow and Care Coordination Analytics
PPTX
Impact of Data Science
PDF
MPCA-SAS-innovators-flight-plan-ai.pdf
PDF
Data driven decision making
DOCX
Title of Your Research Proposal Student Name Walde
PDF
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
PDF
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
PPTX
Jisc learninganalytics dec2016
PPTX
Jisc learninganalytics nov2016
PDF
Intelligent Clinical Supply Forecasting and Simulations through IRT-March02-2...
PPTX
Introduction to Data Analytics and Its Importance
PPTX
TA reporting metrics and analytics
PPTX
User Group Kickoff and New Product Roadmap - HAS Session 12
PPT
Managing The Business Risk Of Fraud
PDF
The galaxy of data analysis - School of ai Port Harcourt meetup
PPT
Super factory
PDF
Machine Learning for Business - Eight Best Practices for Getting Started
PDF
10.09.14 glassdoor webinar_all_slides
PDF
Sage - Clinical Laboratory Management System
Analytics from data to better decision
Healthcare Delivery Reimagined: Patient Flow and Care Coordination Analytics
Impact of Data Science
MPCA-SAS-innovators-flight-plan-ai.pdf
Data driven decision making
Title of Your Research Proposal Student Name Walde
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Ibm ofa ottawa_.gov_agencies_and_next_generation_analytics_tim_paydospdf
Jisc learninganalytics dec2016
Jisc learninganalytics nov2016
Intelligent Clinical Supply Forecasting and Simulations through IRT-March02-2...
Introduction to Data Analytics and Its Importance
TA reporting metrics and analytics
User Group Kickoff and New Product Roadmap - HAS Session 12
Managing The Business Risk Of Fraud
The galaxy of data analysis - School of ai Port Harcourt meetup
Super factory
Machine Learning for Business - Eight Best Practices for Getting Started
10.09.14 glassdoor webinar_all_slides
Sage - Clinical Laboratory Management System

More from Splunk (20)

PDF
Splunk Leadership Forum Wien - 20.05.2025
PDF
Splunk Security Update | Public Sector Summit Germany 2025
PDF
Building Resilience with Energy Management for the Public Sector
PDF
IT-Lagebild: Observability for Resilience (SVA)
PDF
Nach dem SOC-Aufbau ist vor der Automatisierung (OFD Baden-Württemberg)
PDF
Monitoring einer Sicheren Inter-Netzwerk Architektur (SINA)
PDF
Praktische Erfahrungen mit dem Attack Analyser (gematik)
PDF
Cisco XDR & Splunk SIEM - stronger together (DATAGROUP Cyber Security)
PDF
Security - Mit Sicherheit zum Erfolg (Telekom)
PDF
One Cisco - Splunk Public Sector Summit Germany April 2025
PDF
.conf Go 2023 - Data analysis as a routine
PDF
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
PDF
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
PDF
.conf Go 2023 - Raiffeisen Bank International
PDF
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
PDF
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
PDF
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
PDF
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
PDF
.conf go 2023 - De NOC a CSIRT (Cellnex)
PDF
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)
Splunk Leadership Forum Wien - 20.05.2025
Splunk Security Update | Public Sector Summit Germany 2025
Building Resilience with Energy Management for the Public Sector
IT-Lagebild: Observability for Resilience (SVA)
Nach dem SOC-Aufbau ist vor der Automatisierung (OFD Baden-Württemberg)
Monitoring einer Sicheren Inter-Netzwerk Architektur (SINA)
Praktische Erfahrungen mit dem Attack Analyser (gematik)
Cisco XDR & Splunk SIEM - stronger together (DATAGROUP Cyber Security)
Security - Mit Sicherheit zum Erfolg (Telekom)
One Cisco - Splunk Public Sector Summit Germany April 2025
.conf Go 2023 - Data analysis as a routine
.conf Go 2023 - How KPN drives Customer Satisfaction on IPTV
.conf Go 2023 - Navegando la normativa SOX (Telefónica)
.conf Go 2023 - Raiffeisen Bank International
.conf Go 2023 - På liv og død Om sikkerhetsarbeid i Norsk helsenett
.conf Go 2023 - Many roads lead to Rome - this was our journey (Julius Bär)
.conf Go 2023 - Das passende Rezept für die digitale (Security) Revolution zu...
.conf go 2023 - Cyber Resilienz – Herausforderungen und Ansatz für Energiever...
.conf go 2023 - De NOC a CSIRT (Cellnex)
conf go 2023 - El camino hacia la ciberseguridad (ABANCA)

Recently uploaded (20)

PPTX
First Aid and Basic Life Support Training.pptx
PPT
KULIAH UG WANITA Prof Endang 121110 (1).ppt
PDF
Selvita_Development-Strategy-2022-2025.pdf
PPTX
Current Treatment Of Heart Failure By Dr Masood Ahmed
PPTX
PEDIATRIC OSCE, MBBS, by Dr. Sangit Chhantyal(IOM)..pptx
PPT
Recent advances in Diagnosis of Autoimmune Disorders
PPTX
Galactosemia pathophysiology, clinical features, investigation and treatment ...
PDF
MINERAL & VITAMIN CHARTS fggfdtujhfd.pdf
PDF
Structure Composition and Mechanical Properties of Australian O.pdf
PDF
Pharmacology slides archer and nclex quest
PDF
NUTRITION THROUGHOUT THE LIFE CYCLE CHILDHOOD -AGEING
PPTX
General Pharmacology by Nandini Ratne, Nagpur College of Pharmacy, Hingna Roa...
PPTX
ABG advance Arterial Blood Gases Analysis
PPTX
Infection prevention and control for medical students
PDF
DAY-6. Summer class. Ppt. Cultural Nursing
PPTX
Immunity....(shweta).................pptx
PPTX
1. Drug Distribution System.pptt b pharmacy
PPT
Adrenergic drugs (sympathomimetics ).ppt
PPT
Parental-Carer-mental-illness-and-Potential-impact-on-Dependant-Children.ppt
PDF
2E-Learning-Together...PICS-PCISF con.pdf
First Aid and Basic Life Support Training.pptx
KULIAH UG WANITA Prof Endang 121110 (1).ppt
Selvita_Development-Strategy-2022-2025.pdf
Current Treatment Of Heart Failure By Dr Masood Ahmed
PEDIATRIC OSCE, MBBS, by Dr. Sangit Chhantyal(IOM)..pptx
Recent advances in Diagnosis of Autoimmune Disorders
Galactosemia pathophysiology, clinical features, investigation and treatment ...
MINERAL & VITAMIN CHARTS fggfdtujhfd.pdf
Structure Composition and Mechanical Properties of Australian O.pdf
Pharmacology slides archer and nclex quest
NUTRITION THROUGHOUT THE LIFE CYCLE CHILDHOOD -AGEING
General Pharmacology by Nandini Ratne, Nagpur College of Pharmacy, Hingna Roa...
ABG advance Arterial Blood Gases Analysis
Infection prevention and control for medical students
DAY-6. Summer class. Ppt. Cultural Nursing
Immunity....(shweta).................pptx
1. Drug Distribution System.pptt b pharmacy
Adrenergic drugs (sympathomimetics ).ppt
Parental-Carer-mental-illness-and-Potential-impact-on-Dependant-Children.ppt
2E-Learning-Together...PICS-PCISF con.pdf

Improving Healthcare Operations Using Process Data Mining

  • 1. Copyright © 2015 Splunk Inc. Data Informed Healthcare Delivery Process Improvement
  • 2. 2 Agenda Problem Background and Motivation Methodology Capabilities: Data Integration and Feature Engineering Capabilities: Statistics, Machine Learning, and Visualization Capabilities: Operational Integration Product Demonstration
  • 3. 3 3 1. Get Ready 2. Travel by Car 3. Conference Starts 4. Join Reception 5. Have Dinner 6. Go Home 1. Get Ready 2. Travel by Car 3. Conference Starts 4. Give a Talk 5. Join Reception 6. Have Dinner 7. Go Home
  • 4. 4 4 More Cases 1. Get Ready 2. Travel by Car 3. Conference Starts 4. Join Reception 5. Have Dinner 6. Go Home 7. Travel by Car 1. Get Ready 2. Travel by Car 3. Conference Starts 4. Give a Talk 5. Join Reception 6. Have Dinner 7. Go Home 8. Travel by Car 1. Get Ready 2. Travel by Air 3. Conference Starts 4. Give a Talk 5. Join Reception 6. Have Dinner 7. Go Home 8. Pay Parking 9. Travel by Car 1. Get Ready 2. Travel byTrain 3. Conference Starts 4. Join Reception 5. Have Dinner 6. Go Home 7. Pay Parking 8. Travel by Car
  • 5. 5
  • 6. 6 Generalized Information Flow for Chronic Care http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3002133/
  • 10. 10 Domains of Data Diversity in Health Data 1 Subjects Persons, Sensors, Actuators, Mobile Devices Information Users Clinical, Family, Patient System and Locations Home, Hospital, ER, Nursing Homes Ownership and Management
  • 11. 11 Virtual Physical Cloud Healthcare Data is Time Oriented and Diverse 1 EHR Systems Web Services Developers App Support Telecoms Networking Desktops Servers Security Devices Storage Messaging Claims Clickstream HIE Patient Portals Healthcare Apps IT Systems and Med Devices Patient-Facing Data Medical Devices CDR Medical Records PHI Access Audit Logs HL7 Messaging Billing Departmental and Homegrown Applications
  • 12. 12 Example of Events: Healthcare Services Entity ID Event ID Properties Timestamp Activity Resource 12345678 4798669 02/06/2015 14:00 Primary Care Visit Pete 4798670 04/06/2015 11:00 Surgery Rose 4798671 04/06/2015 12:00 Primary Care Visit Pete 4798672 04/06/2015 10:00 Chemotherapy John 4798673 04/06/2015 15:00 Evaluation Pete 98765432 5798670 03/06/2015 14:00 Primary Care Visit Pete 1
  • 13. 13 Example of Events: Resources (Devices/Beds) Entity ID Event ID Properties Timestamp (creation) Patient identifier Begin time End time D1 4798669 02/06/2015 14:00 p1 14:00 15:00 4798670 04/06/2015 11:00 p2 15:15 16:30 4798671 04/06/2015 12:00 p3 16:45 17:00 4798672 04/06/2015 10:00 p4 17:15 18:00 4798673 04/06/2015 15:00 p5 18:15 19:00 D2 5798670 03/06/2015 14:00 p6 15:00 17:00 1
  • 14. 14 Example of Events: Medications Entity ID Event ID Properties Timestamp NDCNUM Days Supply 12345678 4798669 02/06/2015 14:00 378214605 30 4798670 04/06/2015 11:00 378024301 60 4798671 04/06/2015 12:00 378024301 90 4798672 04/06/2015 10:00 378024301 90 4798673 04/06/2015 15:00 228202996 90 98765432 5798670 03/06/2015 14:00 378024301 60 1
  • 15. 15 Example of Events: Lab Entity ID Event ID Properties Timestamp Key Value 12345678 4798669 02/06/2015 14:00 HbA1C 8% 4798670 04/06/2015 11:00 LDL 100 mg/dl 4798671 04/06/2015 12:00 HDL 50 mg/dl 4798672 04/06/2015 10:00 Systolic 110 4798673 04/06/2015 15:00 Diastolic 75 97865432 5798670 03/06/2015 14:00 HbA1C 9% 1
  • 16. 16 Process Analytics Event Log Mining Techniques Mined Model 1. Start 2. Get Ready 3. Travel by Train 4. Beta Event Starts 5. Visit Brewery 6. Have Dinner 7. Go Home 8. Travel by Train 1. Start 2. Get Ready 3. Travel by Train 4. Beta Event Starts 5. Give a Talk 6. Visit Brewery 7. Have Dinner 8. Go Home 9. Travel by Train 1. Start 2. Get Ready 3. Travel by Car 4. Beta Event Starts 5. Give a Talk 6. Visit Brewery 7. Have Dinner 8. Go Home 9. Pay Parking 10. Travel by Car 1. Start 2. Get Ready 3. Travel by Car 4. Conference Starts 5. Join Reception 6. Have Dinner 7. Go Home 8. Pay Parking 9. Travel by Car 10. End Start Get Ready Travel by CarTravel by Train BETA PhD Day Starts Visit Brewery Have Dinner Go Home Travel by Train Pay for Parking Travel by Car End Give a Talk Start Get Ready Travel by Air Travel by Car Conference Starts Give a Talk Join Reception Have Dinner Go Home Travel by Train Travel by Car Pay Parking End
  • 17. What is Process Analytics? 1 Analyze Observed Behavior from event data and metadata to discover patterns, monitor compliance, and optimize workflow. Performance Analysis Auditing/Compliance Detect Bottlenecks, Deviations in Flow
  • 18. 18 Process Analytics Use Cases ACTION ORIENTED Redesign Care Process/Plan Adjust Parameters Intervene (ad-hoc problem solving) Detect deviations and bottlenecks Plan, Predict, Recommend GOAL ORIENTED Improve Process KPIs related to Timing Improve Process KPIs related to Frequency Improve KPIs related to Outcome Improve KPIs related to Cost Improve KPIs related to Risk
  • 19. 19 Traditional Process Analysis Vs. Splunk Approach Splunk Approach Discover actual behavior of people, organization, and machines and relate to modeled behavior. Correlate millions of ad-hoc events showing how reality is different from perceptions, opinions, and beliefs. Provide clue for standardization, reduce unwarranted variations, and better prepare to handle ad-hoc events. Traditional Approach Based on the opinion of the expert. Assemble an appropriate team and to organize modeling sessions. The knowledge of the team members is used to build an adequate process model.
  • 20. 20 Monitoring and Tracking Flow of “X”
  • 21. 21 Listen to Flow: Patient Scheduling
  • 22. 22 Listen to Flow: Clinician Staffing
  • 23. 23 Listen to Flow: Waiting Time
  • 24. 24 Listen to Flow: Asset Capacity
  • 25. 25 Listen to Flow: Surgical Checklist
  • 26. 26 Listen to Flow: Care Coordination
  • 27. 27 Care Coordination Analytics Integrating Patient Generated Data and Enterprise Data for Outcome and Cost Measurements, Proactive Health Interventions, and finding gaps, conflicts, and interaction issues 2
  • 28. 28 Selective Improvement Opportunities (Michael Porter) 2 http://www.isc.hbs.edu/health-care/vbhcd/Pages/default.aspx
  • 29. 29 Agenda Problem Background and Motivation Capabilities: Methodology Capabilities: Data Integration and Feature Engineering Capabilities: Statistics, Machine Learning, and Visualization Operational Integration Product Demonstration
  • 31. 3
  • 32. 3 Systems Engineering and Healthcare Delivery
  • 33. 3 Call for efficiency, consistency, and safety
  • 34. 3 Towards a Learning Healthcare System
  • 37. 37 3 Real World Business Questions, Improvement Opportunities Data Collection Data Preparation Exploratory Visualization, Statistics and Machine Learning Communication, Visualization Reports, Findings Evaluation Post Mortem: Knowledge Discovery Workflow Decision Support Product Think of the Process/Walk in the Patients’ shoes
  • 38. Analytics Engine Pre Mortem: Monitoring, Detection , and Predictions Health Management System Gap, Anomaly, and Conflict detections Dashboards /Alerts Predictive Modeling/M odel Maintenance Data Warehouse Patient Clinical Situations/Curren t Events Standard Reports/Que ries Data Archival Rules System
  • 39. 39 Agenda Problem Background and Motivation Capabilities: Methodology Capabilities: Data Integration and Feature Engineering Capabilities: Statistics, Machine Learning, and Visualization Operational Integration Product Demonstration
  • 40. 40 Required Capabilities 4 Schema-less approach/ late binding to schema Dynamic “normalization” of data Agile analytics and reporting Scalable search and analytics Seamless operational integration
  • 41. Process Data Mining Core Engine 41 Computational Framework Integrate Untapped Data: Any Source, Type, Volume, Velocity Healthcare Apps Data/HL7 Event Logs Healthcare Apps Audit Logs Medical Device (PACS)/RFID Metadata (logs) Patient Generated Data Hadoop Clusters Relational Database No SQL Data StoreSplunk Clusters Explore Visualize Dashboard ShareAnalyze Monitor and alert External Applications Integration (SDK, REST API)
  • 43. 43 Getting Data In 4 Universal and Heavy Forwarders Modular Input Stream, HTTP Event Collector RDBMS, Hadoop
  • 44. 44 Data Integration Data Types Example Vendors Application Log, Audit Logs from E.H.R. Cerner, Epic, McKesson, GE Enterprise Health Data Cerner, Epic, McKesson, GE, HL7 interface engines (InterSystem, Cloverleaf, Orion), Health Information Exchanges (DB Motion, Aetna, Optum) Device, Mobile, Sensors Qualcomm, Medtronic, Philips, GE, Roche, Apple HealthKit, Google Fit
  • 45. 45 Data Integration: Ingest any text data 4 MSH|^~&|EPIC|MGH||MGH|20150324190937|OHEDSCRIBE|ADT^A08|725 467|T|2.3||||||||| ……… PID|1||12345^^^EPI^MR||LUCUS^STEPHANEY||19751225|M|||^^^^^US^P |||||||6100215419|999-99-9999|||||||||||N|| ........ <recordTarget> <patientRole> <id extension="12345" root="PlaceholderOrganization" /> <addr use="HP”> <streetAddressLine>180 Fake Road</streetAddressLine> <city>Providence</city> <state>RI</state> <postalCode>02912</postalCode> <country>US</country> </addr> <telecom use="WP" value="tel:+1-401-867-7949" /> <patient> <name> <given>Stephaney</given> <family>Lucus</family> </name> <administrativeGenderCode code="F" codeSystem="2.16.840.1.113883.3.560.100.2" displayName="Male" /> { "resourceType": "Patient", "identifier": [ { "system": "urn:oid:1.2.36.146.595.217.0.1", "value": "12345", "period": { "start": "2001-05-06" } } ], "name": [ { "use": "official", "family": [”Lucus"], "given": [”Stephaney”] }, ], "gender": { "coding": [ { "system": "http://hl7.org/fhir/v3/AdministrativeGender", "code": "M", "display": "Male" } ] }, "birthDate": "1974-12-25", "address": [ { "use": "home", "line": ["534 Erewhon St"], "city": "PleasantVille", "state": "Vic", "zip": "3999" } ] } Patient identifier name telecom gender birthDate deceased address maritalStatus …. active
  • 47. 47  Search events with tag in any field  Search events with tag in a specific field  Search events with tag using wildcards Adding Metadata Knowledge: Search with Tags 4 Tag=GLYCEMIC, ASTHMA tag::DX=diabetes type 2 Tag=diabetes* 1 2 3
  • 48. Aliases 4  Normalize field labels to simplify search and correlation  Apply multiple aliases to a single field  Example: Username | cs_username | User  user  Example: pid | patient | patient_id  PATIENTID  Aliases appear alongside original fields
  • 49. Event Tagging 4  Classify and group common events  Capture and share knowledge  Based on search  Use in combination with fields and tags to define event topography
  • 50. 1) Regular Expression 2) Natural Language Processing using SDK and REST API 5 Feature Extraction from Texts
  • 51. 51  Defines least common denominator for a data domain  Standard method to parse, categorize, normalize data  Set of field names and tags by domain  Packaged as a Data Models in a Splunk App Common Information Model (CIM) 5
  • 52. 52 Agenda Problem Background and Motivation Capabilities: Methodology Capabilities: Data Integration and Feature Engineering Capabilities: Statistics, Machine Learning, and Visualization Operational Integration Product Demonstration
  • 53. Sparkline: Visualize frequency distributions Identify co-occurring spikes
  • 54. Sankey Diagram: Visualize flow and frequency
  • 55. Graphs: Visualize Network and Relations
  • 63. 63 Machine Learning Overview 6 ML Tools & Process Support ML Libraries (3rd Party, open source) ML Libraries (Core) + Metafor AD Customer AppsITSI, ES, SBA, etc. ML Toolkit & Showcase ML SPL Core App
  • 64. Supervised & Unsupervised Machine Learning •SupervisedLearning:generalizingfromlabeleddata – Classification – Prediction – Estimation – Regression •UnsupervisedLearning:generalizingfromunlabeleddata – Clustering – Association-RuleLearning – Summarization
  • 65. 65 MLTS: ML SPL ML-SPL – Modular interface to ML process and libraries Miniconda – Open source distribution of (almost) entire python data science ecosystem – Distributing on Splunkbase as Python for Scientific Computing 6 | transform <script> <params> | fit <script> as modelname by <feature> <params> | apply modelname ML-SPL ML Libraries (In-Core, Custom, or 3rd Party) ITSI, ES, UBA, Core, Customer Apps, etc.
  • 66. 66 Trending Anomaly Detection 6 Trending AD algorithm - Detects anomalies based on deviation from past behavior
  • 67. 67 Cohesive Anomaly Detection 6 Cohesive AD algorithm - Detects anomalies based on deviation from peer behavior
  • 70. 70 Agenda Problem Background and Motivation Capabilities: Methodology Capabilities: Data Integration and Feature Engineering Capabilities: Statistics, Machine Learning, and Visualization Operational Integration Product Demonstration
  • 73. 73 73 Business Value of Process Analytics Save lives, reduce error, optimize time 1 Reduce cost, increase efficiency 2 3 Improve patient outcome, experience, and engagement
  • 74. 74 Agenda Problem Background and Motivation Capabilities: Methodology Capabilities: Data Integration and Feature Engineering Capabilities: Statistics, Machine Learning, and Visualization Operational Integration Product Demonstration
  • 75. 75 Data Science Resources & more information • Machine Learning Primer: https://homes.cs.washington.edu/~pedrod/papers/cacm12.pdf • Process Mining Research: http://www.processmining.org/ • Doing Data Science: http://www.amazon.com/Doing-Data-Science-Straight- Frontline/dp/1449358659/ref=sr_1_1?ie=UTF8&qid=1436898138&sr=8- 1&keywords=doing+data+science • Data Visualization: http://shop.oreilly.com/product/0636920026938.do • Data Science Resources: http://www.partiallyderivative.com/resources/
  • 76. 76 Splunk Resources • Search Tutorial: http://docs.splunk.com/Documentation/Splunk/latest/SearchTutorial/Welcom etotheSearchTutorial • Training Videos: http://www.splunk.com/view/educa3on-videos/SP-CAAAGB6 • Splunk Docs: http://docs.splunk.com/Documentation/ • Splunkbase Apps & Answers: http://apps.splunk.com/ http://answers.splunk.com/ • Splunk Wiki: http://wiki.splunk.com/ • Developers: http://dev.splunk.com/ • Exploring Splunk Book: http://www.splunk.com/goto/book
  • 77. 77  Interactive, cut/paste examples from popular source repositories: D3, GitHub, jQuery  Splunk 6.x Dashboard Examples App https://splunkbase.splunk.com/app/1603  Custom SimpleXML Extensions App https://splunkbase.splunk.com/app/1772  Splunk Web Framework Toolkit App https://splunkbase.splunk.com/app/1613  Machine Learning Toolkit: https://splunkbase.splunk.com/app/2890/ Example Advanced Visualizations and Machine Learning Toolkit 7
  • 78. 78  http://dev.splunk.com/view/python-sdk/SP- CAAAEU2  http://dev.splunk.com/sdks  http://dev.splunk.com/restapi REST API, SDKs, and Custom Search Command 7
  • 79. Thank You Adrish Sannyasi Healthcare Solution Architect Splunk, asannyasi@splunk.com

Editor's Notes

  • #2: Do we know what a drug or diagnosis code means and does it mean the same in different EHRs? Similarly, do we know what an EHR event in an EHR event log means and does it mean the same in different systems. This last will be important for comparing process models, as EHRs are so user- customizable. “Check Meds” in one EHR might be called “Medications” in another. What exactly does “Check Meds” mean? Where, exactly, does it fit in a hierarchy of tasks, such as “checking” other things besides medications or involvement of medications in other activities besides “checking”? Is asking a patient about medications (or retrieving the medication list from online) an example of “Check Meds”? Is there a difference in the ordering and frequency of activities between patients that were treated by either a high- or low-volume surgeon? (control-flow perspective) Is there a difference in resource involvement between patients that were treated by either a high- or low-volume surgeon? (organisational perspective) Is there a difference in time-related performance between patients that were treated by either a high- or low-volume surgeon? (performance perspective) Is there a difference in the ordering and frequency of activities between patients that had a throughput time of 80 and 40 minutes or less in respectively the pre-operative and final postoperative examination and patients with a longer throughput time? Is there a difference in time-related performance between patients that had a throughput time of 80 and 40 minutes or less in respectively the pre-operative and final postoperative examination and patients with a longer throughput time? It is apparent that the business processes in the medical domain are dynamic, ad-hoc, unstructured and multi-disciplinary in nature. he goal of clustering is to obtain homogeneous group of patients.
  • #11: Subjects, locations, users, different data governance rules and standards that may conflict with each other
  • #12: A defining characteristic of modern health care is the rapidly accelerating increase in information that is available to assist with the delivery of care and system management. Time oriented data, 2. High diversity, 3. Some data is functional others are event logs generated by machines. Data came from activities which are part of sequential process Data is timestamped Activities are interdependent discrete events Machine data is generated by many different sources within the healthcare IT infrastructure. These sources include healthcare specific data sources such as electronic health record (EHR) systems, HL7 messaging, and connected medical devices. The data sources include core IT systems that support different applications such as desktops, servers, storage and network devices. Finally, they include all the patient facing applications and systems – portals, billing systems, claim management systems. Machine data generated by this infrastructure shares the core characteristics of big data – lot of data (high volume), created rapidly (high velocity), from different sources (variety), and data that changes over time (variability). Getting timely and relevant insight into this data can be a source of huge value for the healthcare ecosystem.
  • #18: Discover actual behavior of people, organization, machines and relate to modeled behavior. Insights showing reality is very different from perceptions, opinions, and beliefs Correlation of Millions of ad-hoc events provide clue for standardization and better prepare to handle ad-hoc events. Different people have different views about a same process. Information about the process mat be incomplete. Discover actual behavior of people, organization, and machines and relate to modeled behavior. Correlate millions of ad-hoc events showing how reality is different from perceptions, opinions, and beliefs. Provide clue for standardization and better prepare to handle ad-hoc events.
  • #31: Data Science: validate your assumptions, formulate your hypotheses and test it, find simple principles that may have large impacts and generalized across the population.
  • #32: End to End Value stream: Sequential time oriented tasks, resources are dependent on each other, multiple functional disciplines Value each others time and contributions. Any production problems would be quickly detected and corrected.
  • #41: One reason for agility is handling of data in scale using parallel data processing techniques. And lastly, we enable operational integration- two ways 1) speed of computations, 2) second is system integration through REST API support.
  • #42: Splunk products are being used for data volumes ranging from gigabytes to hundreds of terabytes per day. Splunk software and cloud services reliably collects and indexes machine data, from a single source to tens of thousands of sources. All in real time. Once data is in Splunk Enterprise, you can search, analyze, report on and share insights form your data. The Splunk Enterprise platform is optimized for real-time, low-latency and interactivity, making it easy to explore, analyze and visualize your data. This is described as Operational Intelligence. The insights gained from machine data support a number of use cases and can drive value across your organization. [In North America] Splunk Cloud is available in North America and offers Splunk Enterprise as a cloud-based service – essentially empowering you with Operational Intelligence without any operational effort.
  • #54: algorithm=LLB means “bivariate local level”
  • #55: Vmware – House of Demos app. VM forest, esx server. Status of VMs when you click on particular one. One of the most useful types of visualizations is a “Sankey diagram”, which is used to describe flows through systems. These can be customer flows through marketing or sales funnels, traffic flows through the actual network, energy flows through a physical system, capital flows through a financial system, etc. It’s a very streamlined form of visualization that cuts out everything unrelated to “flow”. Technically, this is a graph visualization: the nodes are smushed to these bars along the side, and edges are represented by these fat bars connecting nodes. The width of a node is proportional to the volume of flow in and out of the node, and the width of an edge is proportional to the flow from the start node to the end node.
  • #56: Customer journey: convert, repeat Mobile Patent Suits Dashed links are resolved suits; green links are licensing. “Thomson Reuters published a rather abysmal infographic showing the "bowl of spaghetti" that is current flurry of patent-related suits in the mobile communications industry. So, inspired by a comment by John Firebaugh, I remade the visualization to better convey the network. That company in the center? Yeah, it's the world's largest, so little wonder it has the most incoming suits.” mbostock’s block #1153292 August 18, 2011 http://bl.ocks.org/mbostock/1153292
  • #65: Most concepts in Machine Learning can be organized under two themes: Supervised Learning: finding hidden structure in labeled data Classification: learn labels from labeled data, apply labels to unlabeled data Prediction: guessing future data values given past and current values Estimation: filling in missing data Regression: fitting parameters of statistical models Unsupervised Learning: finding hidden structure in unlabeled data Clustering: grouping together events or rows by similarity Association-Rule Learning: finding relationships in the data Summarization: aggregating and selecting representative values from the data
  • #73: Alerts are triggered when certain conditions are met by the results of the search upon which it is based. Alerts can be based on both historical and real-time searches. When an alert is triggered, it performs an alert action. This action can be the sending of the alert information to a designated set of email addresses, or the posting of the alert information to an RSS feed. Alerts can also be set up to run a custom script when they are triggered. You can base these alerts on a wide range of threshold and trend-based scenarios, including empty shopping carts, brute force firewall attacks, and server system errors.
  • #80: Is there a difference in the ordering and frequency of activities between patients that were treated by either a high- or low-volume surgeon? (control-flow perspective) Is there a difference in resource involvement between patients that were treated by either a high- or low-volume surgeon? (organisational perspective) Is there a difference in time-related performance between patients that were treated by either a high- or low-volume surgeon? (performance perspective) Is there a difference in the ordering and frequency of activities between patients that had a throughput time of 80 and 40 minutes or less in respectively the pre-operative and final postoperative examination and patients with a longer throughput time? Is there a difference in time-related performance between patients that had a throughput time of 80 and 40 minutes or less in respectively the pre-operative and final postoperative examination and patients with a longer throughput time? It is apparent that the business processes in the medical domain are dynamic, ad-hoc, unstructured and multi-disciplinary in nature. he goal of clustering is to obtain homogeneous group of patients.