Infrastructures for
secure data analytics
Wessel Kraaij
TNO & Leiden University
EU: Digital technology will
redefine health and care
• Care: transition to Value Based Health Care
• PROMs
• Aggregated cost of care paths
• Patient: self management of health, patient science
• N=1 , personalized health
• Find comparable health trajectories
• Combining the right data may lead to new insights
• BUT Data storage is fragmented
• BUT The GDPR limits the combination of data sets
• Challenges:
• Provide a trusted environment, individual control on data
access and sharing
• Supporting secure and legal data analytics for combined
datasets
Some barriers for statistical analysis and ML
• Data is horizontally partitioned
• Distributed learning
• Personal Health Train (J van Soest)
• Data is vertically partitioned
• Existing practice: Trusted 3rd party (TTP)
• Health Data Cooperative (Midata)
• Prana Data (example of secure
multiparty computation)
ID age income sex
1 55 70000 M
2 45 60000 F
ID Age sex
1 55 M
2 45 F
3 20 F
4 22 M
ID age income sex
3 20 25000 F
4 22 20000 M
ID income
1 70000
2 60000
3 25000
4 20000
Traditional solution –
trusting a third party
• For research:
• create anonymized/ pseudonymized datasets,
possibly using a trusted third party
• Anonymization: remove identifiers
• Export anonymized dataset for research
(academic/ commercial)
Assessment
• The more personal data is combined, the easier it is to re-
identify a profile and the more difficult the anonymization
process is
• E.g. personal well-being data (Fitbit, smartphone apps)
are quite personal and could lead to re-identification
using external data.
• Personal data may seem innocent, but can lead to valuable
insights
• Strict regulation on data processing and storage
• Data leaks can lead to substantial fines
• Professionals are reluctant to use big data approaches
Towards a citizen driven
healthcare economy:
> Citizen’s together form a
Cooperative and a Community
> The cooperative delivers the platform
and governance structure
> Enables individuals to collect their
data (medical and lifestyle)
> Provides services for members and
delivers services to customers
> Data is controlled by citizen and
patients themselves
> Rely on their cooperative
for support
CONTROL
SELF-
MANAGEMENT
HOLLAND HEALTH DATA COOPERATIVE: THE GAMECHANGER
TRUST
REWARD
VIRTUAL SAFE
MIDATA Server(s)
Data Services
and Access Management
3rd Party
Data
Sources
Anony-
mised
export
Lifestyle Health Others
Research
environment
Smartphone 3rd Party Apps
MIDATA
Portal
Quan-
tified
self
App
Coach-
ing
App
Treat-
ment
App
Follow-
up
App
HHDC
HHDC architecture
PRANA DATA (WWW.PRANADATA.NL)
Privacy preserving analyse
of sensitive data
Results:
Proof of principle:
secure ‘babies like
mine’ , homomorphic
encryption, for
predictive mean
matching
(characteristics of
babies with similar
growth curve)
User study
White paper
Collaboration with
Personal health train,
PEP
Follow up in H2020 BigMedilytics
(ErasmusMC, Achmea, TNO and
VWDATA
PREDICT EVOLVEMENT USING PATIENT MATCHER
PREDICT EVOLVEMENT USING PATIENT MATCHER
PREDICT EVOLVEMENT USING PATIENTS MATCHER
Kraaij infrastructures for secure data analytics def brussel 2017
GOAL: ANSWERING CHILD DEVELOPMENT
QUESTIONS WITHOUT SHARING SENSITIVE DATA
SECURE PATIENTS LIKE ME: 1/3 DATA INGEST
SECURE PATIENTS LIKE ME: 2/3 ANALYSIS
Apply ‘predictive mean matching’ in
the encrypted domain
SECURE PATIENTS LIKE ME: 3/3 DECRYPTION
QUESTIONS ANSWERED? PRIVACY PRESERVED?
Alice’s parents/doctor can:
Plot expected growth curve
Spot growth issues that might occur
Obtain benchmarks about parent’s age, doctor visits, obesity, etc.
Privacy preserved:
Sensitive records nor the aggregations are learned by the system
Analysis server learns distribution of comparison metric
The distribution of child lengths is not sensitive

More Related Content

PDF
Towards an ecosystem for privacy respecting analysis of distributed health data
PDF
Enabling Analytics on Sensitive Medical Data with Secure Multiparty Computation
PDF
From personal health data to a personalized advice
PPTX
BDE SC1 Workshop 3 - iASiS (Guillermo Palma)
PPTX
Data Sharing and Release Legislation
PPTX
Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...
PPTX
International perspective for sharing publicly funded medical research data
PPTX
Group project slides of informatics infrastructure (1)
Towards an ecosystem for privacy respecting analysis of distributed health data
Enabling Analytics on Sensitive Medical Data with Secure Multiparty Computation
From personal health data to a personalized advice
BDE SC1 Workshop 3 - iASiS (Guillermo Palma)
Data Sharing and Release Legislation
Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...
International perspective for sharing publicly funded medical research data
Group project slides of informatics infrastructure (1)

What's hot (20)

PPTX
Infrastructure of an informatics department4
PPTX
Legal and regulatory challenges to data sharing for clinical genetics and ge...
PPTX
Clinical Informatics: some lessons learned
PPTX
Architecture and Standards
PPTX
Investigator-initiated clinical trials: a community perspective
PPTX
Analytics in Action - Health
PPTX
Hospital Cloud Forum - thoughts for panel
PDF
P17 fhir chain- applying blockchain to securely and scalably share
PPTX
Introduction to vision and scope
PPTX
The application of new technologies and IT in Health: standards as infrastruc...
PPTX
AMIA 2015 Registries in Accountable Care poster
PDF
Building a National Data Infrastructure to Advance Patient-Centered Comparati...
PDF
Brisbane Health-y Data: What are health and sensitive data and why are they t...
PPT
BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...
PDF
Laurila presentation VTT SmartHealth Ecosystem Event 12.6.2019
PPTX
BDE SC1 Workshop 3 - MIDAS (Michaela Black)
PPT
Towards Building a Person-Centred and Provider-Friendly Health System
PPTX
Interoperability in health care information systems
PPTX
Using Big Data to Personalize the Healthcare Experience in Cancer, Genomics a...
PDF
Blockchain and Patient-Centered Outcomes Measures - Goldwater
Infrastructure of an informatics department4
Legal and regulatory challenges to data sharing for clinical genetics and ge...
Clinical Informatics: some lessons learned
Architecture and Standards
Investigator-initiated clinical trials: a community perspective
Analytics in Action - Health
Hospital Cloud Forum - thoughts for panel
P17 fhir chain- applying blockchain to securely and scalably share
Introduction to vision and scope
The application of new technologies and IT in Health: standards as infrastruc...
AMIA 2015 Registries in Accountable Care poster
Building a National Data Infrastructure to Advance Patient-Centered Comparati...
Brisbane Health-y Data: What are health and sensitive data and why are they t...
BioSHaRE: The DataSHIELD Legal Analysis Template - Susan Wallace - University...
Laurila presentation VTT SmartHealth Ecosystem Event 12.6.2019
BDE SC1 Workshop 3 - MIDAS (Michaela Black)
Towards Building a Person-Centred and Provider-Friendly Health System
Interoperability in health care information systems
Using Big Data to Personalize the Healthcare Experience in Cancer, Genomics a...
Blockchain and Patient-Centered Outcomes Measures - Goldwater

Similar to Kraaij infrastructures for secure data analytics def brussel 2017 (20)

PDF
Citizen controlled health data lockers as a game changer
PPTX
Knowing me, knowing you, knowing your disease
PDF
The shared value of personal and population data
PPTX
Applications of Data Science in Healthcare
PDF
ISD2016_Solution_G_Serge_Bignens
PDF
Privacy Issues in Data-Driven Health Care
DOCX
ScienceDirectAvailable online at www.sciencedirect.com
PDF
A BIG DATA REVOLUTION IN HEALTH CARE SECTOR: OPPORTUNITIES, CHALLENGES AND TE...
PDF
Rock Report: Big Data by @Rock_Health
DOC
2014 IEEE JAVA DATA MINING PROJECT M privacy for collaborative data publishing
DOC
IEEE 2014 JAVA DATA MINING PROJECTS M privacy for collaborative data publishing
PDF
future of health UMCG 3 june1630-a safe society
PPTX
Use Cases - Healthcare & Banking.pptx
PDF
Private Hidden Data for Health Care
PPTX
Using Big Data for Improved Healthcare Operations and Analytics
PDF
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdf
DOCX
M privacy for collaborative data publishing
PDF
eBook - Data Analytics in Healthcare
PPTX
Icdcn healthcare ws_arpanpal
PPTX
Big data analytics in healthcare
Citizen controlled health data lockers as a game changer
Knowing me, knowing you, knowing your disease
The shared value of personal and population data
Applications of Data Science in Healthcare
ISD2016_Solution_G_Serge_Bignens
Privacy Issues in Data-Driven Health Care
ScienceDirectAvailable online at www.sciencedirect.com
A BIG DATA REVOLUTION IN HEALTH CARE SECTOR: OPPORTUNITIES, CHALLENGES AND TE...
Rock Report: Big Data by @Rock_Health
2014 IEEE JAVA DATA MINING PROJECT M privacy for collaborative data publishing
IEEE 2014 JAVA DATA MINING PROJECTS M privacy for collaborative data publishing
future of health UMCG 3 june1630-a safe society
Use Cases - Healthcare & Banking.pptx
Private Hidden Data for Health Care
Using Big Data for Improved Healthcare Operations and Analytics
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdf
M privacy for collaborative data publishing
eBook - Data Analytics in Healthcare
Icdcn healthcare ws_arpanpal
Big data analytics in healthcare

Recently uploaded (20)

PPTX
Arthritis Types, Signs & Treatment with physiotherapy management
PDF
chapter 14.pdf Ch+12+SGOB.docx hilighted important stuff on exa,
PPTX
Benign prostatic hyperplasia, uro anaesthesia
PPTX
guidance--unit 1 semester-5 bsc nursing.
PPTX
ANALGESIC AND ANTI-INFLAMMssssssATORY DRUGS.pptx
PPTX
Hospital Services healthcare management in india
PPT
Pyramid Points Acid Base Power Point (10).ppt
PDF
ENT MedMap you can study for the exam with this.pdf
PPTX
Public Health. Disasater mgt group 1.pptx
PDF
01. Histology New Classification of histo is clear calssification
PDF
crisisintervention-210721062718.presentatiodnf
PDF
Essentials of Hysteroscopy at World Laparoscopy Hospital
PPTX
HIGHLIGHTS of NDCT 2019 WITH IMPACT ON CLINICAL RESEARCH.pptx
PDF
Culturally Sensitive Health Solutions: Engineering Localized Practices (www....
PPT
Pyramid Points Lab Values Power Point(11).ppt
DOCX
PT10 continues to explose your mind right after reading
PPTX
ACUTE CALCULAR CHOLECYSTITIS: A CASE STUDY
PPTX
Nancy Caroline Emergency Paramedic Chapter 15
PPTX
POSTURE.pptx......,............. .........
PPTX
Nancy Caroline Emergency Paramedic Chapter 16
Arthritis Types, Signs & Treatment with physiotherapy management
chapter 14.pdf Ch+12+SGOB.docx hilighted important stuff on exa,
Benign prostatic hyperplasia, uro anaesthesia
guidance--unit 1 semester-5 bsc nursing.
ANALGESIC AND ANTI-INFLAMMssssssATORY DRUGS.pptx
Hospital Services healthcare management in india
Pyramid Points Acid Base Power Point (10).ppt
ENT MedMap you can study for the exam with this.pdf
Public Health. Disasater mgt group 1.pptx
01. Histology New Classification of histo is clear calssification
crisisintervention-210721062718.presentatiodnf
Essentials of Hysteroscopy at World Laparoscopy Hospital
HIGHLIGHTS of NDCT 2019 WITH IMPACT ON CLINICAL RESEARCH.pptx
Culturally Sensitive Health Solutions: Engineering Localized Practices (www....
Pyramid Points Lab Values Power Point(11).ppt
PT10 continues to explose your mind right after reading
ACUTE CALCULAR CHOLECYSTITIS: A CASE STUDY
Nancy Caroline Emergency Paramedic Chapter 15
POSTURE.pptx......,............. .........
Nancy Caroline Emergency Paramedic Chapter 16

Kraaij infrastructures for secure data analytics def brussel 2017

  • 1. Infrastructures for secure data analytics Wessel Kraaij TNO & Leiden University
  • 2. EU: Digital technology will redefine health and care • Care: transition to Value Based Health Care • PROMs • Aggregated cost of care paths • Patient: self management of health, patient science • N=1 , personalized health • Find comparable health trajectories • Combining the right data may lead to new insights • BUT Data storage is fragmented • BUT The GDPR limits the combination of data sets • Challenges: • Provide a trusted environment, individual control on data access and sharing • Supporting secure and legal data analytics for combined datasets
  • 3. Some barriers for statistical analysis and ML • Data is horizontally partitioned • Distributed learning • Personal Health Train (J van Soest) • Data is vertically partitioned • Existing practice: Trusted 3rd party (TTP) • Health Data Cooperative (Midata) • Prana Data (example of secure multiparty computation) ID age income sex 1 55 70000 M 2 45 60000 F ID Age sex 1 55 M 2 45 F 3 20 F 4 22 M ID age income sex 3 20 25000 F 4 22 20000 M ID income 1 70000 2 60000 3 25000 4 20000
  • 4. Traditional solution – trusting a third party • For research: • create anonymized/ pseudonymized datasets, possibly using a trusted third party • Anonymization: remove identifiers • Export anonymized dataset for research (academic/ commercial)
  • 5. Assessment • The more personal data is combined, the easier it is to re- identify a profile and the more difficult the anonymization process is • E.g. personal well-being data (Fitbit, smartphone apps) are quite personal and could lead to re-identification using external data. • Personal data may seem innocent, but can lead to valuable insights • Strict regulation on data processing and storage • Data leaks can lead to substantial fines • Professionals are reluctant to use big data approaches
  • 6. Towards a citizen driven healthcare economy: > Citizen’s together form a Cooperative and a Community > The cooperative delivers the platform and governance structure > Enables individuals to collect their data (medical and lifestyle) > Provides services for members and delivers services to customers > Data is controlled by citizen and patients themselves > Rely on their cooperative for support CONTROL SELF- MANAGEMENT HOLLAND HEALTH DATA COOPERATIVE: THE GAMECHANGER TRUST REWARD VIRTUAL SAFE
  • 7. MIDATA Server(s) Data Services and Access Management 3rd Party Data Sources Anony- mised export Lifestyle Health Others Research environment Smartphone 3rd Party Apps MIDATA Portal Quan- tified self App Coach- ing App Treat- ment App Follow- up App HHDC HHDC architecture
  • 8. PRANA DATA (WWW.PRANADATA.NL) Privacy preserving analyse of sensitive data Results: Proof of principle: secure ‘babies like mine’ , homomorphic encryption, for predictive mean matching (characteristics of babies with similar growth curve) User study White paper Collaboration with Personal health train, PEP Follow up in H2020 BigMedilytics (ErasmusMC, Achmea, TNO and VWDATA
  • 9. PREDICT EVOLVEMENT USING PATIENT MATCHER
  • 10. PREDICT EVOLVEMENT USING PATIENT MATCHER
  • 11. PREDICT EVOLVEMENT USING PATIENTS MATCHER
  • 13. GOAL: ANSWERING CHILD DEVELOPMENT QUESTIONS WITHOUT SHARING SENSITIVE DATA
  • 14. SECURE PATIENTS LIKE ME: 1/3 DATA INGEST
  • 15. SECURE PATIENTS LIKE ME: 2/3 ANALYSIS Apply ‘predictive mean matching’ in the encrypted domain
  • 16. SECURE PATIENTS LIKE ME: 3/3 DECRYPTION
  • 17. QUESTIONS ANSWERED? PRIVACY PRESERVED? Alice’s parents/doctor can: Plot expected growth curve Spot growth issues that might occur Obtain benchmarks about parent’s age, doctor visits, obesity, etc. Privacy preserved: Sensitive records nor the aggregations are learned by the system Analysis server learns distribution of comparison metric The distribution of child lengths is not sensitive