SlideShare a Scribd company logo
The Norwegian Primary Care Research
Network IT infrastructure: The Snow system
Johan Gustav Bellika
Professor Nasjonalt Senter for e-helseforskning
Professor II Institutt for Klinisk Medisin, Helsevitenskapelig fakultet, UiT
Seminar on Practical Privacy-Preserving Distributed Statistical Computations
2018.03.05
Dr. John Snow
(1813 – 1858)
Source: WMA Declaration of Helsinki. URL: https://www.wma.net/policies-post/wma-declaration-of-helsinki-ethical-principles-for-medical-research-involving-human-subjects/
http://chrisricecooper.blogspot.no/2015/02/photoessay-on-year-of-ram-by-asian.html
Declaration of Helsinki, Article 6
The primary purpose of medical research involving human
subjects is to understand the causes, development and effects of
diseases and improve preventive, diagnostic and therapeutic
interventions (methods, procedures and treatments).
Even the best proven interventions must be evaluated
continually through research for their safety, effectiveness,
efficiency, accessibility and quality.
Declaration of Helsinki, Article 9
It is the duty of physicians who are involved in medical
research to protect the life, health, dignity, integrity, right to
self-determination, privacy, and confidentiality of personal
information of research subjects.
Source: WMA Declaration of Helsinki. URL: https://www.wma.net/policies-post/wma-declaration-of-helsinki-ethical-principles-for-medical-research-involving-human-subjects/
Medical research should be privacy preserving!
Objectives for the research infrastructure
• Make participation in research projects easier and more efficient for the GPs
• Reuse health data in a safe and privacy preserving manner
• Complete research projects according to scheduled time and resources
consumption
• Recruit 90-110 GP practices
• Cover 7,5 % of the Norwegian population
The IT infrastructure
What is Snow?
• A distributed system
• Enables collection and reuse of anonymous medical data
• Builds and maintains a national online epidemiology-model
• Use the epidemiology model to provide automated IT based health services
• Enable privacy preserving distributed computations on EHR data
• Directed at research, quality improvements, audit, disease surveillance,…
Source:http://upload.wikimedia.org/wikipedia/commons/f/f6/Vibrio_cholerae.jpg
Snow architecture
- enables coordinated computations on distributed resources
- a “collaborative Edge computing” infrastructure [1]
S Coord
S
S
S
S
ClientCoord=Snow Coordination server
S= Snow Server in local health institution
Source: [1] Shi W, Cao J, Zhang Q, Li Y, Xu L. Edge Computing: Vision and Challenges. IEEE Internet Things J. oktober 2016;3(5):637–46.
Edge computing
“Edge computing refers to the enabling technologies allowing computation to be
performed at the edge of the network”[1].
Beneficial when data is:
• To sensitive (health data)
• To big (genetic data)
• To competitive (data will expose profile of owner)
• +++
Source: [1] Shi W, Cao J, Zhang Q, Li Y, Xu L. Edge Computing: Vision and Challenges. IEEE Internet Things J. oktober 2016;3(5):637–46.
The computing entities
• The individual computing process – an “agent”
• One instantiation at each participating Snow server
• One unique communication address for each agent:
agent-user@snow-server-domain/mission_id
• Agents communicates among each other using XMPP messages
• Coordinated computations: “Missions” of multiple agents:
• One “main” coordinating agent
• Multiple computation agents performing computations in parallel
Agent distribution scheme
(Collaborative computations at the edges)
Snow coordinator
Main
agent
Snow server Snow server Snow server
Health network
Comput.
agent
Comput.
agent
Comput.
agent
Health institution Health institutionHealth institution
• A small computer that fits everywhere
• Snow server software is pre-installed
• Very easy installation
• Remote system administration by the Snow
team at UiT / NSE
• Remove the risk of affecting the stability or
performance of operation critical IT systems
– the electronic health record system
• All data in the box is pseudonymised, both
patient and GPs
• Agents compute on the box
Snow appliance box:
The nodes of the
network
11
Data flow in PCRN
Internet Secure health net
GP office 1
Snow
GP
server
EMR
GP office 2
Snow
GP
server
EMR
GP office 3
Snow
GP
server
EMR
Aggregated
data/statistics
Snow
coordinator
server
PCRN net portal
• Distributed data analysis
• Establish projects, invite GPs,
initiate data extraction etc
PCRN internal data
• Epidemiological analyses
• GP and patient data
• Consultation statistics
PCRN CN
Safe haven for data
Research data set
(individual patient data)Secure data storage for
research data set and
advanced data analyses
Local net
= data storage
Using secure multiparty computations to support
research in primary care
Virtualdataset
Creating a virtual dataset with Emnet/Snow
Researcher/PCRN staff Coordinator
Def Def
Def
Def
Clinical
practice 1
Clinical
practice 2
Clinical
practice 3
Aimed at:
1. Make participation in research projects easier and more efficient for the GPs
2. Support researchers in inclusion of sufficient number of patients in clinical research
3. Support article 9 in Helsinki declaration: Privacy preservation
GP tool to identify the eligible patients
Virtualdataset
Distributed statistical computations with Emnet/Snow
Clinical
practice 1
Clinical
practice 2
Clinical
practice 3
Researcher/PCRN staff Coordinator
Query Query
Query
Query
Result
Secure multi-party computation
(SMC)
Aimed at:
1. Support researchers in inclusion of sufficient number of patients in clinical research
Report
database
Virtualdataset
Automated processing
Clinical
practice 1
Clinical
practice 2
Clinical
practice 3
Coordinator
Query
Query
Query
Resultat
Secure multi-party computation
(SMC)
PCRN interne data
Aimed at:
1. Supporting article 6 in the Helsinki declaration: Continuous evaluation
Benefits
• Centralised resources as PCRN staff/researchers can help GPs become more
efficient in research.
• Knowledge about the patient populations can be generated directly from the
distributed sources, spanning administrative borders as municipalities, regions,
countries and continents
• Aggregated (non sensitive) statistics can be produced automatically directly from
the sources.
Drawbacks
• Two other comparable approaches exists, no standard established
• How to validate correctness of computed statistics is an open research question
Thanks for listening
Questions?

More Related Content

PDF
A Survey: Privacy Preserving Using Obfuscated Attribute In e-Health Cloud
PDF
Publishing and sharing sensitive data 28 June
PPTX
Enabling simultaneous analysis of multiple cohort studies: A BRISSKit use case
PDF
Tasmanian data linkage unit
PDF
Intelligent data analysis for medicinal diagnosis
PDF
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
PDF
Anonymizing and Confidential Databases for Privacy Protection Using Suppressi...
PDF
Current trends in data security nursing research ppt
A Survey: Privacy Preserving Using Obfuscated Attribute In e-Health Cloud
Publishing and sharing sensitive data 28 June
Enabling simultaneous analysis of multiple cohort studies: A BRISSKit use case
Tasmanian data linkage unit
Intelligent data analysis for medicinal diagnosis
How Data Commons are Changing the Way that Large Datasets Are Analyzed and Sh...
Anonymizing and Confidential Databases for Privacy Protection Using Suppressi...
Current trends in data security nursing research ppt

What's hot (19)

PPT
Privacy Preserving DB Systems
PPTX
Evolving Open Health Knowledge Network
PDF
78201919
PPTX
Privacy preserving in data mining with hybrid approach
PDF
Performance Analysis of Hybrid Approach for Privacy Preserving in Data Mining
PDF
Survey on Medical Data Sharing Systems with NTRU
PPTX
EDI Training Module 2: EDI Project
PDF
Using Randomized Response Techniques for Privacy-Preserving Data Mining
PPTX
20160523 23 Research Data Things
PDF
Trust threads: Provenance for Data Reuse in Long Tail Science
PPTX
20160719 23 Research Data Things
PDF
A Survey Paper on an Integrated Approach for Privacy Preserving In High Dimen...
PDF
Privacy Preserving Data Mining
PDF
Framework for efficient transformation for complex medical data for improving...
PPTX
Pistoia Alliance conference April 2016: Big Data: Eric Little
PDF
Nist tech beat for january 7, 2015
PDF
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian network
PDF
Cluster Based Access Privilege Management Scheme for Databases
PPTX
The Role of the FAIR Guiding Principles for an effective Learning Health System
Privacy Preserving DB Systems
Evolving Open Health Knowledge Network
78201919
Privacy preserving in data mining with hybrid approach
Performance Analysis of Hybrid Approach for Privacy Preserving in Data Mining
Survey on Medical Data Sharing Systems with NTRU
EDI Training Module 2: EDI Project
Using Randomized Response Techniques for Privacy-Preserving Data Mining
20160523 23 Research Data Things
Trust threads: Provenance for Data Reuse in Long Tail Science
20160719 23 Research Data Things
A Survey Paper on an Integrated Approach for Privacy Preserving In High Dimen...
Privacy Preserving Data Mining
Framework for efficient transformation for complex medical data for improving...
Pistoia Alliance conference April 2016: Big Data: Eric Little
Nist tech beat for january 7, 2015
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian network
Cluster Based Access Privilege Management Scheme for Databases
The Role of the FAIR Guiding Principles for an effective Learning Health System
Ad

Similar to BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Computation (20)

PPTX
170131 tryggve-at ssi-biobanks-ap
PPT
Sinnott Paper
PPTX
Data safe havens: A future EOSC service?
PDF
Security Issues in Biomedical Wireless Sensor Networks Applications: A Survey
PPTX
HealthVault this is a diff presentatikon
PDF
[IJET V2I3P4] Authors: Manjunath Aski, Prathibha P
PDF
A Data-centric perspective on Data-driven healthcare: a short overview
PDF
HPC and Precision Medicine: A New Framework for Alzheimer's and Parkinson's
PPTX
ANDS health and medical data webinar 16 May. Storing and Publishing Health an...
PPTX
Private & Secure.pptx
PDF
IRJET-A Survey on provide security to wireless medical sensor data
PDF
A Survey on provide security to wireless medical sensor data
PPTX
Private & Secure Data Tx Presentation I (1).pptx
DOCX
Case Study 4by Anil NayakiSubmission dat e 12- Dec- 20.docx
PPTX
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
PPTX
Data Harmonization for a Molecularly Driven Health System
PPT
The Future: Overcoming the Barriers to Using NHS Clinical Data For Research P...
PPTX
Data Harmonization for a Molecularly Driven Health System
PDF
Patient Privacy Control for Health Care in Cloud Computing System
PPTX
HEALTH PREDICTION ANALYSIS USING DATA MINING
170131 tryggve-at ssi-biobanks-ap
Sinnott Paper
Data safe havens: A future EOSC service?
Security Issues in Biomedical Wireless Sensor Networks Applications: A Survey
HealthVault this is a diff presentatikon
[IJET V2I3P4] Authors: Manjunath Aski, Prathibha P
A Data-centric perspective on Data-driven healthcare: a short overview
HPC and Precision Medicine: A New Framework for Alzheimer's and Parkinson's
ANDS health and medical data webinar 16 May. Storing and Publishing Health an...
Private & Secure.pptx
IRJET-A Survey on provide security to wireless medical sensor data
A Survey on provide security to wireless medical sensor data
Private & Secure Data Tx Presentation I (1).pptx
Case Study 4by Anil NayakiSubmission dat e 12- Dec- 20.docx
Aleksandar Zivaljevic - Annotation of clinical datasets using openEHR Archety...
Data Harmonization for a Molecularly Driven Health System
The Future: Overcoming the Barriers to Using NHS Clinical Data For Research P...
Data Harmonization for a Molecularly Driven Health System
Patient Privacy Control for Health Care in Cloud Computing System
HEALTH PREDICTION ANALYSIS USING DATA MINING
Ad

More from Statistisk sentralbyrå (20)

PDF
Den europeiske studentundersøkelsen 2018
PDF
Befolkningsframskrivingene 2018, seminar 26. juni
PPTX
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
PPTX
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
PPTX
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
PPTX
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
PPTX
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
PDF
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
PPTX
Innvandrere i Norge 2017, presentasjon fra frokostseminar 11.12.2017
PDF
SSBs API mot Statistikkbanken
PPTX
Norsk kulturbarometer 2016
PPTX
Presentasjon rapport: Levekår blant innvandrere i Norge 2016
PPTX
SSB: Fagseminar om innvandring og inntektsutvikling 16. mars 2017
PPTX
Flyktninger i Norge, presentasjoner fra seminar 14. desember 2016
PPTX
SSBs API mot Statistikkbanken
PPTX
Flyktninger bosatt i Norge: Hvem er de, og hvordan går det med dem?
PPTX
Hva vet vi om verdens flyktninger?
PPTX
4. Hva vet vi om verdens flyktninger?
PPTX
Hva vet vi om verdens flyktninger?
PPTX
Hva vet vi om verdens flyktninger?
Den europeiske studentundersøkelsen 2018
Befolkningsframskrivingene 2018, seminar 26. juni
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Co...
Innvandrere i Norge 2017, presentasjon fra frokostseminar 11.12.2017
SSBs API mot Statistikkbanken
Norsk kulturbarometer 2016
Presentasjon rapport: Levekår blant innvandrere i Norge 2016
SSB: Fagseminar om innvandring og inntektsutvikling 16. mars 2017
Flyktninger i Norge, presentasjoner fra seminar 14. desember 2016
SSBs API mot Statistikkbanken
Flyktninger bosatt i Norge: Hvem er de, og hvordan går det med dem?
Hva vet vi om verdens flyktninger?
4. Hva vet vi om verdens flyktninger?
Hva vet vi om verdens flyktninger?
Hva vet vi om verdens flyktninger?

Recently uploaded (20)

PPTX
Introduction to machine learning and Linear Models
PPT
ISS -ESG Data flows What is ESG and HowHow
PPTX
Business Ppt On Nestle.pptx huunnnhhgfvu
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PDF
Introduction to Data Science and Data Analysis
PPTX
Introduction to Knowledge Engineering Part 1
PDF
Business Analytics and business intelligence.pdf
PDF
Clinical guidelines as a resource for EBP(1).pdf
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PDF
annual-report-2024-2025 original latest.
PPTX
Computer network topology notes for revision
PPTX
1_Introduction to advance data techniques.pptx
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
Introduction to machine learning and Linear Models
ISS -ESG Data flows What is ESG and HowHow
Business Ppt On Nestle.pptx huunnnhhgfvu
Supervised vs unsupervised machine learning algorithms
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
Introduction-to-Cloud-ComputingFinal.pptx
Introduction to Data Science and Data Analysis
Introduction to Knowledge Engineering Part 1
Business Analytics and business intelligence.pdf
Clinical guidelines as a resource for EBP(1).pdf
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
annual-report-2024-2025 original latest.
Computer network topology notes for revision
1_Introduction to advance data techniques.pptx
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
IBA_Chapter_11_Slides_Final_Accessible.pptx
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
STERILIZATION AND DISINFECTION-1.ppthhhbx
oil_refinery_comprehensive_20250804084928 (1).pptx

BigInsight seminar on Practical Privacy-Preserving Distributed Statistical Computation

  • 1. The Norwegian Primary Care Research Network IT infrastructure: The Snow system Johan Gustav Bellika Professor Nasjonalt Senter for e-helseforskning Professor II Institutt for Klinisk Medisin, Helsevitenskapelig fakultet, UiT Seminar on Practical Privacy-Preserving Distributed Statistical Computations 2018.03.05 Dr. John Snow (1813 – 1858)
  • 2. Source: WMA Declaration of Helsinki. URL: https://www.wma.net/policies-post/wma-declaration-of-helsinki-ethical-principles-for-medical-research-involving-human-subjects/ http://chrisricecooper.blogspot.no/2015/02/photoessay-on-year-of-ram-by-asian.html Declaration of Helsinki, Article 6 The primary purpose of medical research involving human subjects is to understand the causes, development and effects of diseases and improve preventive, diagnostic and therapeutic interventions (methods, procedures and treatments). Even the best proven interventions must be evaluated continually through research for their safety, effectiveness, efficiency, accessibility and quality.
  • 3. Declaration of Helsinki, Article 9 It is the duty of physicians who are involved in medical research to protect the life, health, dignity, integrity, right to self-determination, privacy, and confidentiality of personal information of research subjects. Source: WMA Declaration of Helsinki. URL: https://www.wma.net/policies-post/wma-declaration-of-helsinki-ethical-principles-for-medical-research-involving-human-subjects/ Medical research should be privacy preserving!
  • 4. Objectives for the research infrastructure • Make participation in research projects easier and more efficient for the GPs • Reuse health data in a safe and privacy preserving manner • Complete research projects according to scheduled time and resources consumption • Recruit 90-110 GP practices • Cover 7,5 % of the Norwegian population
  • 6. What is Snow? • A distributed system • Enables collection and reuse of anonymous medical data • Builds and maintains a national online epidemiology-model • Use the epidemiology model to provide automated IT based health services • Enable privacy preserving distributed computations on EHR data • Directed at research, quality improvements, audit, disease surveillance,… Source:http://upload.wikimedia.org/wikipedia/commons/f/f6/Vibrio_cholerae.jpg
  • 7. Snow architecture - enables coordinated computations on distributed resources - a “collaborative Edge computing” infrastructure [1] S Coord S S S S ClientCoord=Snow Coordination server S= Snow Server in local health institution Source: [1] Shi W, Cao J, Zhang Q, Li Y, Xu L. Edge Computing: Vision and Challenges. IEEE Internet Things J. oktober 2016;3(5):637–46.
  • 8. Edge computing “Edge computing refers to the enabling technologies allowing computation to be performed at the edge of the network”[1]. Beneficial when data is: • To sensitive (health data) • To big (genetic data) • To competitive (data will expose profile of owner) • +++ Source: [1] Shi W, Cao J, Zhang Q, Li Y, Xu L. Edge Computing: Vision and Challenges. IEEE Internet Things J. oktober 2016;3(5):637–46.
  • 9. The computing entities • The individual computing process – an “agent” • One instantiation at each participating Snow server • One unique communication address for each agent: agent-user@snow-server-domain/mission_id • Agents communicates among each other using XMPP messages • Coordinated computations: “Missions” of multiple agents: • One “main” coordinating agent • Multiple computation agents performing computations in parallel
  • 10. Agent distribution scheme (Collaborative computations at the edges) Snow coordinator Main agent Snow server Snow server Snow server Health network Comput. agent Comput. agent Comput. agent Health institution Health institutionHealth institution
  • 11. • A small computer that fits everywhere • Snow server software is pre-installed • Very easy installation • Remote system administration by the Snow team at UiT / NSE • Remove the risk of affecting the stability or performance of operation critical IT systems – the electronic health record system • All data in the box is pseudonymised, both patient and GPs • Agents compute on the box Snow appliance box: The nodes of the network 11
  • 12. Data flow in PCRN Internet Secure health net GP office 1 Snow GP server EMR GP office 2 Snow GP server EMR GP office 3 Snow GP server EMR Aggregated data/statistics Snow coordinator server PCRN net portal • Distributed data analysis • Establish projects, invite GPs, initiate data extraction etc PCRN internal data • Epidemiological analyses • GP and patient data • Consultation statistics PCRN CN Safe haven for data Research data set (individual patient data)Secure data storage for research data set and advanced data analyses Local net = data storage
  • 13. Using secure multiparty computations to support research in primary care
  • 14. Virtualdataset Creating a virtual dataset with Emnet/Snow Researcher/PCRN staff Coordinator Def Def Def Def Clinical practice 1 Clinical practice 2 Clinical practice 3 Aimed at: 1. Make participation in research projects easier and more efficient for the GPs 2. Support researchers in inclusion of sufficient number of patients in clinical research 3. Support article 9 in Helsinki declaration: Privacy preservation
  • 15. GP tool to identify the eligible patients
  • 16. Virtualdataset Distributed statistical computations with Emnet/Snow Clinical practice 1 Clinical practice 2 Clinical practice 3 Researcher/PCRN staff Coordinator Query Query Query Query Result Secure multi-party computation (SMC) Aimed at: 1. Support researchers in inclusion of sufficient number of patients in clinical research
  • 17. Report database Virtualdataset Automated processing Clinical practice 1 Clinical practice 2 Clinical practice 3 Coordinator Query Query Query Resultat Secure multi-party computation (SMC) PCRN interne data Aimed at: 1. Supporting article 6 in the Helsinki declaration: Continuous evaluation
  • 18. Benefits • Centralised resources as PCRN staff/researchers can help GPs become more efficient in research. • Knowledge about the patient populations can be generated directly from the distributed sources, spanning administrative borders as municipalities, regions, countries and continents • Aggregated (non sensitive) statistics can be produced automatically directly from the sources.
  • 19. Drawbacks • Two other comparable approaches exists, no standard established • How to validate correctness of computed statistics is an open research question