SlideShare a Scribd company logo
Cancer Research:
Computing and Data
Warren A. Kibbe, Ph.D.
Professor, Biostats & Bioinformatics
Chief Data Officer, Duke Cancer Institute
warren.kibbe@duke.edu
@wakibbe #HECBDML
Take homes
• Cancer is a grand challenge
• Data generation is no longer the
bottleneck in biomedical research –
data management, analysis, reasoning
are
• Highlight two technologies enabling a
much more dynamic view of biology
• Two vignettes highlighting
computational and big data challenges
in biomedical research
Data access is pervasive
(10,000+ patient tumors and increasing)
Courtesy of P. Kuhn (USC)
2006-2015:
A Decade of Illuminating the
Underlying Causes of Primary
Untreated Tumors Omics
Characterization
Cancer is a grand challenge
• Deep biological understanding
• Advances in scientific methods
• Advances in instrumentation
• Advances in technology
• Data and computation
• Mathematical models
Cancer Research and Care generate
detailed data that is critical to
create a learning health system for cancer
Requires:
Genomics – the poster child
Genomics – the poster child
Cost per genome Consumer genetic testing
Biology is so much more than DNA
https://xkcd.com/1605/
HPC, Machine Learning, and Big Data
Biological Scales
Molecular to Systems Biology
DNA
Large molecule
Sequence
encodes
identity of protein
Humans: 23 pairs
of
DNA molecules
over 4 billion
nucleotides per
pair
Protein
109 10-12 -10-4
Complexes
30,000 proteins
with variants
proteins,
cofactors,
metabolites,
2nd msgrs
10-6 -103
Organelles,
cells
10-3 -104
Tissues
1 -108
Organs
1 -109
10-9- 10-4 10-9 -10-8
10-8 -10-7 10-7 -10-5
10-6 -10-2 10-3 -1
Size scale (meters)
Time scale (seconds)
compartments
structures
function
signaling,
networks
emergent
properties
© 2004 WAKibbe, Northwestern University
Cryo-EM
• Able to get atomic resolution of
flexible molecules, like membrane-
bound proteins
Single cell techniques
• Sequencing
• Proteomics
• Metabolomics
• Microenvironment
https://arxiv.org/abs/1704.01379
Growing ability to focus
on dynamics!
Example Basic Science Problem
NCI RAS Initiative +
NCI-DOE Joint Initiative
https://science.energy.gov/~/media/ascr/ascac/pdf/meetings/201809/ASCR2018_streitz_nomovies.pdf Fred Streitz
Multi-modal experimental
data, image reconstruction,
analytics
Adaptive spatial
resolution
Adaptive time
stepping
High-fidelity subgrid modeling
Experiments
on nanodisc
CryoEM
imaging
X-ray/neutron
scattering
Protein structure
databases
Adaptive sampling molecular dynamics
simulation codes
Unsupervised deep
feature learning
Uncertainty quantification
Mechanistic
network models
RAS activation
experiments
(FNLCR)
Phase field
model
Coarse-
grain MD
Classical
MD
Machine learning guided
dynamic validation
Granular RAS
membrane
interaction
simulations
Atomic resolution
RAS-RAF interaction
RAS Activation
Predictive simulation
and analysis of RAS
Phase Field model of
lipid membrane
Cancer Moonshot Pilot 2
Lipid content: RAS/HVR binding by SPR, alpha
assays in nanodiscs, liposomes, imaging in GVUs,
lipidomics, SANS (possibly with contrast variation)
RAS/HVR mobility & dynamics: single particle
tracking, FCS, CG simulations of farnesylated
HVR and RAS on nanodiscs and membranes,
use to constrain phase field coupling
RAF-membrane affinity: SPR in liposomes,
biophysical measurements, MD
simulations to identify regions of interest
that interact with membrane
RAS/HVR-membrane binding: SPR in liposomes,
biophysical measurements, SANS (with contrast
variation), AA and free-energy calculations of RAS/HVR
binding to constrain CG parameters, free energies to
inform phase field
HVR structure/dynamics: crystallization,
CD, MD of HVR in multi-component lipid
platform to inform mobility in phase field
model
RAS activity & structure: GTPase, GTP off-rate,
crystallization, NMR, cryo-EM?, SANS, AA MD
simulations constrain CG parameters
RAS-RBD structure: crystallization, NMR, AA
simulations to constrain CG parameters
RBD-CRD and CRAF structure: crystallization, NMR,
cryo-EM, CG simulations validated against AA
simulations
RAS-RBD binding: SPR, ITC, alpha assays in
nanodiscs, TIRF, SANS (possibly with contrast
variation), compare with AA simulations and
constrain CG simualtions
RAF activation: dimerization, phosphorylation state(s), long time-scale CG simulations
and kinetic estimation, multi-scale simulations multi-scale simulations of RAS/RAF
dynamics on membrane
RAS/HVR multimeric state: BRET,
step photobleaching, PALM, AA and
CG MD of KRAS/HV R on
nanodisc and multi-component lipid
platform
Experimental data to inform modelSimulations to build model
Farnesyl dynamics: solid state NMR,
AA and CG simulations of farnesyl in
membranes and lipid bilayers
informs phase field model
Lipid domains: Confocal microscopy
RAS/HVR localization in GUVs, Calibrate
coarse-grained (CG) simulations with all-
atom (AA) Simulations, Calculate free
energies of domains
Close collaboration of experimentalists and
theorists to build predictive model
Team Science is critical
Clinical Trials
Biostatists
Bioinformatics
Clinical Care
Clinical Research
EHRs, Imaging, Lab Systems
Data Science, ML, BD, HPC
Analytics and Visualization
Open Data enhances collaboration and team science!
NCI RAS Initiative +
NCI-DOE Joint Initiative
https://science.energy.gov/~/media/ascr/ascac/pdf/meetings/201809/ASCR2018_streitz_nomovies.pdf Fred Streitz
NCI RAS Initiative +
NCI-DOE Joint Initiative
https://science.energy.gov/~/media/ascr/ascac/pdf/meetings/201809/ASCR2018_streitz_nomovies.pdf Fred Streitz
NCI RAS Initiative +
NCI-DOE Joint Initiative
https://science.energy.gov/~/media/ascr/ascac/pdf/meetings/201809/ASCR2018_streitz_nomovies.pdf Fred Streitz
Example from Precision Medicine
Population vs Individual vs Clinic
Health vs Disease
• What is ’normal’?
• Systematic and measurement error
• Biological heterogeneity
• Population Health
Machine Learning
• v
Clinical, Lab, Molecular data
http://blog.75health.com/what-components-constitute-an-electronic-health-record/
Access to data has changed-Epic
From Apple Health App
Healthcare
• Evidence is not consistently
accessible and structured
• Outcomes are not connected to care
• Patient trajectories are not calculated
or accessible
Healthcare
• More data is ‘digital first’ every day
• Decision aids are needed
• Good UX and responsive computing
and analytics are critical for
improving health outcomes
Understanding Cancer
• Precision medicine will lead to fundamental
understanding of the complex interplay between
genetics, epigenetics, nutrition, environment and
clinical presentation and direct effective,
evidence-based prevention and treatment.
Ramifications across many aspects of health care
Questions?
Warren Kibbe, Ph.D.
warren.kibbe@duke.edu
@wakibbe

More Related Content

PPTX
SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...
PPTX
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
PPTX
Precision oncology ncabr kibbe oct 2017
PPTX
A Vision for a Cancer Research Knowledge System
PPTX
Cancer moonshot and data sharing
PPTX
National Cancer Data Ecosystem and Data Sharing
PPTX
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
PPTX
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...
SAMSI Precision Medicine Keynote, August 2018: Data: where Precision Oncology...
C-Change Cancer Big Data, NCI Genomic Data Commons, Cloud Pilots
Precision oncology ncabr kibbe oct 2017
A Vision for a Cancer Research Knowledge System
Cancer moonshot and data sharing
National Cancer Data Ecosystem and Data Sharing
Day 2 Big Data panel at the NIH BD2K All Hands 2016 meeting
DOE-NCI Pilots presentation at the Frederick National Laboratory Advisory Com...

What's hot (20)

PPTX
NCI Cancer Genomics, Open Science and PMI: FAIR
PPTX
NCI Cancer Genomic Data Commons for NCAB September 2016
PPT
Kibbe One Voice Against Cancer 20170605
PPTX
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
PPTX
Genomics and Computation in Precision Medicine March 2017
PPTX
SuperComputing 16 HPC Matters Panel on Precision Medicine
PPTX
Converged IT Summit - NCI Data Sharing
PPTX
Precision Medicine in the Age of NCI MATCH and the Beau Biden Cancer Moonshot
PPTX
NCI Support for Cancer Data Sharing
PPTX
Data Commons & Data Science Workshop
PPTX
CI4CC Cancer Informatics Perspectives
PPTX
FDA NGS and Big Data Conference September 2014
PPTX
ISCB ECCB BD2K keynote Kibbe 201707
PPTX
Big data sharing
PPTX
CI4CC Moonshot Blue Ribbon Panel Report 20161010
PPTX
Oncoset symposium kibbe 201704
PPTX
Enabling Translational Medicine with e-Science
PPTX
Federal Research & Development for the Florida system Sept 2014
PPTX
ICBO 2014, October 8, 2014
PDF
Big Data in Cancer Control
NCI Cancer Genomics, Open Science and PMI: FAIR
NCI Cancer Genomic Data Commons for NCAB September 2016
Kibbe One Voice Against Cancer 20170605
NCI Cancer Imaging Program - Cancer Research Data Ecosystem
Genomics and Computation in Precision Medicine March 2017
SuperComputing 16 HPC Matters Panel on Precision Medicine
Converged IT Summit - NCI Data Sharing
Precision Medicine in the Age of NCI MATCH and the Beau Biden Cancer Moonshot
NCI Support for Cancer Data Sharing
Data Commons & Data Science Workshop
CI4CC Cancer Informatics Perspectives
FDA NGS and Big Data Conference September 2014
ISCB ECCB BD2K keynote Kibbe 201707
Big data sharing
CI4CC Moonshot Blue Ribbon Panel Report 20161010
Oncoset symposium kibbe 201704
Enabling Translational Medicine with e-Science
Federal Research & Development for the Florida system Sept 2014
ICBO 2014, October 8, 2014
Big Data in Cancer Control
Ad

Similar to HPC, Machine Learning, and Big Data (20)

PPTX
NCI HTAN, cancer trajectories, precision oncology
PPT
Quantitative Medicine Feb 2009
PPTX
Bioinformatics tools for NGS data analysis
PPTX
Prediction for breast cancer using various machine learning algorithms
PPTX
VariantSpark: applying Spark-based machine learning methods to genomic inform...
PPTX
Giab for jax long read 190917
PDF
Building bioinformatics resources for the global community
PPTX
Bioinformatics as a tool for understanding carcinogenesis
PPTX
Artificial Intelligence in Radiation Oncology
PDF
Cancer genome repository_berkeley
PPTX
UNMSymposium2014
PDF
Open Source Networking Solving Molecular Analysis of Cancer
PDF
Computational Pathology Workshop July 8 2014
PDF
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
PDF
2nd CRISPR Precision Genome Editing congress Berlin 2017 Agenda
PDF
The Clinical Genome Conference 2014
PPTX
Cancer Moonshot, Data sharing and the Genomic Data Commons
PDF
Collins seattle-2014-final
PDF
Breast Cancer Diagnostics with Bayesian Networks
PPTX
Nci clinical genomics data sharing ncra sept 2016
NCI HTAN, cancer trajectories, precision oncology
Quantitative Medicine Feb 2009
Bioinformatics tools for NGS data analysis
Prediction for breast cancer using various machine learning algorithms
VariantSpark: applying Spark-based machine learning methods to genomic inform...
Giab for jax long read 190917
Building bioinformatics resources for the global community
Bioinformatics as a tool for understanding carcinogenesis
Artificial Intelligence in Radiation Oncology
Cancer genome repository_berkeley
UNMSymposium2014
Open Source Networking Solving Molecular Analysis of Cancer
Computational Pathology Workshop July 8 2014
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
2nd CRISPR Precision Genome Editing congress Berlin 2017 Agenda
The Clinical Genome Conference 2014
Cancer Moonshot, Data sharing and the Genomic Data Commons
Collins seattle-2014-final
Breast Cancer Diagnostics with Bayesian Networks
Nci clinical genomics data sharing ncra sept 2016
Ad

More from Warren Kibbe (20)

PPTX
CCDI Kibbe Wake Forest University Dec 2023.pptx
PPTX
Big Data Training for Cancer Research, Purdue, May 2023
PPTX
CCDI Overview November 2022
PPTX
RADx-UP CDCC Overview November 2022
PPTX
CCDI Kibbe Big Data Training May 2022
PPTX
Real world data, the National COVID-19 Cohort Consortium, and Oncology 2021
PPTX
Childhood Cancer Data Initiative presentation to the Children’s Brain Tumor N...
PPTX
RADx-UP CDCC presentation for the NIH Disaster Interest Group
PPTX
DCHI webinar on N3C January 2021
PPTX
NCATS CTSA N3C
PPTX
NAACCR June 2020
PPTX
ENAR 2020
PPTX
ENAR 2020
PPTX
Technology and connected health for population science kibbe duke jan 2020
PPTX
Super computing 19 Cancer Computing Workshop Keynote
PPTX
Data Harmonization for a Molecularly Driven Health System
PPTX
Data supporting precision oncology fda wakibbe
PPTX
Role of data in precision oncology
PPTX
Data Harmonization for a Molecularly Driven Health System
PPTX
Data sharing Webinar March 2019
CCDI Kibbe Wake Forest University Dec 2023.pptx
Big Data Training for Cancer Research, Purdue, May 2023
CCDI Overview November 2022
RADx-UP CDCC Overview November 2022
CCDI Kibbe Big Data Training May 2022
Real world data, the National COVID-19 Cohort Consortium, and Oncology 2021
Childhood Cancer Data Initiative presentation to the Children’s Brain Tumor N...
RADx-UP CDCC presentation for the NIH Disaster Interest Group
DCHI webinar on N3C January 2021
NCATS CTSA N3C
NAACCR June 2020
ENAR 2020
ENAR 2020
Technology and connected health for population science kibbe duke jan 2020
Super computing 19 Cancer Computing Workshop Keynote
Data Harmonization for a Molecularly Driven Health System
Data supporting precision oncology fda wakibbe
Role of data in precision oncology
Data Harmonization for a Molecularly Driven Health System
Data sharing Webinar March 2019

Recently uploaded (20)

PPT
ASRH Presentation for students and teachers 2770633.ppt
PPT
Obstructive sleep apnea in orthodontics treatment
PPT
Breast Cancer management for medicsl student.ppt
PPTX
ACID BASE management, base deficit correction
PPTX
Human Reproduction: Anatomy, Physiology & Clinical Insights.pptx
PDF
NEET PG 2025 | 200 High-Yield Recall Topics Across All Subjects
DOCX
RUHS II MBBS Microbiology Paper-II with Answer Key | 6th August 2025 (New Sch...
PPTX
DENTAL CARIES FOR DENTISTRY STUDENT.pptx
PDF
شيت_عطا_0000000000000000000000000000.pdf
PPTX
Imaging of parasitic D. Case Discussions.pptx
PDF
Human Health And Disease hggyutgghg .pdf
PPTX
POLYCYSTIC OVARIAN SYNDROME.pptx by Dr( med) Charles Amoateng
PPTX
neonatal infection(7392992y282939y5.pptx
PPTX
ca esophagus molecula biology detailaed molecular biology of tumors of esophagus
PDF
Copy of OB - Exam #2 Study Guide. pdf
DOC
Adobe Premiere Pro CC Crack With Serial Key Full Free Download 2025
PPTX
Transforming Regulatory Affairs with ChatGPT-5.pptx
PDF
Medical Evidence in the Criminal Justice Delivery System in.pdf
PPTX
post stroke aphasia rehabilitation physician
PPTX
Clinical approach and Radiotherapy principles.pptx
ASRH Presentation for students and teachers 2770633.ppt
Obstructive sleep apnea in orthodontics treatment
Breast Cancer management for medicsl student.ppt
ACID BASE management, base deficit correction
Human Reproduction: Anatomy, Physiology & Clinical Insights.pptx
NEET PG 2025 | 200 High-Yield Recall Topics Across All Subjects
RUHS II MBBS Microbiology Paper-II with Answer Key | 6th August 2025 (New Sch...
DENTAL CARIES FOR DENTISTRY STUDENT.pptx
شيت_عطا_0000000000000000000000000000.pdf
Imaging of parasitic D. Case Discussions.pptx
Human Health And Disease hggyutgghg .pdf
POLYCYSTIC OVARIAN SYNDROME.pptx by Dr( med) Charles Amoateng
neonatal infection(7392992y282939y5.pptx
ca esophagus molecula biology detailaed molecular biology of tumors of esophagus
Copy of OB - Exam #2 Study Guide. pdf
Adobe Premiere Pro CC Crack With Serial Key Full Free Download 2025
Transforming Regulatory Affairs with ChatGPT-5.pptx
Medical Evidence in the Criminal Justice Delivery System in.pdf
post stroke aphasia rehabilitation physician
Clinical approach and Radiotherapy principles.pptx

HPC, Machine Learning, and Big Data

  • 1. Cancer Research: Computing and Data Warren A. Kibbe, Ph.D. Professor, Biostats & Bioinformatics Chief Data Officer, Duke Cancer Institute warren.kibbe@duke.edu @wakibbe #HECBDML
  • 2. Take homes • Cancer is a grand challenge • Data generation is no longer the bottleneck in biomedical research – data management, analysis, reasoning are • Highlight two technologies enabling a much more dynamic view of biology • Two vignettes highlighting computational and big data challenges in biomedical research
  • 3. Data access is pervasive
  • 4. (10,000+ patient tumors and increasing) Courtesy of P. Kuhn (USC) 2006-2015: A Decade of Illuminating the Underlying Causes of Primary Untreated Tumors Omics Characterization Cancer is a grand challenge • Deep biological understanding • Advances in scientific methods • Advances in instrumentation • Advances in technology • Data and computation • Mathematical models Cancer Research and Care generate detailed data that is critical to create a learning health system for cancer Requires:
  • 5. Genomics – the poster child
  • 6. Genomics – the poster child Cost per genome Consumer genetic testing
  • 7. Biology is so much more than DNA https://xkcd.com/1605/
  • 9. Biological Scales Molecular to Systems Biology DNA Large molecule Sequence encodes identity of protein Humans: 23 pairs of DNA molecules over 4 billion nucleotides per pair Protein 109 10-12 -10-4 Complexes 30,000 proteins with variants proteins, cofactors, metabolites, 2nd msgrs 10-6 -103 Organelles, cells 10-3 -104 Tissues 1 -108 Organs 1 -109 10-9- 10-4 10-9 -10-8 10-8 -10-7 10-7 -10-5 10-6 -10-2 10-3 -1 Size scale (meters) Time scale (seconds) compartments structures function signaling, networks emergent properties © 2004 WAKibbe, Northwestern University
  • 10. Cryo-EM • Able to get atomic resolution of flexible molecules, like membrane- bound proteins
  • 11. Single cell techniques • Sequencing • Proteomics • Metabolomics • Microenvironment https://arxiv.org/abs/1704.01379 Growing ability to focus on dynamics!
  • 13. NCI RAS Initiative + NCI-DOE Joint Initiative https://science.energy.gov/~/media/ascr/ascac/pdf/meetings/201809/ASCR2018_streitz_nomovies.pdf Fred Streitz
  • 14. Multi-modal experimental data, image reconstruction, analytics Adaptive spatial resolution Adaptive time stepping High-fidelity subgrid modeling Experiments on nanodisc CryoEM imaging X-ray/neutron scattering Protein structure databases Adaptive sampling molecular dynamics simulation codes Unsupervised deep feature learning Uncertainty quantification Mechanistic network models RAS activation experiments (FNLCR) Phase field model Coarse- grain MD Classical MD Machine learning guided dynamic validation Granular RAS membrane interaction simulations Atomic resolution RAS-RAF interaction RAS Activation Predictive simulation and analysis of RAS Phase Field model of lipid membrane Cancer Moonshot Pilot 2
  • 15. Lipid content: RAS/HVR binding by SPR, alpha assays in nanodiscs, liposomes, imaging in GVUs, lipidomics, SANS (possibly with contrast variation) RAS/HVR mobility & dynamics: single particle tracking, FCS, CG simulations of farnesylated HVR and RAS on nanodiscs and membranes, use to constrain phase field coupling RAF-membrane affinity: SPR in liposomes, biophysical measurements, MD simulations to identify regions of interest that interact with membrane RAS/HVR-membrane binding: SPR in liposomes, biophysical measurements, SANS (with contrast variation), AA and free-energy calculations of RAS/HVR binding to constrain CG parameters, free energies to inform phase field HVR structure/dynamics: crystallization, CD, MD of HVR in multi-component lipid platform to inform mobility in phase field model RAS activity & structure: GTPase, GTP off-rate, crystallization, NMR, cryo-EM?, SANS, AA MD simulations constrain CG parameters RAS-RBD structure: crystallization, NMR, AA simulations to constrain CG parameters RBD-CRD and CRAF structure: crystallization, NMR, cryo-EM, CG simulations validated against AA simulations RAS-RBD binding: SPR, ITC, alpha assays in nanodiscs, TIRF, SANS (possibly with contrast variation), compare with AA simulations and constrain CG simualtions RAF activation: dimerization, phosphorylation state(s), long time-scale CG simulations and kinetic estimation, multi-scale simulations multi-scale simulations of RAS/RAF dynamics on membrane RAS/HVR multimeric state: BRET, step photobleaching, PALM, AA and CG MD of KRAS/HV R on nanodisc and multi-component lipid platform Experimental data to inform modelSimulations to build model Farnesyl dynamics: solid state NMR, AA and CG simulations of farnesyl in membranes and lipid bilayers informs phase field model Lipid domains: Confocal microscopy RAS/HVR localization in GUVs, Calibrate coarse-grained (CG) simulations with all- atom (AA) Simulations, Calculate free energies of domains Close collaboration of experimentalists and theorists to build predictive model
  • 16. Team Science is critical Clinical Trials Biostatists Bioinformatics Clinical Care Clinical Research EHRs, Imaging, Lab Systems Data Science, ML, BD, HPC Analytics and Visualization Open Data enhances collaboration and team science!
  • 17. NCI RAS Initiative + NCI-DOE Joint Initiative https://science.energy.gov/~/media/ascr/ascac/pdf/meetings/201809/ASCR2018_streitz_nomovies.pdf Fred Streitz
  • 18. NCI RAS Initiative + NCI-DOE Joint Initiative https://science.energy.gov/~/media/ascr/ascac/pdf/meetings/201809/ASCR2018_streitz_nomovies.pdf Fred Streitz
  • 19. NCI RAS Initiative + NCI-DOE Joint Initiative https://science.energy.gov/~/media/ascr/ascac/pdf/meetings/201809/ASCR2018_streitz_nomovies.pdf Fred Streitz
  • 22. Health vs Disease • What is ’normal’? • Systematic and measurement error • Biological heterogeneity • Population Health
  • 24. Clinical, Lab, Molecular data http://blog.75health.com/what-components-constitute-an-electronic-health-record/
  • 25. Access to data has changed-Epic
  • 27. Healthcare • Evidence is not consistently accessible and structured • Outcomes are not connected to care • Patient trajectories are not calculated or accessible
  • 28. Healthcare • More data is ‘digital first’ every day • Decision aids are needed • Good UX and responsive computing and analytics are critical for improving health outcomes
  • 29. Understanding Cancer • Precision medicine will lead to fundamental understanding of the complex interplay between genetics, epigenetics, nutrition, environment and clinical presentation and direct effective, evidence-based prevention and treatment. Ramifications across many aspects of health care

Editor's Notes

  • #15:   Biochemical/biophysical properties of fully processed KRAS4b RAS on nanodiscs Lipid composition of RAS:membrane interaction RAS:RAF binding in the context of nanodisc membranes  Structure of RAS on membranes Crystallography of KRAS, and KRAS:effector complexes NMR of KRAS bound to nanodiscs X-ray/neutron scattering of KRAS on nanodiscs Cryo-EM imaging of KRAS protein complexes (+effectors)   Dynamics of RAS in membranes Supported bilayers in vitro Live cell imaging with single molecule tracking Adaptive spatial resolution (e.g., sub-grid modeling) Propagating both coarse-grained and classical (atomistic) MD information, we aim to maintain the highest fidelity possible at the point of interactions while capturing long distance effects Multiple time scales By judiciously switching between spatial scales we enable investigation of timescales that are orders of magnitude longer than possible with fine-scale simulation alone. Automated hypothesis generation and dynamic validation We will efficiently and accurately explore, e.g., possible interaction sequences by coupling Machine Learning techniques with large-scale predictive simulation. Extreme scale simulation Requried novel computational algorithms and techniques will be developed for use on Sierra-class architectures, and will be designed for exascale. Deep learning algorithms Powerful pattern recognition tools will accelerate our predictive simulation capability by giving rapidly identifying, e.g., the time or region where a sub-grid model is needed or by logically exploring an intractably large decision tree. Uncertainty quantification Application of our extensive capability will be tested in the new (highly uncertain) world of biology and healthcare, leading to new insights and the development of new methods Scalable statistical inference tools The continued convergence of data analytics and predictive simulation as we approach exascale will require statistical tools that scale far beyond what is current, requiring the development of new strategies.
  • #23: We now have tools to let us both understand social determinants of health and build ‘early warning systems’ to identify people who are have acute medical issues