SlideShare a Scribd company logo
Expert Panel on Data Challenges
in Translational Research
in Pharmaceutical and Biomedical Organisations
13.12.2017 – 3PM GMT
Xose Fernandez
Chief Data Officer
Panelists
Alexandre Passioukov
VP Translational Medicine
Abel Ureta-Vidal
Founder & CEO
Based in Cambridge, UK since 2008, on the Wellcome Genome Campus
AI-augmented knowledge discovery platform for Life Sciences R&D
•  Human & animal health
•  Personal care and cosmeceuticals
•  Food and nutraceuticals
Delivering the innovation platform for the genomics era:
e[automateddatascientist]
to enable data driven decisions
to increase the success rate of innovation
About Eagle Genomics
Today: Significant inertia in biologists doing data-rich tasks
Data sources
meta data
summary
documents
protocols
experiment
al & clinical
reports
Concept entailment
pathways
patients
genes
supporting
evidence
drugs
Jennifer
Biologist
Capability 1:
ingest/curate/enrich
ability to ingest/curate/enrich
patient in a semi-automatic
manner and build a
catalog of patients
	
Capability 2:
prioritise/select
ability to prioritise/select the
most valuable/relevant
patient or groups of patients	
Capability 3:
comparison analysis
ability to run complex
comparison analysis
between groups of
patients, using data
	
Capability 4:
visualise/pinpoint
ability to pinpoint the new
insight generated by the
analysis and assess its validity/
robustness/value
	
A translational medicine platform needs to have 4 main
capabilities:
In order to deliver its promise for new insight discovery:
e[automateddatascientist] in translational medicine
Framework applicable in other areas such as preclinical research, animal health, personal care, agri-tech
[to bedside] Interventional Observational
Application Therapeutic
targets
Diagnostic
biomarkers
Biological
mechanisms
Curate patient
data
Select
cohorts
Compare
cohorts
Pinpoint
insight
Patient data
[from bench]
Medical history Lab results
Biomolecular
assays
Images
Internal Collaborative
e[automateddatascientist]
e[curate]
Build
catalog
e[catalog] e[discover] e[discover]+e[nsembl]e[hive]
Capability 1 Capability 4Capability 3Capability 2
Drug
repurposing
…
Capability 1:
ingest/curate/enrich	
value-based curation and
cataloguing	
Capability 2:
prioritise/select
prioritisation selector
Capability 3:
comparison analysis
workflow management
system and analysis
pipeline builder
	
Capability 4:
visualise/pinpoint
insight navigator
&
genome browser
e[automateddatascientist]
Eagle’s platform - the e[automateddatascientist] has each capability embedded in
a module or a set of modules.
Main hurdles and challenges
•  User experience for the specific business cases to drive adoption
•  Interoperability and integration with existing systems
•  Use and/or creation of standards ie FAIR principles (not a standard) for
smooth data/insight exchange
•  Beyond finding relevant data, representing insight for systematic capture
•  Federated data governance, involving cultural change
Data Challenges in Translational Research in
Pharmaceutical and Biomedical Organisations
Dr Xosé M Fernández
Chief Data Officer, Institut Curie
Institut Curie
10
Ins$tut	Curie	-	nom	de	l'éme+eur	-	Titre	de	
la	présenta.on
11
Current Status
Institut Curie
Service Transform
Health and social care services
are under extreme financial
pressure. Data is key and
underpin the capabilities that
will enable insight-based
service transformation.
Digital Maturity
Healthcare information systems
generate vast amounts of data
that must be used for enabling
high-value care.
Information ecosystem
Integration of data is an
increasingly critical goal.
Connecting the “system” and
enabling data to flow across
the organisation.
Strategic alignment
Our current health and social
care service model is changing
(e.g. new models of care)
moving away from traditional
mapping service type and
organisation.
Analysis capability
High-value and effective
service delivery and improved
outcomes cannot happen
without converting rich, high
quality, contextual data into
insight.
12
Tackling some Data Challenges
Data creation, access, use and reuse
Promote sharing policies enshrined in institutional Data Management Plan
Metadata as a means of providing context for datasets, in order to
facilitate future discovery, access, aggregation, use and reuse of data
Support data creators to submit their data or other research materials to
a trusted and sustainable repository, for further curation and long-term
preservation, in line with documented collecting policies
Identification of digital objects, to facilitate discovery and linking of
datasets
Provide quality support and resources as a means of building capacity
and data skills within the Institut Curie.
Institut Curie
13
Heterogeneous Data
Imaging
 Radiographic imaging
  2D/3D ultrasound
 Electron microscopy
 4D microscopy
 Histopathology
  Single-cell imaging…
60%	
EHRs
 Dossier médicale personnel
 Dossier Communiquant en
Cancérologie
5%	
Genomic data
 Genomic platforms
 France Médecine Génomique 2025
20%	
Research data
 Clinical trials
 Phenotypes
 LIMS
 Multiple research databases…
15%	
Institut Curie
14
Information at your Fingertips
Institut Curie
*Continuum Soins Recherche
*
15
Digital Ecosystem
Connecting different datasets requires work on
descriptors (critical time consuming step), as
historical data collections tend to be poorly
annotated.
METADATA
INFRASTRUCTURE
A central data hub hosting pseudo-anonymised
datasets in line with GDPR† requirements, with
links to raw data, as well as a ConSoRe instance
for EMR querying.
Suitable pilot project is the key for success, it
must combine diverse datasets (genomics,
transcriptomics, proteins, EHRs) with a team
keen to invest time to improve metadata.
PILOT PROJECT
Data Aggregation – How it works?
Institut Curie
SEMANTIC LAYER
Discoverability is guaranteed by APIs* which
aggregate pseudo-anonymised datasets
without revealing IDs hosted in secured space.
*Application programming interface
† General Data Protection Regulation
Institut Curie
Most sequencing will be happening at healthcare and not research
By 2025 over 60 million genomes
Data geographically distributed
Clinical data not interoperable
Healthcare not used to handle Terabytes or Exabytes
Technical knowhow currently in the research community
Secure access and governance
Human Genomics in Healthcare
Institut Curie
Main Challenges faced as Chief Data Officer
GDPR* as the main data challenge
Establish new governance (Curie Data Charter)
Outlining policies and procedures (Data Management Plan)
Data aggregation to consolidate digital ecosystem
Interoperability
Cultural change – Information as an asset
Translate data into insightful information
*General Data Protection Regulation
Eagle Genomics - Translational Webinar, Dec 13
Introduction – Dr A. Passioukov
KEY FIGURES
Holding	
€30m	
Dermatology	
€113m	
Ethics	
€390m	
Oncology		
€174m	
CDMO	
€53m	
Dermo-cosme$cs	
€1,143m	
Consumer		
Health	Care	
€333m	
FROM HEALTH TO BEAUTY
Data
Warehouse
Data Analytics
Knowledge
Base
Structured data Analysis Mechanistic
Insight
Genome
Etc..
Transcriptome
Proteome
Metabolome
Phenome
Reseach &
Dev.
global
network
Global Precision
medicine-based
healthcare system
Bioinformatics platform foundational for future healthcare
SEMI-VIRTUAL TM CONCEPT: COMMON DATABASE TO
3 INNOVATION UNITS (ONCO/DERMA/CNS)
1.  Use of cutting edge technologies (incl. digital);
2.  Semi-virtual structure, with a common DataBase ‘agnostic’ of
therapeutic area;
3.  Probability-of-Success (PoS) mindedness - focus on priority R&D
programs: CDx, go-no-go decisions;
4.  Strong synergy with research: using patient data to generate
innovation & early inform IUs of winning options;
5.  Exploration interfaces between IUs - identification of niches based on
molecular biology
Pierre Fabre TM: key principles
Stronger Together & with Others
	
	
	
Turn	to	External	OpportuniCes/CollaboraCon							
In-Silico	Pla5orm	to	connect	to	external	collabora.ons	
	
Acquire	Data		
and/or	
Generate	Data		
	
Data
BIG DIGITAL CHALLENGE FOR BIOPHARMA
1.  Lack of vision for business value creation potential
2.  Data access  – data silos
3.  Formats / Standards / Dictionaries in a heterogeneous “data lake”
4.  Weak signals detection for competitive advantage
www.eaglegenomics.com
Panel discussion & Q&A
www.eaglegenomics.com
Thank you
info@eaglegenomics.com

More Related Content

PDF
Digital transformation of translational medicine
PDF
Validating microbiome claims – including the latest DNA techniques
PDF
Considerations and challenges in building an end to-end microbiome workflow
PPTX
AI in translational medicine webinar
PDF
Healthcare Conference 2013 : Genes, Clouds and Cancer - dr. Andrew Litt
PDF
Pattern diagnostics 2015
PDF
EY Drug R&D: Big DATA for big returns
PDF
Caulder - DIVOS BioITWorld 2015
Digital transformation of translational medicine
Validating microbiome claims – including the latest DNA techniques
Considerations and challenges in building an end to-end microbiome workflow
AI in translational medicine webinar
Healthcare Conference 2013 : Genes, Clouds and Cancer - dr. Andrew Litt
Pattern diagnostics 2015
EY Drug R&D: Big DATA for big returns
Caulder - DIVOS BioITWorld 2015

What's hot (20)

PPTX
X team 2 - presentation
PPTX
Introduction to vision and scope
PPTX
Clinical trials data sharing
PPTX
Conference presentation from #iccs2014 in Noordwijkerhout
PPTX
Enhancing Our Capacity for Large Health Dataset Analysis
PDF
1.4 Holger Woehrle
PDF
Pine.Bio slide deck - Idea Village CAPITALx (New Orleans Entrepreneur Week 2017)
PDF
Bioinformatics Research Scientist Careers at St. Jude Children's Research Hos...
PPT
Data Mining and Big Data Analytics in Pharma
PDF
Bioinformatics Research Scientist Careers at St. Jude
PPTX
Using the CDD Vault for MM4TB
PDF
BigDataAnalytics_Talk_KOCH_FINAL
PPTX
Lessons learned in scaling up
PDF
Pine Biotech
PPTX
NCRIS and the health domain
PPTX
Impact Through Innovation: The Wellcome Sanger Institute
PDF
ELSS use cases and strategy
PPTX
Genomics-Based Medicine Coming into View
PPTX
Watson – from Jeopardy to healthcare
PPTX
lecture 9 B
X team 2 - presentation
Introduction to vision and scope
Clinical trials data sharing
Conference presentation from #iccs2014 in Noordwijkerhout
Enhancing Our Capacity for Large Health Dataset Analysis
1.4 Holger Woehrle
Pine.Bio slide deck - Idea Village CAPITALx (New Orleans Entrepreneur Week 2017)
Bioinformatics Research Scientist Careers at St. Jude Children's Research Hos...
Data Mining and Big Data Analytics in Pharma
Bioinformatics Research Scientist Careers at St. Jude
Using the CDD Vault for MM4TB
BigDataAnalytics_Talk_KOCH_FINAL
Lessons learned in scaling up
Pine Biotech
NCRIS and the health domain
Impact Through Innovation: The Wellcome Sanger Institute
ELSS use cases and strategy
Genomics-Based Medicine Coming into View
Watson – from Jeopardy to healthcare
lecture 9 B
Ad

Similar to Expert Panel on Data Challenges in Translational Research (20)

PPT
Cloud Computing and Innovations for Optimizing Life Sciences Research
PDF
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
PDF
From algorithms to advancing care: genomics data drives progress
PPTX
2016 09 cxo forum
PDF
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
PDF
2015 04-18-wilson cg
PDF
Fair by design
PPTX
Atul Butte NIPS 2017 ML4H
PDF
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdf
PPTX
Powering Medical Research With Data: The Research Analytics Adoption Model
PPTX
2018 10 igneous
PPTX
Building a Next Generation Clinical and Scientific Data Management Solution
PDF
Transforming Big Data into Big Value
PPTX
E.Gombocz: Changing the Model in Pharma and Healthcare (DILS Keynote 2013-07...
PDF
2016 iHT2 San Diego Health IT Summit
PPTX
Data Use Cases - Healthcare & Banking3.pptx
PPTX
Enterprise Analytics: Serving Big Data Projects for Healthcare
PDF
Sun==big data analytics for health care
PDF
Sharing and standards christopher hart - clinical innovation and partnering...
PPTX
Next Gen Sequencing and Associated Big Data / AI problem
Cloud Computing and Innovations for Optimizing Life Sciences Research
Big Data and its Impact on Industry (Example of the Pharmaceutical Industry)
From algorithms to advancing care: genomics data drives progress
2016 09 cxo forum
Combining Patient Records, Genomic Data and Environmental Data to Enable Tran...
2015 04-18-wilson cg
Fair by design
Atul Butte NIPS 2017 ML4H
Leveraging Data Analysis for Advancements in Healthcare and Medical Research.pdf
Powering Medical Research With Data: The Research Analytics Adoption Model
2018 10 igneous
Building a Next Generation Clinical and Scientific Data Management Solution
Transforming Big Data into Big Value
E.Gombocz: Changing the Model in Pharma and Healthcare (DILS Keynote 2013-07...
2016 iHT2 San Diego Health IT Summit
Data Use Cases - Healthcare & Banking3.pptx
Enterprise Analytics: Serving Big Data Projects for Healthcare
Sun==big data analytics for health care
Sharing and standards christopher hart - clinical innovation and partnering...
Next Gen Sequencing and Associated Big Data / AI problem
Ad

Recently uploaded (20)

PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PDF
An interstellar mission to test astrophysical black holes
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
famous lake in india and its disturibution and importance
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPTX
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PDF
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
DOCX
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PPTX
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
PPTX
microscope-Lecturecjchchchchcuvuvhc.pptx
PPT
Chemical bonding and molecular structure
PPTX
Microbiology with diagram medical studies .pptx
PPTX
Cell Membrane: Structure, Composition & Functions
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
PDF
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
An interstellar mission to test astrophysical black holes
HPLC-PPT.docx high performance liquid chromatography
famous lake in india and its disturibution and importance
Classification Systems_TAXONOMY_SCIENCE8.pptx
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
IFIT3 RNA-binding activity primores influenza A viruz infection and translati...
Q1_LE_Mathematics 8_Lesson 5_Week 5.docx
7. General Toxicologyfor clinical phrmacy.pptx
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
microscope-Lecturecjchchchchcuvuvhc.pptx
Chemical bonding and molecular structure
Microbiology with diagram medical studies .pptx
Cell Membrane: Structure, Composition & Functions
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
Mastering Bioreactors and Media Sterilization: A Complete Guide to Sterile Fe...
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
AlphaEarth Foundations and the Satellite Embedding dataset

Expert Panel on Data Challenges in Translational Research

  • 1. Expert Panel on Data Challenges in Translational Research in Pharmaceutical and Biomedical Organisations 13.12.2017 – 3PM GMT
  • 2. Xose Fernandez Chief Data Officer Panelists Alexandre Passioukov VP Translational Medicine Abel Ureta-Vidal Founder & CEO
  • 3. Based in Cambridge, UK since 2008, on the Wellcome Genome Campus AI-augmented knowledge discovery platform for Life Sciences R&D •  Human & animal health •  Personal care and cosmeceuticals •  Food and nutraceuticals Delivering the innovation platform for the genomics era: e[automateddatascientist] to enable data driven decisions to increase the success rate of innovation About Eagle Genomics
  • 4. Today: Significant inertia in biologists doing data-rich tasks Data sources meta data summary documents protocols experiment al & clinical reports Concept entailment pathways patients genes supporting evidence drugs Jennifer Biologist
  • 5. Capability 1: ingest/curate/enrich ability to ingest/curate/enrich patient in a semi-automatic manner and build a catalog of patients Capability 2: prioritise/select ability to prioritise/select the most valuable/relevant patient or groups of patients Capability 3: comparison analysis ability to run complex comparison analysis between groups of patients, using data Capability 4: visualise/pinpoint ability to pinpoint the new insight generated by the analysis and assess its validity/ robustness/value A translational medicine platform needs to have 4 main capabilities: In order to deliver its promise for new insight discovery:
  • 6. e[automateddatascientist] in translational medicine Framework applicable in other areas such as preclinical research, animal health, personal care, agri-tech [to bedside] Interventional Observational Application Therapeutic targets Diagnostic biomarkers Biological mechanisms Curate patient data Select cohorts Compare cohorts Pinpoint insight Patient data [from bench] Medical history Lab results Biomolecular assays Images Internal Collaborative e[automateddatascientist] e[curate] Build catalog e[catalog] e[discover] e[discover]+e[nsembl]e[hive] Capability 1 Capability 4Capability 3Capability 2 Drug repurposing …
  • 7. Capability 1: ingest/curate/enrich value-based curation and cataloguing Capability 2: prioritise/select prioritisation selector Capability 3: comparison analysis workflow management system and analysis pipeline builder Capability 4: visualise/pinpoint insight navigator & genome browser e[automateddatascientist] Eagle’s platform - the e[automateddatascientist] has each capability embedded in a module or a set of modules.
  • 8. Main hurdles and challenges •  User experience for the specific business cases to drive adoption •  Interoperability and integration with existing systems •  Use and/or creation of standards ie FAIR principles (not a standard) for smooth data/insight exchange •  Beyond finding relevant data, representing insight for systematic capture •  Federated data governance, involving cultural change
  • 9. Data Challenges in Translational Research in Pharmaceutical and Biomedical Organisations Dr Xosé M Fernández Chief Data Officer, Institut Curie Institut Curie
  • 11. 11 Current Status Institut Curie Service Transform Health and social care services are under extreme financial pressure. Data is key and underpin the capabilities that will enable insight-based service transformation. Digital Maturity Healthcare information systems generate vast amounts of data that must be used for enabling high-value care. Information ecosystem Integration of data is an increasingly critical goal. Connecting the “system” and enabling data to flow across the organisation. Strategic alignment Our current health and social care service model is changing (e.g. new models of care) moving away from traditional mapping service type and organisation. Analysis capability High-value and effective service delivery and improved outcomes cannot happen without converting rich, high quality, contextual data into insight.
  • 12. 12 Tackling some Data Challenges Data creation, access, use and reuse Promote sharing policies enshrined in institutional Data Management Plan Metadata as a means of providing context for datasets, in order to facilitate future discovery, access, aggregation, use and reuse of data Support data creators to submit their data or other research materials to a trusted and sustainable repository, for further curation and long-term preservation, in line with documented collecting policies Identification of digital objects, to facilitate discovery and linking of datasets Provide quality support and resources as a means of building capacity and data skills within the Institut Curie. Institut Curie
  • 13. 13 Heterogeneous Data Imaging  Radiographic imaging   2D/3D ultrasound  Electron microscopy  4D microscopy  Histopathology   Single-cell imaging… 60% EHRs  Dossier médicale personnel  Dossier Communiquant en Cancérologie 5% Genomic data  Genomic platforms  France Médecine Génomique 2025 20% Research data  Clinical trials  Phenotypes  LIMS  Multiple research databases… 15% Institut Curie
  • 14. 14 Information at your Fingertips Institut Curie *Continuum Soins Recherche *
  • 15. 15 Digital Ecosystem Connecting different datasets requires work on descriptors (critical time consuming step), as historical data collections tend to be poorly annotated. METADATA INFRASTRUCTURE A central data hub hosting pseudo-anonymised datasets in line with GDPR† requirements, with links to raw data, as well as a ConSoRe instance for EMR querying. Suitable pilot project is the key for success, it must combine diverse datasets (genomics, transcriptomics, proteins, EHRs) with a team keen to invest time to improve metadata. PILOT PROJECT Data Aggregation – How it works? Institut Curie SEMANTIC LAYER Discoverability is guaranteed by APIs* which aggregate pseudo-anonymised datasets without revealing IDs hosted in secured space. *Application programming interface † General Data Protection Regulation
  • 16. Institut Curie Most sequencing will be happening at healthcare and not research By 2025 over 60 million genomes Data geographically distributed Clinical data not interoperable Healthcare not used to handle Terabytes or Exabytes Technical knowhow currently in the research community Secure access and governance Human Genomics in Healthcare
  • 17. Institut Curie Main Challenges faced as Chief Data Officer GDPR* as the main data challenge Establish new governance (Curie Data Charter) Outlining policies and procedures (Data Management Plan) Data aggregation to consolidate digital ecosystem Interoperability Cultural change – Information as an asset Translate data into insightful information *General Data Protection Regulation
  • 18. Eagle Genomics - Translational Webinar, Dec 13 Introduction – Dr A. Passioukov
  • 20. Data Warehouse Data Analytics Knowledge Base Structured data Analysis Mechanistic Insight Genome Etc.. Transcriptome Proteome Metabolome Phenome Reseach & Dev. global network Global Precision medicine-based healthcare system Bioinformatics platform foundational for future healthcare
  • 21. SEMI-VIRTUAL TM CONCEPT: COMMON DATABASE TO 3 INNOVATION UNITS (ONCO/DERMA/CNS)
  • 22. 1.  Use of cutting edge technologies (incl. digital); 2.  Semi-virtual structure, with a common DataBase ‘agnostic’ of therapeutic area; 3.  Probability-of-Success (PoS) mindedness - focus on priority R&D programs: CDx, go-no-go decisions; 4.  Strong synergy with research: using patient data to generate innovation & early inform IUs of winning options; 5.  Exploration interfaces between IUs - identification of niches based on molecular biology Pierre Fabre TM: key principles
  • 23. Stronger Together & with Others Turn to External OpportuniCes/CollaboraCon In-Silico Pla5orm to connect to external collabora.ons Acquire Data and/or Generate Data Data
  • 24. BIG DIGITAL CHALLENGE FOR BIOPHARMA 1.  Lack of vision for business value creation potential 2.  Data access  – data silos 3.  Formats / Standards / Dictionaries in a heterogeneous “data lake” 4.  Weak signals detection for competitive advantage