SlideShare a Scribd company logo
Leveraging functional genomics
analytics for target discovery
Enrico Ferrero, PhD
Computational Biology @ GSK
Data Science for Pharma
27.01.2016
The drug discovery pipeline
New medicine: $2.5+ bn, 20+ years
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
2
Challenges in the pharma industry
Time and costs are increasing but success rate is declining
3Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Late failure costs more
How to reduce late phase attrition?
4Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
0
200
400
600
800
1000
1200
0
10
20
30
40
50
60
70
80
90
100
Lead discovery Lead optimization Pre-clinical FTIH Phase 2 Phase 3
Relativecost(permolecule)
Nmolecules
Manhattan Institute, 2012
Rethink the drug discovery pipeline
Spend more time and resources in target validation to reduce attrition in later phases
5Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Targetvalidation
Potentialtargets
Pre-clinical FTIH LaunchPhase 2 Phase 3
Lead discovery
Lead optimisation
Launch
PotentialtargetsPotentialtargets
Lead discovery Lead optimisation Pre-clinical FTIH Phase 2 Phase 3
Target
validation
6
Supporting the drug discovery pipeline and drive innovation
Target Preclinical Clinical Launch
Disease
understanding
Target
discovery
Drug
MOA
Indication
mining
Patient
stratification
Efficacy and
safety
Drug
repositioning
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Computational Biology @ GSK
Functional genomics and high-throughput sequencing
Transcriptomics Epigenomics Regulomics
RNA-seq ChIP-seq DNase-seq BS-seq
Disease understanding
Disease progression in rheumatoid arthritis
RNA-seq + BS-seq
 Part of the BTCURE research project, in collaboration with the Academisch Medisch Centrum
(Amsterdam, NL).
 Pilot study involving a small number of synovial biopsies from RA patients at different stages and
degrees of severity profiled by RNA-seq and BS-seq.
 Objective: identify gene expression and methylation signatures that could highlight disease
progression mechanisms.
Differential expression analysis
10
RNA-seq
 Challenges:
 Data-driven identification
of clinical parameters that
are indicative of disease
progression
 Differential expression
analysis with limited
number of samples and
high variability
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Methylation data generation and processing optimization
BS-seq
11
 Challenges:
 Set up and optimise protocol(s) in the lab
 Big strain on sequencing facilities and computational environment
 Identification of appropriate analytical methods
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Genomic responses to viral infection
RNA-seq + DNase-seq
 Part of an ongoing collaboration with the University of Washington Department of Genome
Sciences (Seattle, WA, USA).
 Pilot study with primary epithelial cells from healthy volunteers infected with human rhinovirus.
 Samples profiled by RNA-seq and DNase-seq to identify gene expression and regulatory chromatin
responses to viral infection.
 Objective: Identification of biological mechanisms and pathways relevant for respiratory diseases
with a strong infection component.
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Genomic responses to viral infection
DNase-seq
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
 Challenges:
 Differential analytical
framework for DNase-seq
data
 Interpretation of biological
signal from DNase
hypersensitive sites
Target discovery
Identifying novel Crohn’s targets with strong genetic evidence
Integration of disease genetics with cell-specific functional genomics data
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Identifying novel Crohn’s targets with strong genetic evidence
Crohn’s-associated SNPs in T cell-specific regulatory elements and putative regulated genes
16
Overlapping gene
Correlated gene
ChIA-PET gene
Nearest gene
TFBS
TF motif
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Drug MOA
Neurogenesis-inducing compounds MOA
RNA-seq
 Study to understand the mechanisms of action of two neurogenesis-inducing compounds and
discriminate between the pathways they activate.
 Neural progenitor cells profiled by RNA-seq to identify gene expression responses to the two
compounds.
 Objective: Identification of off-target effects and safety risks.
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Neurogenesis-inducing compounds MOA
RNA-seq
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
What else?
GSK partnerships with academic institutions
A collaborative and pre-competitive effort to improve the target discovery process
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Centre for Therapeutic Target Validation (CTTV)
https://www.targetvalidation.org/
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Conclusions
Leveraging functional genomics analytics for target discovery
 Making drugs is a very failure-prone business. To increase our chances of success, we need to have
better understanding of the biology of:
– Our diseases;
– Our targets;
– Our drugs.
 High-throughput sequencing assays and functional genomic data are more and more widely used
in GSK to drive and support these activities.
 This type of data poses two main challenges:
– Data plumbing: create an infrastructure that is able to deal with the size of these datasets, in terms of both
storage and processing power.
– Data analytics: develop appropriate analytical pipelines that allow to integrate, visualise, analyse and
interpret the data.
 Partnerships with CTTV and Altius demonstrate our vision of a pre-competitive, collaborative space
for target identification and validation.
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
Acknowledgements
 Disease progression in rheumatoid arthritis
(in collaboration with BTCURE and AMC)
– Rab Prinjha (Epinova DPU, GSK)
– Paul-Peter Tak (Immuno-inflammation TA, GSK)
– Danielle Gerlag (Clinical Unit Cambridge, GSK)
– Huw Lewis (Epinova DPU, GSK)
– Erika Cule (Target Sciences, GSK)
– Klio Maratou (Target Sciences, GSK)
– George Royal (Target Sciences, GSK)
 Neurogenesis-inducing compounds MOA
– Hong Lin (Regenerative Medicine DPU, GSK)
– Aaron Chuang (Regenerative Medicine DPU, GSK)
– Julie Holder (Regenerative Medicine DPU, GSK)
– Jing Zhao (Regenerative Medicine DPU, GSK)
– Erika Cule (Target Sciences, GSK)
 Genomic responses to viral infection
(in collaboration with StamLab and UW)
– Edith Hessel (Refractory Respiratory Inflammation DPU,
GSK)
– John Stamatoyannopoulos (StamLab, UW)
– David Michalovich (Refractory Respiratory Inflammation
DPU, GSK)
– Soren Beinke (Refractory Respiratory Inflammation DPU,
GSK)
– Nikolai Belyaev (Refractory Respiratory Inflammation DPU,
GSK)
– Peter Sabo (StamLab, UW)
– Eric Rynes (StamLab, UW)
 Identifying novel Crohn’s targets with strong genetic
evidence
– David Michalovich (Refractory Respiratory Inflammation
DPU, GSK)
– Chris Larminie ( Target Sciences, GSK)
Leveraging functional genomics analytics for target discovery
Enrico Ferrero – Computational Biology @ GSK
We’re hiring!
Computational Biology jobs at:
 http://www.gsk.com/en-gb/careers/search-jobs-and-apply
 https://www.linkedin.com/company/glaxosmithkline/careers

More Related Content

PDF
Prediction of novel targets using disease association data from Open Targets
PPTX
Computational prediction of novel drug targets using gene disease association...
PDF
In silico prediction of novel therapeutic targets using gene - disease associ...
PDF
Automating drug target discovery with machine learning
PPTX
Applications of high-throughput sequencing (HTS) technologies in the pharma i...
PDF
Prediction of novel targets using disease association data from Open Targets
PPTX
Prediction of therapeutic targets using the Open Targets data
PPTX
ARTIFICIAL INTELLIGENCE IN DRUG DISCOVERY "AN OVERVIEW OF AWARENESS"
Prediction of novel targets using disease association data from Open Targets
Computational prediction of novel drug targets using gene disease association...
In silico prediction of novel therapeutic targets using gene - disease associ...
Automating drug target discovery with machine learning
Applications of high-throughput sequencing (HTS) technologies in the pharma i...
Prediction of novel targets using disease association data from Open Targets
Prediction of therapeutic targets using the Open Targets data
ARTIFICIAL INTELLIGENCE IN DRUG DISCOVERY "AN OVERVIEW OF AWARENESS"

What's hot (20)

PDF
Artificial Intelligence for Discovery
PDF
Discovery_Schreiner
PDF
SMi Group's AI in Drug Discovery 2020 conference
PPTX
Very brief overview of AI in drug discovery
PPTX
Ai in drug discovery and drug development
PPTX
AI applications in life sciences - drug development
PPTX
Artificial intelligence in drug discovery
PPTX
Role of AI in Drug Discovery and Development
PDF
Rmc phenotypic screening
PPT
Data mining in pharmacovigilance
PDF
Overcoming obstacles to repurposing for neurodegenerative disease
PPT
Quantitative methods of Signal detection on spontaneous reporting systems - S...
PPTX
Data Science in Drug Discovery
DOCX
Paras new CV`..
PDF
Computational prediction of antimicrobial peptide activity
PDF
Bayesian estimations of strong toxic signals [compatibility mode]
PPTX
BioVariance - Pediatric Pharmacogenomics in Drug Discovery
PPTX
Pistoia Alliance datathon for drug repurposing for rare diseases
PDF
Big Data in Pharma - Overview and Use Cases
PPTX
Combination of informative biomarkers in small pilot studies and estimation ...
Artificial Intelligence for Discovery
Discovery_Schreiner
SMi Group's AI in Drug Discovery 2020 conference
Very brief overview of AI in drug discovery
Ai in drug discovery and drug development
AI applications in life sciences - drug development
Artificial intelligence in drug discovery
Role of AI in Drug Discovery and Development
Rmc phenotypic screening
Data mining in pharmacovigilance
Overcoming obstacles to repurposing for neurodegenerative disease
Quantitative methods of Signal detection on spontaneous reporting systems - S...
Data Science in Drug Discovery
Paras new CV`..
Computational prediction of antimicrobial peptide activity
Bayesian estimations of strong toxic signals [compatibility mode]
BioVariance - Pediatric Pharmacogenomics in Drug Discovery
Pistoia Alliance datathon for drug repurposing for rare diseases
Big Data in Pharma - Overview and Use Cases
Combination of informative biomarkers in small pilot studies and estimation ...
Ad

Similar to Leveraging functional genomics analytics for target discovery (20)

PPTX
Genomics
PDF
Amia tb-review-10
PDF
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
PDF
Zen and the Art of Data Science Maintenance
PPTX
AWS HCLS Virtual Symposium 2021_Maze-Nichols.pptx
PPTX
Pistoia Alliance-Elsevier Datathon
PPTX
Focus on the Evidence: a knowledge graph approach to profiling drug targets
PDF
Amia tb-review-12
PDF
Bioinformatics in dermato-oncology
PDF
The Future of Healthcare with Big Data and AI with Ion Stoica and Frank Nothaft
PPTX
Genomic proteomics
PDF
Open Targets workshop at C4X in 2019
PDF
Gary bader fged_toronto_2012
PDF
Forum on Personalized Medicine: Challenges for the next decade
PDF
Slas2012 Whoeck
PPT
INBIOMEDvision Workshop at MIE 2011. Victoria López
PPTX
2013 nas-ehs-data-integration-dc
PDF
Amia tb-review-13
PDF
Multigenic (mechanistic) biomarkers
PDF
Amia tbi-14-final
Genomics
Amia tb-review-10
Insights from Building the Future of Drug Discovery with Apache Spark with Lu...
Zen and the Art of Data Science Maintenance
AWS HCLS Virtual Symposium 2021_Maze-Nichols.pptx
Pistoia Alliance-Elsevier Datathon
Focus on the Evidence: a knowledge graph approach to profiling drug targets
Amia tb-review-12
Bioinformatics in dermato-oncology
The Future of Healthcare with Big Data and AI with Ion Stoica and Frank Nothaft
Genomic proteomics
Open Targets workshop at C4X in 2019
Gary bader fged_toronto_2012
Forum on Personalized Medicine: Challenges for the next decade
Slas2012 Whoeck
INBIOMEDvision Workshop at MIE 2011. Victoria López
2013 nas-ehs-data-integration-dc
Amia tb-review-13
Multigenic (mechanistic) biomarkers
Amia tbi-14-final
Ad

Recently uploaded (20)

PDF
annual-report-2024-2025 original latest.
PPTX
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
PPTX
IBA_Chapter_11_Slides_Final_Accessible.pptx
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPT
DATA COLLECTION METHODS-ppt for nursing research
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
Business Analytics and business intelligence.pdf
PPTX
Database Infoormation System (DBIS).pptx
PPTX
importance of Data-Visualization-in-Data-Science. for mba studnts
PDF
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
PPTX
Leprosy and NLEP programme community medicine
PDF
Introduction to the R Programming Language
PDF
Introduction to Data Science and Data Analysis
PPTX
A Complete Guide to Streamlining Business Processes
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PPTX
Topic 5 Presentation 5 Lesson 5 Corporate Fin
PPTX
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
PDF
[EN] Industrial Machine Downtime Prediction
PPTX
iec ppt-1 pptx icmr ppt on rehabilitation.pptx
annual-report-2024-2025 original latest.
The THESIS FINAL-DEFENSE-PRESENTATION.pptx
IBA_Chapter_11_Slides_Final_Accessible.pptx
SAP 2 completion done . PRESENTATION.pptx
DATA COLLECTION METHODS-ppt for nursing research
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Business Analytics and business intelligence.pdf
Database Infoormation System (DBIS).pptx
importance of Data-Visualization-in-Data-Science. for mba studnts
REAL ILLUMINATI AGENT IN KAMPALA UGANDA CALL ON+256765750853/0705037305
Leprosy and NLEP programme community medicine
Introduction to the R Programming Language
Introduction to Data Science and Data Analysis
A Complete Guide to Streamlining Business Processes
Qualitative Qantitative and Mixed Methods.pptx
Pilar Kemerdekaan dan Identi Bangsa.pptx
Topic 5 Presentation 5 Lesson 5 Corporate Fin
mbdjdhjjodule 5-1 rhfhhfjtjjhafbrhfnfbbfnb
[EN] Industrial Machine Downtime Prediction
iec ppt-1 pptx icmr ppt on rehabilitation.pptx

Leveraging functional genomics analytics for target discovery

  • 1. Leveraging functional genomics analytics for target discovery Enrico Ferrero, PhD Computational Biology @ GSK Data Science for Pharma 27.01.2016
  • 2. The drug discovery pipeline New medicine: $2.5+ bn, 20+ years Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK 2
  • 3. Challenges in the pharma industry Time and costs are increasing but success rate is declining 3Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 4. Late failure costs more How to reduce late phase attrition? 4Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK 0 200 400 600 800 1000 1200 0 10 20 30 40 50 60 70 80 90 100 Lead discovery Lead optimization Pre-clinical FTIH Phase 2 Phase 3 Relativecost(permolecule) Nmolecules Manhattan Institute, 2012
  • 5. Rethink the drug discovery pipeline Spend more time and resources in target validation to reduce attrition in later phases 5Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK Targetvalidation Potentialtargets Pre-clinical FTIH LaunchPhase 2 Phase 3 Lead discovery Lead optimisation Launch PotentialtargetsPotentialtargets Lead discovery Lead optimisation Pre-clinical FTIH Phase 2 Phase 3 Target validation
  • 6. 6 Supporting the drug discovery pipeline and drive innovation Target Preclinical Clinical Launch Disease understanding Target discovery Drug MOA Indication mining Patient stratification Efficacy and safety Drug repositioning Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK Computational Biology @ GSK
  • 7. Functional genomics and high-throughput sequencing Transcriptomics Epigenomics Regulomics RNA-seq ChIP-seq DNase-seq BS-seq
  • 9. Disease progression in rheumatoid arthritis RNA-seq + BS-seq  Part of the BTCURE research project, in collaboration with the Academisch Medisch Centrum (Amsterdam, NL).  Pilot study involving a small number of synovial biopsies from RA patients at different stages and degrees of severity profiled by RNA-seq and BS-seq.  Objective: identify gene expression and methylation signatures that could highlight disease progression mechanisms.
  • 10. Differential expression analysis 10 RNA-seq  Challenges:  Data-driven identification of clinical parameters that are indicative of disease progression  Differential expression analysis with limited number of samples and high variability Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 11. Methylation data generation and processing optimization BS-seq 11  Challenges:  Set up and optimise protocol(s) in the lab  Big strain on sequencing facilities and computational environment  Identification of appropriate analytical methods Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 12. Genomic responses to viral infection RNA-seq + DNase-seq  Part of an ongoing collaboration with the University of Washington Department of Genome Sciences (Seattle, WA, USA).  Pilot study with primary epithelial cells from healthy volunteers infected with human rhinovirus.  Samples profiled by RNA-seq and DNase-seq to identify gene expression and regulatory chromatin responses to viral infection.  Objective: Identification of biological mechanisms and pathways relevant for respiratory diseases with a strong infection component. Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 13. Genomic responses to viral infection DNase-seq Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK  Challenges:  Differential analytical framework for DNase-seq data  Interpretation of biological signal from DNase hypersensitive sites
  • 15. Identifying novel Crohn’s targets with strong genetic evidence Integration of disease genetics with cell-specific functional genomics data Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 16. Identifying novel Crohn’s targets with strong genetic evidence Crohn’s-associated SNPs in T cell-specific regulatory elements and putative regulated genes 16 Overlapping gene Correlated gene ChIA-PET gene Nearest gene TFBS TF motif Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 18. Neurogenesis-inducing compounds MOA RNA-seq  Study to understand the mechanisms of action of two neurogenesis-inducing compounds and discriminate between the pathways they activate.  Neural progenitor cells profiled by RNA-seq to identify gene expression responses to the two compounds.  Objective: Identification of off-target effects and safety risks. Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 19. Neurogenesis-inducing compounds MOA RNA-seq Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 21. GSK partnerships with academic institutions A collaborative and pre-competitive effort to improve the target discovery process Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 22. Centre for Therapeutic Target Validation (CTTV) https://www.targetvalidation.org/ Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 23. Conclusions Leveraging functional genomics analytics for target discovery  Making drugs is a very failure-prone business. To increase our chances of success, we need to have better understanding of the biology of: – Our diseases; – Our targets; – Our drugs.  High-throughput sequencing assays and functional genomic data are more and more widely used in GSK to drive and support these activities.  This type of data poses two main challenges: – Data plumbing: create an infrastructure that is able to deal with the size of these datasets, in terms of both storage and processing power. – Data analytics: develop appropriate analytical pipelines that allow to integrate, visualise, analyse and interpret the data.  Partnerships with CTTV and Altius demonstrate our vision of a pre-competitive, collaborative space for target identification and validation. Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 24. Acknowledgements  Disease progression in rheumatoid arthritis (in collaboration with BTCURE and AMC) – Rab Prinjha (Epinova DPU, GSK) – Paul-Peter Tak (Immuno-inflammation TA, GSK) – Danielle Gerlag (Clinical Unit Cambridge, GSK) – Huw Lewis (Epinova DPU, GSK) – Erika Cule (Target Sciences, GSK) – Klio Maratou (Target Sciences, GSK) – George Royal (Target Sciences, GSK)  Neurogenesis-inducing compounds MOA – Hong Lin (Regenerative Medicine DPU, GSK) – Aaron Chuang (Regenerative Medicine DPU, GSK) – Julie Holder (Regenerative Medicine DPU, GSK) – Jing Zhao (Regenerative Medicine DPU, GSK) – Erika Cule (Target Sciences, GSK)  Genomic responses to viral infection (in collaboration with StamLab and UW) – Edith Hessel (Refractory Respiratory Inflammation DPU, GSK) – John Stamatoyannopoulos (StamLab, UW) – David Michalovich (Refractory Respiratory Inflammation DPU, GSK) – Soren Beinke (Refractory Respiratory Inflammation DPU, GSK) – Nikolai Belyaev (Refractory Respiratory Inflammation DPU, GSK) – Peter Sabo (StamLab, UW) – Eric Rynes (StamLab, UW)  Identifying novel Crohn’s targets with strong genetic evidence – David Michalovich (Refractory Respiratory Inflammation DPU, GSK) – Chris Larminie ( Target Sciences, GSK) Leveraging functional genomics analytics for target discovery Enrico Ferrero – Computational Biology @ GSK
  • 25. We’re hiring! Computational Biology jobs at:  http://www.gsk.com/en-gb/careers/search-jobs-and-apply  https://www.linkedin.com/company/glaxosmithkline/careers