SlideShare a Scribd company logo
Bioinformatics t9-t10-biocheminformatics v2014
FBW 
9-12-2014 
Wim Van Criekinge
Bioinformatics t9-t10-biocheminformatics v2014
Examen 
<html> 
<title>Examen Bioinformatica</title> 
<center> 
<head> 
<script> 
rnd.today=new Date(); 
rnd.seed=rnd.today.getTime(); 
function rnd() { 
rnd.seed = (rnd.seed*9301+49297) % 233280; 
return rnd.seed/(233280.0); 
}; 
function rand(number) { 
return Math.ceil(rnd()*number); 
}; 
</SCRIPT> 
</head> 
<body bgcolor="#FFFFFF" text="#00FF00" link="#00FF00"> 
<script language="JavaScript"> 
document.write('<table>'); 
document.write('<tr>'); 
document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" 
height="360"></a></td>'); 
rand(98); 
document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" 
height="360"></a></td>'); 
rand(98); 
document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" 
height="360"></a></td>'); 
rand(98); 
document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" 
height="360"></a></td>'); 
rand(98);
Comparative Genomics: The biological Rosetta 
• The keywords can be 
– genome structure 
– gene-organisation 
– known promoter regions 
– known critical amino acid residues. 
• Combination of functional 
modelorganism knowledge 
• Structure-function 
• Identify similar areas of biology 
• Identify orthologous pathways (might 
have different endpoints)
Bioinformatics t9-t10-biocheminformatics v2014
Example: Agro 
Sequence Genome 
Known “lethal” genes 
from worm, drosphila 
Filter for drugability”, 
tractibility & novelty
Example: Extremophiles 
Known lipases 
Filter for 
“workable”lipases 
at 90º C 
Look for species 
with interesting 
phenotypes 
Functional Foods 
Convert Highly Energetic Monosaccharides to Dextrane 
Washing Powder additives 
Sequence Genome 
Clone and produce in large quantities
Bioinformatics t9-t10-biocheminformatics v2014
Drug Discovery: Design new drugs by computer ? 
Problem: pipeline cost rise linear, NCE steady 
Money: bypassing difficult, work on attrition 
Every step requires specific computational tools
Drug Discovery: What is a drug ? 
• Drugs are generally defined as molecules which 
affect biological processes. 
• In order to be effective, the molecule must be 
present in the body at an adequate concentration 
for it to act at the specific site in the body where 
it can exert its effect. 
• Additionally, the molecule must be safe -- that 
is, metabolized and eliminated from the body 
without causing injury. 
• Assumption: next 50 years still a big market in 
small chemical entities which can be 
administered orally in form of a pill (in contrast 
to antibodies) or gene therapy …
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
• Taxol a drug which is an unmodified natural 
compound, is the exception 
• Most drugs require “work” -> need for target 
driven pipeline 
• Humane genome is available so all target are 
identified 
• How to validate (within a given disease area) ?
Drug Discovery: What is a target ? 
• target - a molecule (often a protein) that is instrumental 
to a disease process (though not necessarily directly 
involved), which may be targeted with a potential 
therapeutic. 
• target identification - identifying a molecule (often a 
protein) that is instrumental to a disease process (though 
not necessarily directly involved), with the intention of 
finding a way to regulate that molecule's activity for 
therapeutic purposes. 
• target validation - a crucial step in the drug 
development process. Following the identification of a 
potential disease target, target validation verifies that a 
drug that specifically acts on the target can have a 
significant therapeutic benefit in the treatment of a given 
disease.
Phenotypic Gap 
Total # genes 
# genes with 
known function 
Number of genes 
1980 1990 2000 2010 
Functional Genomics ? 
More than running chip experiments ! 
Proposal to prioritize 
hypothetical protein 
without annotation, nice 
for bioinformatics and 
biologist
Bioinformatics t9-t10-biocheminformatics v2014
Where is optimal drug target ? 
“Optimal” drug target 
Predict side effect 
How to correct disease state 
Side effects ?
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
Genome-wide RNAi 
RNAI vector 
bacteria producing ds RNA for 
each of the 20.000 genes 
proprietary nematode 
responding to RNAi 
20.000 responses 
20.000 genes insert 
library
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
Type-II Diabetes 
Normal insulin signaling 
fat storage LOW 
Reduced insulin signaling 
fat storage HIGH
Industrialized knock-downs 
20,000 bacteria 
each containing 
selected 
C. elegans gene 
proprietary C.elegans strains 
• sensitized to silencing 
• sensitized to relevant pathway 
select genes with desired phenotypes
Pharma is conservative
Bioinformatics t9-t10-biocheminformatics v2014
Structural Genomics 
Molecular functions of 26 383 human genes
Bioinformatics t9-t10-biocheminformatics v2014
Lipinsky for the target ? 
Database of all “drugable” human genes
Drug Discovery: Design new drugs by computer ?
Drug Discovery: Screening definitions 
screening - the automated examination and 
testing of libraries of synthetic and/or organic 
compounds and extracts to identify potential drug 
leads, based on the compound's binding affinity 
for a target molecule. 
screening library - a large collection of 
compounds with different chemical properties or 
shapes, generated either by combinatorial 
chemistry or some other process or by collecting 
samples with interesting biological properties. 
High Throughput Screening: Quick and Dirty… 
from 5000 compounds per day
Drug Discovery: Screening Throughput 
• At the beginning of the 1990s, when the 
term "high-throughput screening" was 
coined, a department of 20 would 
typically be able to screen around 1.5 
million samples in a year, each 
researcher handling around 75,000 
samples. Today, four researchers using 
fully automated robotic technology can 
screen 50,000 samples a day, or around 
2.5 million samples each year.
Robotic arm 
Drug Discovery: HTS – The Wet Lab 
Distribution 
96 / 384 wells 
Read-out 
Fluorescence / 
luminescence 
Optical Bank 
for stability
Drug Discovery: Chemistry Sources 
• Available molecules collections from pharma, 
chemical and agro industry, also from 
academics (Eastern Europe) 
• Natural products from fungi, algae, exotic 
plants, Chinese and ethnobotanic medicines 
• Combinatorial chemistry: it is the generation 
of large numbers of diverse chemical 
compounds (a library) for use in screening 
assays against disease target molecules. 
• Computer drug design (from model 
substrates or X-ray structure)
Drug Discovery 
HIT LEAD
• initial screen established 
• Compounds screened 
• IC50s established 
• Structures verified 
• Minimum of three independent 
chemical series to evaluate 
• Positive in silico PK data 
Drug Discovery: HIT
Drug Discovery: Hit/lead computational approaches 
• When the structure of the target is unknown, 
the activity data can be used to construct a 
pharmacophore model for the positioning of 
key features like hydrogen-bonding and 
hydrophobic groups. 
• Such a model can be used as a template to 
select the most promising candidates from the 
library.
• lead compound - a potential drug candidate emerging from a 
screening process of a large library of compounds. 
• It basically affects specifically a biological process. 
Mechanism of activity (reversible/ irreversible, kinetics) 
established 
• Its is effective at a low concentration: usually nanomolar 
activity 
• It is not toxic to live cells 
• It has been shown to have some in vivo activity 
• It is chemically feasible. Specificity of key compound(s) from 
each lead series against selected number of receptors/enzymes 
• Preliminary PK in vivo (rodent) to establish benchmark for in 
vitro SAR 
• In vitro PK data good predictor for in vivo activity 
• Its is of course New and Original. 
Drug Discovery: Lead ?
Lipinski: « rule of 5 » 
"In the USAN set we found that the sum of Ns and Os in the molecular formula was 
greater than 10 in 12% of the compounds. Eleven percent of compounds had 
a MWT of over 500. Ten percent of compounds had a CLogP larger than 5 (or 
an MLogP larger than 4.15) and in 8% of compounds the sum of OHs and NHs 
in the chemical structure was larger than 5. The "rule of 5" states that: poor 
absorption or permeation is more likely when: 
A. There are less than 5 H-bond donors (expressed as the sum of OHs and 
NHs); 
B. The MWT is less than 500; 
C. The LogP is less than 5 (or MLogP is < 4.15); 
D. There are less than 10 H-bond acceptors (expressed as the sum of Ns and 
Os). 
Compound classes that are substrates for biological transporters are exceptions to 
the rule." 
Christopher A. Lipinski, Franco Lombardo, Beryl W. Dominy, Paul J. Feeney 
"Experimental and computational approaches to estimate solubility and 
permeability in drug discovery and development settings":
• A quick sketch with ChemDraw, conversion to a 
3D structure with Chem3D, and processing by 
QuikProp, reveals that the problem appears to be 
poor cell permeability for this relatively polar 
molecule, with predicted PCaco and PMDCK 
values near 10 nm/s. 
• Free alternative (Chemsketch / PreADME)
(Celebrex) 
Methyl in this position makes it a weaker cox-2 inhibitor, 
but site of metabolic oxidation and ensures an acceptable clearance 
Drug-like-ness
To assist combinatorial chemistry, buy specific compunds
Bioinformatics t9-t10-biocheminformatics v2014
Structural Descriptors: (15 descriptors) 
Molecular Formula, Molecular Weight, Formal Charge, The Number of Rotatable Bonds, The Number of Rigid 
Bonds, The Number of Rings, The Number of Aromatic Rings, The Number of H Bond Acceptors, The 
Number of H Bond Donors, The Number of (+) Charged Groups, The Number of (-) Charged Groups, No. 
single, double, triple, aromatic bonds 
Topological Descriptors:(350 descriptors) 
• Topological descriptors on the adjustancy and distance matrix 
• Count descriptors 
• Kier & Hall molecular connectivity Indices 
• Kier Shape Indices 
• Galvez topological charge Indices 
• Narumi topological index 
• Autocorrelation descriptor of atomic masses, atomic polarizability, Pauling electronegativity and van der 
Waals radius 
• Information content descriptors 
• Electrotopological state index (E-state) 
• Atomic-Level-Based AI topological descriptors 
Physicochemical Descriptor:(10 descriptors) 
AlogP98 (calculated logP), SKlogP (calculated logP), SKlogS in pure water (calculated water solubility), SKlogS in 
buffer system (calculated water solubility),SK vap (calculated vapor pressure), SK bp (calculated boiling 
point), SK mp (calculated meling point), AMR (calculated molecular refractivity), APOL(calculated 
polarizability), Water Solvation Free Energy 
Geometrical Descriptor:(9 descriptors) 
Topological Polar Surface Area, 2D van der Waals Volume, 2D van der Waals Surface Area, 2D van der Waals 
Hydrophobic Surface Area, 2D van der Waals Polar Surface Area, 2D van der Waals H-bond Acceptor Surface 
Area, 2D van der Waals H-bond Donor Surface Area, 2D van der Waals (+) Charged Groups Surface Area, 2D 
van der Waals (-) Charged Groups Surface Area
Drug Discovery: Hit/lead computational approaches 
• What can you do with these descriptors ? 
• Cluster entire chemical library 
– Diversity set 
– Focused set
Drug Discovery: Docking 
• Structure is known, virtual screening -> docking 
• Many different approaches 
– DOCK 
– FlexX 
– Glide 
– GOLD 
• Including conformational sampling of the ligand 
• Problem: 
– host flexibility 
– solvatation 
• Example: Bissantz et al. 
– Hit rate of 10% for single scoring function 
– Up to 70% with triple scoring (bagging)
Drug Discovery: De novo design / rational drug design 
• Given the target site: 
• Docking + structure generator 
• Specialized approach: growing 
substituent on a core 
– LUDI 
– SPROUT 
– BOMB (biochemical and organic model 
builder) 
– SYNOPSIS 
• Problem is the scoring function 
which is different for every protein 
class
Drug Discovery: Novel strategies using bio/cheminformatics 
- HTS ? Chemical space is big (1041) 
- Biased sets/focussed libraries -> bioinformatics !!! 
- How ? Use phylogenetics and known structures to define 
accesible (conserved) functional implicated residues to 
define small molecule pharmacophores (minimal 
requirements) 
- Desciptor search (cheminformatics) to construct/select 
biased compound set 
- ensure serendipity by iterative screening of these 
predesigned sets
Drug Discovery 
Toxigenomics 
Metabogenomics
Bioinformatics t9-t10-biocheminformatics v2014
Drug Discovery: Clinical studies 
• Preclinical - An early phase of development 
including initial safety assessment 
Phase I - Evaluation of clinical pharmacology, 
usually conducted in volunteers 
Phase II - Determination of dose and initial 
evaluation of efficacy, conducted in a small 
number of patients 
Phase III - Large comparative study 
(compound versus placebo and/or established 
treatment) in patients to establish clinical 
benefit and safety 
Phase IV - Post marketing study
Bioinformatics t9-t10-biocheminformatics v2014
Bioinformatics t9-t10-biocheminformatics v2014
Drug Discovery & Development: IND filing
Hapmap
Pharmacogenomics 
Predictive/preventive – systems biology
Sneak preview 
Bioinformatics (re)loaded
Sneak preview 
Bioinformatics (re)loaded 
• Relational datamodels 
– BioSQL (MySQL) 
• Data Visualisation 
– Interface 
• Apache 
• PHP 
• Large Scale Statistics 
– Using R

More Related Content

PDF
Thesis bio bix_2014
PPT
2015 04 22_time_labs_shared
PPTX
2015 bioinformatics personal_genomics_wim_vancriekinge
PPT
2015 03 13_puurs_v_public
PPTX
2015 bioinformatics wim_vancriekinge
PPTX
2015 bioinformatics bio_cheminformatics_wim_vancriekinge
PDF
Using methylation patterns to determine origin of biological material and age
PPTX
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge
Thesis bio bix_2014
2015 04 22_time_labs_shared
2015 bioinformatics personal_genomics_wim_vancriekinge
2015 03 13_puurs_v_public
2015 bioinformatics wim_vancriekinge
2015 bioinformatics bio_cheminformatics_wim_vancriekinge
Using methylation patterns to determine origin of biological material and age
2016 bioinformatics i_bio_cheminformatics_wimvancriekinge

What's hot (20)

PPTX
2016 bioinformatics i_wim_vancriekinge_vupload
PPT
Genome annotation 2013
PPTX
Personalized Medicine and the Omics Revolution by Professor Mike Snyder
PPT
Genomics & Epigenomics
PDF
The Microbiome of Research Animals : Implications for Reproducibility, Transl...
PPTX
Plant genome sequencing and crop improvement
PPTX
Python meetup 2014
PDF
Functional annotation
PDF
2 md2016 annotation
PPTX
Embed Repro Test
PDF
Aug2015 deanna church analytical validation
PPTX
20140613 Analysis of High Throughput DNA Methylation Profiling
PDF
BIOL335: Functional genomics
PPTX
Clinical molecular diagnostics for drug guidance
PPTX
Whole genome sequencing of bacteria & analysis
PDF
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
PPTX
Introduction to Proteogenomics
PPTX
SAGE- Serial Analysis of Gene Expression
PDF
Next Generation Sequencing Informatics - Challenges and Opportunities
PDF
SPIN Workshop Microbial Genomics @NIST
2016 bioinformatics i_wim_vancriekinge_vupload
Genome annotation 2013
Personalized Medicine and the Omics Revolution by Professor Mike Snyder
Genomics & Epigenomics
The Microbiome of Research Animals : Implications for Reproducibility, Transl...
Plant genome sequencing and crop improvement
Python meetup 2014
Functional annotation
2 md2016 annotation
Embed Repro Test
Aug2015 deanna church analytical validation
20140613 Analysis of High Throughput DNA Methylation Profiling
BIOL335: Functional genomics
Clinical molecular diagnostics for drug guidance
Whole genome sequencing of bacteria & analysis
Ernesto Picardi – Bioinformatica e genomica comparata: nuove strategie sperim...
Introduction to Proteogenomics
SAGE- Serial Analysis of Gene Expression
Next Generation Sequencing Informatics - Challenges and Opportunities
SPIN Workshop Microbial Genomics @NIST
Ad

Viewers also liked (20)

PPTX
Bioinformatica p1-perl-introduction
PPTX
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
PPTX
2015 bioinformatics score_matrices_wim_vancriekinge
PPTX
2015 bioinformatics alignments_wim_vancriekinge
PPTX
2015 09 imec_wim_vancriekinge_v42_to_present
PPTX
2015 bioinformatics bio_python
PPT
2014 05 21_personal_genomics_v_n2n_vfinal
PPTX
Bioinformatics life sciences_2012
PPTX
Bioinformatics t7-proteinstructure v2014
PPTX
Bioinformatics t8-go-hmm v2014
PPTX
2011 10 26_quantitative_cell_biology_molecular_profiling_v_twitter
PPTX
2015 bioinformatics bio_python_part4
PPTX
2015 bioinformatics python_io_wim_vancriekinge
PPT
Bioinformatica 10-11-2011-t5-database searching
PPTX
2012 12 12_adam_v_final
PPTX
Jose María Ordovás-El impacto de las ciencias ómicas en la nutrición, la medi...
PPTX
Bioinformatica t3-scoring matrices
PPT
2013 03 12_epigenetic_profiling
PPTX
Bioinformatics t5-databasesearching v2014
PDF
NXTGNT kick off
Bioinformatica p1-perl-introduction
2015 bioinformatics python_introduction_wim_vancriekinge_vfinal
2015 bioinformatics score_matrices_wim_vancriekinge
2015 bioinformatics alignments_wim_vancriekinge
2015 09 imec_wim_vancriekinge_v42_to_present
2015 bioinformatics bio_python
2014 05 21_personal_genomics_v_n2n_vfinal
Bioinformatics life sciences_2012
Bioinformatics t7-proteinstructure v2014
Bioinformatics t8-go-hmm v2014
2011 10 26_quantitative_cell_biology_molecular_profiling_v_twitter
2015 bioinformatics bio_python_part4
2015 bioinformatics python_io_wim_vancriekinge
Bioinformatica 10-11-2011-t5-database searching
2012 12 12_adam_v_final
Jose María Ordovás-El impacto de las ciencias ómicas en la nutrición, la medi...
Bioinformatica t3-scoring matrices
2013 03 12_epigenetic_profiling
Bioinformatics t5-databasesearching v2014
NXTGNT kick off
Ad

Similar to Bioinformatics t9-t10-biocheminformatics v2014 (20)

PPTX
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
PPT
Bioinformatica 15-12-2011-t9-t10-bio cheminformatics
PPT
Session 1 part 2
PDF
A_Pope_RQRM_LeadDisc_June_2016
PPTX
Multiplexing analysis of 1000 approved drugs in PubChem
PDF
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
PPTX
computer aided drug design chapter 1 unit 1
PPT
Sparsh bioinfo.ppt
PPTX
Mashing Up Drug Discovery
PPTX
Basic of the Drug Discovery. and Study pptx
PPTX
Drug discovery clinical evaluation of new drugs
PPTX
Drug discovery clinical evaluation of new drugs
PDF
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
PPTX
Drug discovery anthony crasto
PPTX
Next-Gen Drug Discovery: An Integrated Micro-Droplet Based Platform
PPT
Drug discovery approaches lecture.ppt pharma
PPTX
Natural products in drug discovery
PPTX
Back Rapid lead compounds discovery through high-throughput screening
PPTX
naturalproductsindrugdiscovery-180307072123.pptx
PPTX
MRCT's Centre for Therapeutics Discovery
Bioinformatics t9-t10-bio cheminformatics-wimvancriekinge_v2013
Bioinformatica 15-12-2011-t9-t10-bio cheminformatics
Session 1 part 2
A_Pope_RQRM_LeadDisc_June_2016
Multiplexing analysis of 1000 approved drugs in PubChem
Computational Drug Discovery: Machine Learning for Making Sense of Big Data i...
computer aided drug design chapter 1 unit 1
Sparsh bioinfo.ppt
Mashing Up Drug Discovery
Basic of the Drug Discovery. and Study pptx
Drug discovery clinical evaluation of new drugs
Drug discovery clinical evaluation of new drugs
EDSP Prioritization: Collaborative Estrogen Receptor Activity Prediction Proj...
Drug discovery anthony crasto
Next-Gen Drug Discovery: An Integrated Micro-Droplet Based Platform
Drug discovery approaches lecture.ppt pharma
Natural products in drug discovery
Back Rapid lead compounds discovery through high-throughput screening
naturalproductsindrugdiscovery-180307072123.pptx
MRCT's Centre for Therapeutics Discovery

More from Prof. Wim Van Criekinge (20)

PPTX
2020 02 11_biological_databases_part1
PPTX
2019 03 05_biological_databases_part5_v_upload
PPTX
2019 03 05_biological_databases_part4_v_upload
PPTX
2019 03 05_biological_databases_part3_v_upload
PPTX
2019 02 21_biological_databases_part2_v_upload
PPTX
2019 02 12_biological_databases_part1_v_upload
PPTX
P7 2018 biopython3
PPTX
P6 2018 biopython2b
PPTX
P4 2018 io_functions
PPTX
P3 2018 python_regexes
PPTX
T1 2018 bioinformatics
PPTX
P1 2018 python
PDF
Bio ontologies and semantic technologies[2]
PPTX
2018 05 08_biological_databases_no_sql
PPTX
2018 03 27_biological_databases_part4_v_upload
PPTX
2018 03 20_biological_databases_part3
PPTX
2018 02 20_biological_databases_part2_v_upload
PPTX
2018 02 20_biological_databases_part1_v_upload
PPTX
P7 2017 biopython3
PPTX
P6 2017 biopython2
2020 02 11_biological_databases_part1
2019 03 05_biological_databases_part5_v_upload
2019 03 05_biological_databases_part4_v_upload
2019 03 05_biological_databases_part3_v_upload
2019 02 21_biological_databases_part2_v_upload
2019 02 12_biological_databases_part1_v_upload
P7 2018 biopython3
P6 2018 biopython2b
P4 2018 io_functions
P3 2018 python_regexes
T1 2018 bioinformatics
P1 2018 python
Bio ontologies and semantic technologies[2]
2018 05 08_biological_databases_no_sql
2018 03 27_biological_databases_part4_v_upload
2018 03 20_biological_databases_part3
2018 02 20_biological_databases_part2_v_upload
2018 02 20_biological_databases_part1_v_upload
P7 2017 biopython3
P6 2017 biopython2

Recently uploaded (20)

PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
PDF
Weekly quiz Compilation Jan -July 25.pdf
PDF
Classroom Observation Tools for Teachers
PPTX
202450812 BayCHI UCSC-SV 20250812 v17.pptx
PPTX
Lesson notes of climatology university.
PDF
Yogi Goddess Pres Conference Studio Updates
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
LDMMIA Reiki Yoga Finals Review Spring Summer
PDF
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PDF
Trump Administration's workforce development strategy
PPTX
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Tissue processing ( HISTOPATHOLOGICAL TECHNIQUE
Weekly quiz Compilation Jan -July 25.pdf
Classroom Observation Tools for Teachers
202450812 BayCHI UCSC-SV 20250812 v17.pptx
Lesson notes of climatology university.
Yogi Goddess Pres Conference Studio Updates
Microbial diseases, their pathogenesis and prophylaxis
Microbial disease of the cardiovascular and lymphatic systems
LDMMIA Reiki Yoga Finals Review Spring Summer
LNK 2025 (2).pdf MWEHEHEHEHEHEHEHEHEHEHE
Paper A Mock Exam 9_ Attempt review.pdf.
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Trump Administration's workforce development strategy
Radiologic_Anatomy_of_the_Brachial_plexus [final].pptx
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Anesthesia in Laparoscopic Surgery in India
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3

Bioinformatics t9-t10-biocheminformatics v2014

  • 2. FBW 9-12-2014 Wim Van Criekinge
  • 4. Examen <html> <title>Examen Bioinformatica</title> <center> <head> <script> rnd.today=new Date(); rnd.seed=rnd.today.getTime(); function rnd() { rnd.seed = (rnd.seed*9301+49297) % 233280; return rnd.seed/(233280.0); }; function rand(number) { return Math.ceil(rnd()*number); }; </SCRIPT> </head> <body bgcolor="#FFFFFF" text="#00FF00" link="#00FF00"> <script language="JavaScript"> document.write('<table>'); document.write('<tr>'); document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" height="360"></a></td>'); rand(98); document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" height="360"></a></td>'); rand(98); document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" height="360"></a></td>'); rand(98); document.write('<td><a href="index.html" ><img border=0 src="' + rand(713) + '.jpg" width="520" height="360"></a></td>'); rand(98);
  • 5. Comparative Genomics: The biological Rosetta • The keywords can be – genome structure – gene-organisation – known promoter regions – known critical amino acid residues. • Combination of functional modelorganism knowledge • Structure-function • Identify similar areas of biology • Identify orthologous pathways (might have different endpoints)
  • 7. Example: Agro Sequence Genome Known “lethal” genes from worm, drosphila Filter for drugability”, tractibility & novelty
  • 8. Example: Extremophiles Known lipases Filter for “workable”lipases at 90º C Look for species with interesting phenotypes Functional Foods Convert Highly Energetic Monosaccharides to Dextrane Washing Powder additives Sequence Genome Clone and produce in large quantities
  • 10. Drug Discovery: Design new drugs by computer ? Problem: pipeline cost rise linear, NCE steady Money: bypassing difficult, work on attrition Every step requires specific computational tools
  • 11. Drug Discovery: What is a drug ? • Drugs are generally defined as molecules which affect biological processes. • In order to be effective, the molecule must be present in the body at an adequate concentration for it to act at the specific site in the body where it can exert its effect. • Additionally, the molecule must be safe -- that is, metabolized and eliminated from the body without causing injury. • Assumption: next 50 years still a big market in small chemical entities which can be administered orally in form of a pill (in contrast to antibodies) or gene therapy …
  • 16. • Taxol a drug which is an unmodified natural compound, is the exception • Most drugs require “work” -> need for target driven pipeline • Humane genome is available so all target are identified • How to validate (within a given disease area) ?
  • 17. Drug Discovery: What is a target ? • target - a molecule (often a protein) that is instrumental to a disease process (though not necessarily directly involved), which may be targeted with a potential therapeutic. • target identification - identifying a molecule (often a protein) that is instrumental to a disease process (though not necessarily directly involved), with the intention of finding a way to regulate that molecule's activity for therapeutic purposes. • target validation - a crucial step in the drug development process. Following the identification of a potential disease target, target validation verifies that a drug that specifically acts on the target can have a significant therapeutic benefit in the treatment of a given disease.
  • 18. Phenotypic Gap Total # genes # genes with known function Number of genes 1980 1990 2000 2010 Functional Genomics ? More than running chip experiments ! Proposal to prioritize hypothetical protein without annotation, nice for bioinformatics and biologist
  • 20. Where is optimal drug target ? “Optimal” drug target Predict side effect How to correct disease state Side effects ?
  • 27. Genome-wide RNAi RNAI vector bacteria producing ds RNA for each of the 20.000 genes proprietary nematode responding to RNAi 20.000 responses 20.000 genes insert library
  • 30. Type-II Diabetes Normal insulin signaling fat storage LOW Reduced insulin signaling fat storage HIGH
  • 31. Industrialized knock-downs 20,000 bacteria each containing selected C. elegans gene proprietary C.elegans strains • sensitized to silencing • sensitized to relevant pathway select genes with desired phenotypes
  • 34. Structural Genomics Molecular functions of 26 383 human genes
  • 36. Lipinsky for the target ? Database of all “drugable” human genes
  • 37. Drug Discovery: Design new drugs by computer ?
  • 38. Drug Discovery: Screening definitions screening - the automated examination and testing of libraries of synthetic and/or organic compounds and extracts to identify potential drug leads, based on the compound's binding affinity for a target molecule. screening library - a large collection of compounds with different chemical properties or shapes, generated either by combinatorial chemistry or some other process or by collecting samples with interesting biological properties. High Throughput Screening: Quick and Dirty… from 5000 compounds per day
  • 39. Drug Discovery: Screening Throughput • At the beginning of the 1990s, when the term "high-throughput screening" was coined, a department of 20 would typically be able to screen around 1.5 million samples in a year, each researcher handling around 75,000 samples. Today, four researchers using fully automated robotic technology can screen 50,000 samples a day, or around 2.5 million samples each year.
  • 40. Robotic arm Drug Discovery: HTS – The Wet Lab Distribution 96 / 384 wells Read-out Fluorescence / luminescence Optical Bank for stability
  • 41. Drug Discovery: Chemistry Sources • Available molecules collections from pharma, chemical and agro industry, also from academics (Eastern Europe) • Natural products from fungi, algae, exotic plants, Chinese and ethnobotanic medicines • Combinatorial chemistry: it is the generation of large numbers of diverse chemical compounds (a library) for use in screening assays against disease target molecules. • Computer drug design (from model substrates or X-ray structure)
  • 43. • initial screen established • Compounds screened • IC50s established • Structures verified • Minimum of three independent chemical series to evaluate • Positive in silico PK data Drug Discovery: HIT
  • 44. Drug Discovery: Hit/lead computational approaches • When the structure of the target is unknown, the activity data can be used to construct a pharmacophore model for the positioning of key features like hydrogen-bonding and hydrophobic groups. • Such a model can be used as a template to select the most promising candidates from the library.
  • 45. • lead compound - a potential drug candidate emerging from a screening process of a large library of compounds. • It basically affects specifically a biological process. Mechanism of activity (reversible/ irreversible, kinetics) established • Its is effective at a low concentration: usually nanomolar activity • It is not toxic to live cells • It has been shown to have some in vivo activity • It is chemically feasible. Specificity of key compound(s) from each lead series against selected number of receptors/enzymes • Preliminary PK in vivo (rodent) to establish benchmark for in vitro SAR • In vitro PK data good predictor for in vivo activity • Its is of course New and Original. Drug Discovery: Lead ?
  • 46. Lipinski: « rule of 5 » "In the USAN set we found that the sum of Ns and Os in the molecular formula was greater than 10 in 12% of the compounds. Eleven percent of compounds had a MWT of over 500. Ten percent of compounds had a CLogP larger than 5 (or an MLogP larger than 4.15) and in 8% of compounds the sum of OHs and NHs in the chemical structure was larger than 5. The "rule of 5" states that: poor absorption or permeation is more likely when: A. There are less than 5 H-bond donors (expressed as the sum of OHs and NHs); B. The MWT is less than 500; C. The LogP is less than 5 (or MLogP is < 4.15); D. There are less than 10 H-bond acceptors (expressed as the sum of Ns and Os). Compound classes that are substrates for biological transporters are exceptions to the rule." Christopher A. Lipinski, Franco Lombardo, Beryl W. Dominy, Paul J. Feeney "Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings":
  • 47. • A quick sketch with ChemDraw, conversion to a 3D structure with Chem3D, and processing by QuikProp, reveals that the problem appears to be poor cell permeability for this relatively polar molecule, with predicted PCaco and PMDCK values near 10 nm/s. • Free alternative (Chemsketch / PreADME)
  • 48. (Celebrex) Methyl in this position makes it a weaker cox-2 inhibitor, but site of metabolic oxidation and ensures an acceptable clearance Drug-like-ness
  • 49. To assist combinatorial chemistry, buy specific compunds
  • 51. Structural Descriptors: (15 descriptors) Molecular Formula, Molecular Weight, Formal Charge, The Number of Rotatable Bonds, The Number of Rigid Bonds, The Number of Rings, The Number of Aromatic Rings, The Number of H Bond Acceptors, The Number of H Bond Donors, The Number of (+) Charged Groups, The Number of (-) Charged Groups, No. single, double, triple, aromatic bonds Topological Descriptors:(350 descriptors) • Topological descriptors on the adjustancy and distance matrix • Count descriptors • Kier & Hall molecular connectivity Indices • Kier Shape Indices • Galvez topological charge Indices • Narumi topological index • Autocorrelation descriptor of atomic masses, atomic polarizability, Pauling electronegativity and van der Waals radius • Information content descriptors • Electrotopological state index (E-state) • Atomic-Level-Based AI topological descriptors Physicochemical Descriptor:(10 descriptors) AlogP98 (calculated logP), SKlogP (calculated logP), SKlogS in pure water (calculated water solubility), SKlogS in buffer system (calculated water solubility),SK vap (calculated vapor pressure), SK bp (calculated boiling point), SK mp (calculated meling point), AMR (calculated molecular refractivity), APOL(calculated polarizability), Water Solvation Free Energy Geometrical Descriptor:(9 descriptors) Topological Polar Surface Area, 2D van der Waals Volume, 2D van der Waals Surface Area, 2D van der Waals Hydrophobic Surface Area, 2D van der Waals Polar Surface Area, 2D van der Waals H-bond Acceptor Surface Area, 2D van der Waals H-bond Donor Surface Area, 2D van der Waals (+) Charged Groups Surface Area, 2D van der Waals (-) Charged Groups Surface Area
  • 52. Drug Discovery: Hit/lead computational approaches • What can you do with these descriptors ? • Cluster entire chemical library – Diversity set – Focused set
  • 53. Drug Discovery: Docking • Structure is known, virtual screening -> docking • Many different approaches – DOCK – FlexX – Glide – GOLD • Including conformational sampling of the ligand • Problem: – host flexibility – solvatation • Example: Bissantz et al. – Hit rate of 10% for single scoring function – Up to 70% with triple scoring (bagging)
  • 54. Drug Discovery: De novo design / rational drug design • Given the target site: • Docking + structure generator • Specialized approach: growing substituent on a core – LUDI – SPROUT – BOMB (biochemical and organic model builder) – SYNOPSIS • Problem is the scoring function which is different for every protein class
  • 55. Drug Discovery: Novel strategies using bio/cheminformatics - HTS ? Chemical space is big (1041) - Biased sets/focussed libraries -> bioinformatics !!! - How ? Use phylogenetics and known structures to define accesible (conserved) functional implicated residues to define small molecule pharmacophores (minimal requirements) - Desciptor search (cheminformatics) to construct/select biased compound set - ensure serendipity by iterative screening of these predesigned sets
  • 56. Drug Discovery Toxigenomics Metabogenomics
  • 58. Drug Discovery: Clinical studies • Preclinical - An early phase of development including initial safety assessment Phase I - Evaluation of clinical pharmacology, usually conducted in volunteers Phase II - Determination of dose and initial evaluation of efficacy, conducted in a small number of patients Phase III - Large comparative study (compound versus placebo and/or established treatment) in patients to establish clinical benefit and safety Phase IV - Post marketing study
  • 61. Drug Discovery & Development: IND filing
  • 65. Sneak preview Bioinformatics (re)loaded • Relational datamodels – BioSQL (MySQL) • Data Visualisation – Interface • Apache • PHP • Large Scale Statistics – Using R