SlideShare a Scribd company logo
Autonomous model building with a preponderance
of well annotated assay protocols
Alex M. Clark
http://www.bioassayexpress.com
⚡
COLLABORATIVE DRUG DISCOVERY
Background
◈ Assay protocols are text
◈ Institutions suffering from

data overload:
▷ much interest in using 

consistent markup
▷ semantic web ontologies: 

BAO, DTO, CLO, GO...
◈ Since last talk: many refinements, some new
structure-activity analysis features
2
Cell-Free Homogeneous Primary HTS to Identify
Inhibitors of GSK3beta Activity
(1) Dispense 1 uL/well of CABPE, 0.5 uL of ATP, and
1 uL of positive control GW8510 or AB in
respective wells according to plate design to
1536-well assay ready plates (Aurora 29847) that
contain 2.5 nL/well of 10 mM compound using
BioRAPTR (Beckman) to start the reaction.
Incubate at room temperature for 60 minutes.
(2) Add 2.5 uL/well of ADP-glo (Promega, V9103)
with BioRAPTR, incubate at room temperature for
40 minutes
(3) Add 5 uL/well of ADP-glo (Promega, V9103) with
Combi nL (Thermo), incubate at room
temperature for 30 minutes
Curation
◈ Web UI
◈ Machine
learning
◈ Common
Assay
Template
◈ PubChem
data
3
Public Data
◈ Designed Common Assay
Template for summary
◈ Data from PubChem:
▷ 3500 assays selected
▷ mostly MLPCN
◈ Curated as a validation set
◈ PubChem also has
molecules...
4
Analysis
◈ Selected:
▷ Secondary assays
▷ Infectious diseases
▷ IC50 results
5
Analysis
◈ Selected:
▷ Secondary assays
▷ Infectious diseases
▷ IC50 results
5
Molecules
◈ Per assay: convenient way to view & download...
6
Groups of Assays
7
Bayesian Models
8
Bayesian Models
8
Bayesian Models
8
Bayesian Models
8
Proposed Molecules
9
Proposed Molecules
9
Proposed Molecules
9
Proposed Molecules
9
Iterative Structures
10
Iterative Structures
10
Iterative Structures
10
Grids
◈ View known & calculated measurements, plotting
assays vs. compounds...
11
Grids
◈ View known & calculated measurements, plotting
assays vs. compounds...
11
Real Data Only
12
◈ Assays are singletons by default, can be grouped...
Real Data Only
12
Filling in Blanks
13
Filling in Blanks
13
◈ Bayesian model created
for each column
◈ SAR for all assays used
◈ Missing cells replaced
with predictions
◈ Calculations realtime:
▷ model on server
▷ predictions on client
Trend Detection
14
◈ Model compound found using the
Selectivity option:
Trend Detection
14
◈ Model compound found using the
Selectivity option: multidrug resistant
tuberculosis
Trend Detection
14
◈ Model compound found using the
Selectivity option: multidrug resistant
tuberculosis trypanosomiasis
(ROC 0.56)
Possible Interest
15
Possible Interest
15
Future Work
◈ Current focus is UI/UX and proof-of-concept...
▷ intensive data mining is next: annotations + SAR
▷ data is ready to reveal some amazing discoveries
◈ Crowd curation may be necessary, and more detailed
templates
◈ More modelling techniques, far beyond Bayesians:
parallel project is underway...
16
Deployment
◈ Public
▷ http://www.bioassayexpress.com
▷ http://beta.bioassayexpress.com (curation)
◈ Vault
▷ sync with CDD Vault through API
◈ Private
▷ custom integration, installed behind firewall
17
Acknowledgments
◈ Collaborative Drug Discovery
⇨ Barry Bunin
⇨ Janice Kranz
⇨ Hande Küçük
⇨ Peter Gedeck
18
◈ And we're hiring
⇨ resumes to:
◈ More information
http://github.com/cdd/bioassay-template
http://www.bioassayexpress.com
http://collaborativedrug.com
alex@collaborativedrug.com

More Related Content

PDF
Bringing bioassay protocols to the world of informatics, using semantic annot...
PDF
ACS Denver 2024: Assay annotation with ontologies
PDF
CDD BioAssay Express: Expanding the target dimension: How to visualize a lot ...
PDF
2015.04.08-Next-generation-sequencing-issues
PPTX
Next-Gen Drug Discovery: An Integrated Micro-Droplet Based Platform
PDF
BioAssay Express
PDF
What can your library do for you?
PPT
Molecular modelling for in silico drug discovery
Bringing bioassay protocols to the world of informatics, using semantic annot...
ACS Denver 2024: Assay annotation with ontologies
CDD BioAssay Express: Expanding the target dimension: How to visualize a lot ...
2015.04.08-Next-generation-sequencing-issues
Next-Gen Drug Discovery: An Integrated Micro-Droplet Based Platform
BioAssay Express
What can your library do for you?
Molecular modelling for in silico drug discovery

Similar to Autonomous model building with a preponderance of well annotated assay protocols (20)

PDF
Identifying pattern in reaction networks of computational models
PPTX
Discovery PBPK: Efficiently using machine learning & PBPK modeling to drive l...
PPT
Luscher Lab Meeting
PDF
Large scale classification of chemical reactions from patent data
PDF
Accelerate Delivery of High Producing Cell Lines
PDF
Accelerate Delivery of High Producing Cell Lines
PPTX
CRISPR bacterial transformation mixes
PPTX
Using open bioactivity data for developing machine-learning prediction models...
PDF
GPU-accelerated Virtual Screening
PDF
Structural databases
PPTX
XabTracker & SeqAgent: Integrated LIMS & Sequence Analysis Tools for Antibody...
PPT
Goslar2010 poster
PPT
Cadd and molecular modeling for M.Pharm
PDF
iMate Protocol Guide version 3.0
PPTX
Poster_AR_V5
PPT
Informatics In The Manchester Centre For Integrative Systems Biology
PDF
ChemDiv CNS BBB Library
PDF
BCSRCv1.3
PDF
Mining Big datasets to create and validate machine learning models
DOCX
1PhylogeneticAnalysisHomeworkassignmentThisa.docx
Identifying pattern in reaction networks of computational models
Discovery PBPK: Efficiently using machine learning & PBPK modeling to drive l...
Luscher Lab Meeting
Large scale classification of chemical reactions from patent data
Accelerate Delivery of High Producing Cell Lines
Accelerate Delivery of High Producing Cell Lines
CRISPR bacterial transformation mixes
Using open bioactivity data for developing machine-learning prediction models...
GPU-accelerated Virtual Screening
Structural databases
XabTracker & SeqAgent: Integrated LIMS & Sequence Analysis Tools for Antibody...
Goslar2010 poster
Cadd and molecular modeling for M.Pharm
iMate Protocol Guide version 3.0
Poster_AR_V5
Informatics In The Manchester Centre For Integrative Systems Biology
ChemDiv CNS BBB Library
BCSRCv1.3
Mining Big datasets to create and validate machine learning models
1PhylogeneticAnalysisHomeworkassignmentThisa.docx
Ad

More from Alex Clark (20)

PDF
Mixing small molecules and macromolecules in the world of informatics
PDF
ACS Denver 2024: Generative chemistry with deep learning models
PDF
Mixtures QSAR: modelling collections of chemicals
PDF
Mixtures InChI: a story of how standards drive upstream products
PDF
Mixtures as first class citizens in the realm of informatics
PDF
Mixtures: informatics for formulations and consumer products
PDF
Coordination InChI (2019)
PDF
Chemical mixtures: File format, open source tools, example data, and mixtures...
PDF
ACS CINF Luncheon talk (Boston 2018)
PDF
Representing molecules with minimalism: A solution to the entropy of informatics
PDF
SLAS2016: Why have one model when you could have thousands?
PDF
The anatomy of a chemical reaction: Dissection by machine learning algorithms
PDF
Compact models for compact devices: Visualisation of SAR using mobile apps
PDF
Green chemistry in chemical reactions: informatics by design
PDF
ICCE 2014: The Green Lab Notebook
PDF
Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)
PDF
Building a mobile reaction lab notebook (ACS Dallas 2014)
PDF
Reaction Lab Notebooks for Mobile Devices - Alex M. Clark - GDCh 2013
PDF
Alex Clark : NETTAB 2013
PDF
Open Drug Discovery Teams @ Hacking Health Montreal
Mixing small molecules and macromolecules in the world of informatics
ACS Denver 2024: Generative chemistry with deep learning models
Mixtures QSAR: modelling collections of chemicals
Mixtures InChI: a story of how standards drive upstream products
Mixtures as first class citizens in the realm of informatics
Mixtures: informatics for formulations and consumer products
Coordination InChI (2019)
Chemical mixtures: File format, open source tools, example data, and mixtures...
ACS CINF Luncheon talk (Boston 2018)
Representing molecules with minimalism: A solution to the entropy of informatics
SLAS2016: Why have one model when you could have thousands?
The anatomy of a chemical reaction: Dissection by machine learning algorithms
Compact models for compact devices: Visualisation of SAR using mobile apps
Green chemistry in chemical reactions: informatics by design
ICCE 2014: The Green Lab Notebook
Cloud hosted APIs for cheminformatics on mobile devices (ACS Dallas 2014)
Building a mobile reaction lab notebook (ACS Dallas 2014)
Reaction Lab Notebooks for Mobile Devices - Alex M. Clark - GDCh 2013
Alex Clark : NETTAB 2013
Open Drug Discovery Teams @ Hacking Health Montreal
Ad

Recently uploaded (20)

PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PDF
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PPTX
2. Earth - The Living Planet Module 2ELS
PDF
An interstellar mission to test astrophysical black holes
PPTX
famous lake in india and its disturibution and importance
PDF
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
PPT
POSITIONING IN OPERATION THEATRE ROOM.ppt
PPT
protein biochemistry.ppt for university classes
PDF
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPTX
Microbiology with diagram medical studies .pptx
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PPTX
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
PDF
HPLC-PPT.docx high performance liquid chromatography
PDF
Sciences of Europe No 170 (2025)
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
Looking into the jet cone of the neutrino-associated very high-energy blazar ...
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
TOTAL hIP ARTHROPLASTY Presentation.pptx
2. Earth - The Living Planet Module 2ELS
An interstellar mission to test astrophysical black holes
famous lake in india and its disturibution and importance
Cosmic Outliers: Low-spin Halos Explain the Abundance, Compactness, and Redsh...
POSITIONING IN OPERATION THEATRE ROOM.ppt
protein biochemistry.ppt for university classes
Assessment of environmental effects of quarrying in Kitengela subcountyof Kaj...
ECG_Course_Presentation د.محمد صقران ppt
Classification Systems_TAXONOMY_SCIENCE8.pptx
Microbiology with diagram medical studies .pptx
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
Protein & Amino Acid Structures Levels of protein structure (primary, seconda...
HPLC-PPT.docx high performance liquid chromatography
Sciences of Europe No 170 (2025)
Introduction to Fisheries Biotechnology_Lesson 1.pptx

Autonomous model building with a preponderance of well annotated assay protocols