SlideShare a Scribd company logo
Palmira, October 2018
Mueller lab
demoand
Guillaume Bauchet, Bryan Ellerbrock, David Lyon, Naama
Menda, Nicolas Morales, Alex C. Ogbonna, Adrian Powell, Titima
Tantikanjana and Isaak Y Tecle
Mueller lab
- Bioinformatics
- Genomics
- Databases
https://btiscience.org/lukas-mueller/#lab-members
https://www.facebook.com/solgenomics/
NEXTGEN CASSAVA
Ensuring Data Quality
• Integrated electronic data capture using the
Android Fieldbook and other tablet-based
solutions
• Digital data never “leaks” into “analog”
domain
• Widely used barcoding ensures data
collection quality
• Quality filtering upon upload
Cassavabase Status
• Has collected breeding data from all NextGen programs
• 9.7 million phenotypic observations
• 2488 trials
• 34,000 genotypes
• From phase I (2012-2017) to phase II (2018-2022):
• Increase data collection and ensure quality
• Increase database interoperability and expand the “digital ecosystem” to
farmers
Project Partners
https://github.com/solgenomic
s
https://yambase.org
https://sweetpotatobase.org
https://musabase.org
https://cassavabase.org
Expanding resources: BrAPI
• Breeding Application Programming Interface
(API)
• Language support: Brapi.R interface and Brapi.JS
• Data exchange
• New way for coding breeding applications
(BrAPPs)
• BrAPPs run on any data backend that supports
BrAPI
cassavabase
BMS
GOBii
Flapjack
Germinate
B4R
= Empower breeder’s toolbox to increase genetic gain
https://github.com/CIP-RIU/brapi
Tool Example:
Genotype Visualization
Today’s Demo Content
3.00-4.00: Breeding Data Management, Sample
Tracking and FieldApp (Guillaume)
4.00-5.00: Data analysis (Isaak)
User: sgn
Password: eggplant
1: Generic password:
Database training websites
https://cassava-test.sgn.cornell.edu/
https://cassava-test.sgn.cornell.edu/
2: Personal password:
User: breeder1, breeder2,…,breeder10
Password: ISTRC18
Account Privileges
Account Type Privileges
none Browse, use tools
“user" User database, forum
“submitter"
create trials, add phenotype
information etc.
“Curator” All previous + data deletions
Create New
Trial
Fieldbook
files
Creation
Collect Data
& samples
Import and
Setup Trial in
PhenoApps
Upload
Phenotypes
Historical data
Uploads
Phenotype/
Genotype
Analysis
Crossings
& Nursery
Search/Download
accessions-seeds
Manage
-> List
-> Dataset Manage data
collection
-> barcode tools
-> label design
-> phenoApps
Pedigree & Crossing
-> cross upload
-> seedlots
-> phenoApps
-> Selection Index
-> Summary statistics
-> Graphical filtering
Search tools
-> Single criteria
-> Wizard search
Select / add accessions
Database pipeline and tools: the Big picture
+ Trait ontologies
+ SNP marker data
-> ANOVA, HIDAP
-> Trial comparison
-> Genomic selection
Analysis
Workflows
data collection workflow
Phenotype data collection
Workflows
Tissue sampling
data collection workflow
Tutorial:
https://www.slideshare.net/solgenomics/sample-tracking-tutorialistrc2018
https://cassava-test.sgn.cornell.edu/breeders/search
-1- Select the
“2018_NGCGOBII_Gstr
ialdataset”
-2- Click ”select all”:
-3- Select ”traits” such as:
fresh root yield|CO_334:0000013
top yield|CO_334:0000017
root number counting|CO_334:0000011
harvest index variable|CO_334:0000015
dry matter content by specific gravity method|CO_334:0000160
dry matter content percentage|CO_334:0000092
cassava mosaic disease severity 1-month evaluation|CO_334:0000191
cassava mosaic disease severity 3-month evaluation|CO_334:0000192
cassava mosaic disease severity 6-month evaluation|CO_334:0000194
-4- Download data in
excel:
-5- Store your selection
as a dataset. It will be
stored under your
profile on cassavabase
an can be re-accessed
anytime (same as list)
Access data from cassavabase
• Exploratory
• Descriptive statistics
• Interactive visualization
• Pairwise multiple comparison
• Inferential
• ANOVA, correlation, population structure, clustering
• Genomic Prediction
• QTL analysis…coming soon
• GWAS…coming soon
• Efficiency
• Automation
• Reproducibility
• Access and sharing
Cassavabase-PhenoApps demo ISTRC 2018
Cassavabase-PhenoApps demo ISTRC 2018
Explore trial data
Cassavabase-PhenoApps demo ISTRC 2018
Filter interactively
Compare traits across trials
Analyze data
Cassavabase-PhenoApps demo ISTRC 2018
Cassavabase-PhenoApps demo ISTRC 2018
Check traits correlation
Run ANOVA
Calculate selection index
Check population structure (PCA)…
Partition samples into groups (clusters)
GWAS
GWAS
Genomic Prediction (solGS)
workflow
Phenotyped
&
genotyped individuals
Genomic selection…
Prediction model
Predicted
breeding
Values (GEBVs)
Genotyped selection
candidates
Training population
Prediction modeling
• Univariate
• Two-stage analysis
• GBLUP
• Marker-based realized relationship matrix
• Prediction accuracy
• Based on 2 replication, 10-fold cross-validation
Cassavabase-PhenoApps demo ISTRC 2018
Creating a training dataset
Cassavabase-PhenoApps demo ISTRC 2018
Fitting a prediction model
Cassavabase-PhenoApps demo ISTRC 2018
Cassavabase-PhenoApps demo ISTRC 2018
Exploring model input
Cassavabase-PhenoApps demo ISTRC 2018
Cassavabase-PhenoApps demo ISTRC 2018
Checking the model
Cassavabase-PhenoApps demo ISTRC 2018
Exploring model output
(GEBVs)
Cassavabase-PhenoApps demo ISTRC 2018
Cassavabase-PhenoApps demo ISTRC 2018
Estimating breeding values of
selection candidates
Applying the model…
Cassavabase-PhenoApps demo ISTRC 2018
Cassavabase-PhenoApps demo ISTRC 2018
Selection gain?
Cassavabase-PhenoApps demo ISTRC 2018
Genetic correlation
Cassavabase-PhenoApps demo ISTRC 2018
GEBVs based Multi-trait selection:
Selection index
Cassavabase-PhenoApps demo ISTRC 2018
Summary
• Exploratory and inferential analysis
• Interactive visualization
• Adds efficiency, reproducibility
• Easy access and sharing
Contact us!
USER MANUAL
CONTACT SGN TEAM
Contact us!
https://cassavabase.org/contact/form
Online manual: https://solgenomics.github.io/sgn/
Request new traits: http://submit.rtbbase.org/
Slides: http://www.slideshare.net/solgenomics
Looking for code?
Online Resources
Looking for database tutorials or ontology request?
Looking for phenoApps?
PhenoApps: https://github.com/PhenoApps
https://www.youtube.com/playlist?list=PLs7Y2nGwfz4E5_gv1H6Y4imeWDkFJDhIn
Cassavabase code: https://github.com/solgenomics
BrAPI code: https://brapi.org/
BrApps: https://brapi.org/brapps.php
Lukas
Mueller
Alex
Ogbonna
Bryan
Ellerbrock
Naama
Menda
Isaak
Tecle
Nick
Morales
Chiedozie
Egesi
Peter
Kulakow
Robert
Kawuki
Ismail
Rabbi
Prasad
Peteti
Afola
Agbona
Titima
Tantikanjana
Thanks!
Hernan
Ceballos
Eder
Oliveira

More Related Content

PDF
Cassavabase workshop ibadan March17
PPTX
Cassavabase workshop IITA oct2016
PPT
SolGS workshop 2016
PPT
SolGS Hyderabad conference 2016
PDF
Cassavabase general presentation PAG 2016
PPT
Gene Ontology Enrichment Network Analysis -Tutorial
PDF
Drug Discovery- ELRIG -2012
Cassavabase workshop ibadan March17
Cassavabase workshop IITA oct2016
SolGS workshop 2016
SolGS Hyderabad conference 2016
Cassavabase general presentation PAG 2016
Gene Ontology Enrichment Network Analysis -Tutorial
Drug Discovery- ELRIG -2012

What's hot (20)

PPTX
An examination of data quality on QSAR Modeling in regards to the environment...
PPTX
Gene Ontology WormBase Workshop International Worm Meeting 2015
PPTX
The needs for chemistry standards, database tools and data curation at the ch...
PDF
ReVeaLD: A user-driven domain-specific interactive search platform for biomed...
PDF
2015 Summer - Araport Project Overview Leaflet
PPTX
Structure Identification Using High Resolution Mass Spectrometry Data and the...
PDF
Research Methodology - Target Discovery
PPTX
Multi-omics methods and resources for Bioconductor
PPTX
2016 bmdid-mappings
PPTX
Structure Identification Using High Resolution Mass Spectrometry Data and the...
PPTX
Guided tutorial of the Neuroscience Information Framework
PPTX
Cheminformatics approaches to support chemical identification delivered via t...
PDF
Pathway Studio v.12 Release Notes
PDF
The influence of data curation on QSAR Modeling – examining issues of qualit...
PDF
Nowomics at Cambridge Open Research
PPTX
Incorporating new technologies and High Throughput Screening in the design an...
PPTX
How to Use mirtronDB
PPTX
Delivering The Benefits of Chemical-Biological Integration in Computational T...
PPTX
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
PPTX
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
An examination of data quality on QSAR Modeling in regards to the environment...
Gene Ontology WormBase Workshop International Worm Meeting 2015
The needs for chemistry standards, database tools and data curation at the ch...
ReVeaLD: A user-driven domain-specific interactive search platform for biomed...
2015 Summer - Araport Project Overview Leaflet
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Research Methodology - Target Discovery
Multi-omics methods and resources for Bioconductor
2016 bmdid-mappings
Structure Identification Using High Resolution Mass Spectrometry Data and the...
Guided tutorial of the Neuroscience Information Framework
Cheminformatics approaches to support chemical identification delivered via t...
Pathway Studio v.12 Release Notes
The influence of data curation on QSAR Modeling – examining issues of qualit...
Nowomics at Cambridge Open Research
Incorporating new technologies and High Throughput Screening in the design an...
How to Use mirtronDB
Delivering The Benefits of Chemical-Biological Integration in Computational T...
The EPA iCSS Chemistry Dashboard to Support Compound Identification Using Hig...
The EPA Online Prediction Physicochemical Prediction Platform to Support Envi...
Ad

Similar to Cassavabase-PhenoApps demo ISTRC 2018 (20)

PDF
1 introduction to cassavabase
PDF
Cassavabase-PhenoApp sample tracking
PDF
Cassava genome hub
PPTX
2013 Cornell's Plant Breeding and Genetic Seminar Series
PDF
B4FA 2012 Nigeria: Cassava Research in Nigeria - Emmanual Okogbenin
PPT
FruitBreedomics KOM 29-03-2011 9 WP7 presentation
PDF
Cassava digital genebank
PDF
GRM 2011: Improving cowpea productivity in Africa - J Ehlers
PPT
FruitBreedomics KOM Stakeholders meeting 31-03-2011 9 WP7 presentation and fe...
PPT
Cassava for sustainable poverty alleviation
PDF
Development of genomics pipelines and its integration with breeding
PDF
The Ginés‐Mera Fellowship Fund for Postgraduates Studies in Biodiversity
PDF
B4FA 2012 Tanzania: Combating cassava brown streak disease - Fortunus Anton K...
PDF
Cassavabase SolGS presentation PAG 2016
PPTX
Genotyping in Breeding programs
PPTX
Session 3.1 Review of Genetic Tools and Knowledge that could Contribute to Ca...
PDF
TLM III: Improve cowpea productivity for marginal environments in sub-Sahara...
PDF
Cassavabase SolGS poster PAG 2016
PPTX
Cassava at CIAT
1 introduction to cassavabase
Cassavabase-PhenoApp sample tracking
Cassava genome hub
2013 Cornell's Plant Breeding and Genetic Seminar Series
B4FA 2012 Nigeria: Cassava Research in Nigeria - Emmanual Okogbenin
FruitBreedomics KOM 29-03-2011 9 WP7 presentation
Cassava digital genebank
GRM 2011: Improving cowpea productivity in Africa - J Ehlers
FruitBreedomics KOM Stakeholders meeting 31-03-2011 9 WP7 presentation and fe...
Cassava for sustainable poverty alleviation
Development of genomics pipelines and its integration with breeding
The Ginés‐Mera Fellowship Fund for Postgraduates Studies in Biodiversity
B4FA 2012 Tanzania: Combating cassava brown streak disease - Fortunus Anton K...
Cassavabase SolGS presentation PAG 2016
Genotyping in Breeding programs
Session 3.1 Review of Genetic Tools and Knowledge that could Contribute to Ca...
TLM III: Improve cowpea productivity for marginal environments in sub-Sahara...
Cassavabase SolGS poster PAG 2016
Cassava at CIAT
Ad

More from solgenomics (20)

PDF
Sl4.0 and ITAG4.0
PDF
breeding informatics solutions at SGN
PDF
Musabase PAG 2018
PDF
Improvements in the Tomato Reference Genome (SL3.0) and Annotation (ITAG3.0)
PPTX
Musa base phenotyping workflow demo
PDF
Sql cheat sheet
PDF
Introduction to SQL
PPTX
YamBase phenotyping workflow demo
PPTX
Introduction to YamBase
PDF
2 Cassavabase workshop: search menu
PDF
3a Cassavabase worksop: manage breeding-program ands locations
PDF
3b Cassavabase workshop: manage accessions
PDF
3c Cassavabase workshop: manage-crosses
PDF
3d Cassavabase workshop: manage field-trial
PDF
3e Cassavabase workshop: manage genotyping-trials
PDF
3f Cassavabase workshop: manage field-book
PDF
3g Cassavabase workshop: manage phenotyping
PDF
3h Cassavabase workshop: manage barcode
PDF
4 Cassavabase workshop: analyze menu
PDF
5 Cassavabase workshop: contact us
Sl4.0 and ITAG4.0
breeding informatics solutions at SGN
Musabase PAG 2018
Improvements in the Tomato Reference Genome (SL3.0) and Annotation (ITAG3.0)
Musa base phenotyping workflow demo
Sql cheat sheet
Introduction to SQL
YamBase phenotyping workflow demo
Introduction to YamBase
2 Cassavabase workshop: search menu
3a Cassavabase worksop: manage breeding-program ands locations
3b Cassavabase workshop: manage accessions
3c Cassavabase workshop: manage-crosses
3d Cassavabase workshop: manage field-trial
3e Cassavabase workshop: manage genotyping-trials
3f Cassavabase workshop: manage field-book
3g Cassavabase workshop: manage phenotyping
3h Cassavabase workshop: manage barcode
4 Cassavabase workshop: analyze menu
5 Cassavabase workshop: contact us

Recently uploaded (20)

PPTX
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
PPTX
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
PDF
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PDF
bbec55_b34400a7914c42429908233dbd381773.pdf
PDF
An interstellar mission to test astrophysical black holes
PDF
HPLC-PPT.docx high performance liquid chromatography
PPTX
famous lake in india and its disturibution and importance
PPTX
Introduction to Fisheries Biotechnology_Lesson 1.pptx
PPTX
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
PDF
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
PPTX
Classification Systems_TAXONOMY_SCIENCE8.pptx
PPTX
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
PPTX
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PPTX
Microbiology with diagram medical studies .pptx
PDF
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PPTX
2. Earth - The Living Planet Module 2ELS
PDF
AlphaEarth Foundations and the Satellite Embedding dataset
EPIDURAL ANESTHESIA ANATOMY AND PHYSIOLOGY.pptx
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
SEHH2274 Organic Chemistry Notes 1 Structure and Bonding.pdf
7. General Toxicologyfor clinical phrmacy.pptx
bbec55_b34400a7914c42429908233dbd381773.pdf
An interstellar mission to test astrophysical black holes
HPLC-PPT.docx high performance liquid chromatography
famous lake in india and its disturibution and importance
Introduction to Fisheries Biotechnology_Lesson 1.pptx
GEN. BIO 1 - CELL TYPES & CELL MODIFICATIONS
ELS_Q1_Module-11_Formation-of-Rock-Layers_v2.pdf
Classification Systems_TAXONOMY_SCIENCE8.pptx
SCIENCE10 Q1 5 WK8 Evidence Supporting Plate Movement.pptx
ognitive-behavioral therapy, mindfulness-based approaches, coping skills trai...
TOTAL hIP ARTHROPLASTY Presentation.pptx
Microbiology with diagram medical studies .pptx
Formation of Supersonic Turbulence in the Primordial Star-forming Cloud
Phytochemical Investigation of Miliusa longipes.pdf
2. Earth - The Living Planet Module 2ELS
AlphaEarth Foundations and the Satellite Embedding dataset

Cassavabase-PhenoApps demo ISTRC 2018