Mauricio Parra Quijano
FAO consultant
International Treaty on Plant Genetic Resources for Food and Agriculture
CAPFITOGEN Program Coordinator
Tools
ColNucleo
Obtaining ecogeographical core collections based on ELC maps
Again about genetic representativeness
[Figure: three variants, A (accggtccc), B (accggtcgc) and C (accggtctc), and how often each occurs among the accessions of a collection]
When collections are very large (>1000)…
[Figure: subsets drawn from a large collection in three ways: random, by genotype, by phenotype]
But not real
What information should we use to select?
Characterization:
Morphological
Biochemical/Molecular
Agronomic/Physiological/Phytopathology/Entomology
Types of core collections according to data
• Random
• Political / Administrative
• Phenotypic (morphological)
• Phenotypic (quantitative traits of agronomic interest)
• Genotypic (molecular markers - neutral)
• Ecogeographical (adaptation to the abiotic environment)
• Mixed / Cumulative
Ecogeographical core collections
• The first ideas about building core collections (CC) from adaptation data date back to 1995
• Only in 2000-2010 did the use of GIS become popular in plant genetic resources (PGR)
• In 2005 the first ELC map was created
• In 2009, two ecogeographical core collections were obtained and validated
Ecogeographical core collections
Determination of representativeness
Criteria: mean, variance, matching ranges, coefficient of variation
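The four criteria named on this slide can be computed per variable by comparing a core collection with the whole collection. The following is a minimal sketch, assuming one quantitative variable stored in plain Python lists with a non-zero mean; the function name `representativeness`, the dictionary keys, and the sample values are illustrative and are not taken from ColNucleo.

```python
from statistics import mean, variance

def representativeness(whole, core):
    """Compare one quantitative variable between a whole collection and its core."""
    m_w, m_c = mean(whole), mean(core)
    v_w, v_c = variance(whole), variance(core)      # sample variances
    r_w = max(whole) - min(whole)                    # range of the whole collection
    r_c = max(core) - min(core)                      # range retained by the core
    cv_w = (v_w ** 0.5) / m_w                        # coefficient of variation
    cv_c = (v_c ** 0.5) / m_c
    return {
        "mean_diff_pct": 100 * abs(m_w - m_c) / abs(m_w),
        "variance_diff_pct": 100 * abs(v_w - v_c) / v_w,
        "range_match_pct": 100 * r_c / r_w,
        "cv_ratio_pct": 100 * cv_c / cv_w,
    }

# Made-up values for one variable (hypothetical data)
whole = [2.0, 4.0, 6.0, 8.0, 10.0, 12.0]
core = [2.0, 6.0, 12.0]
result = representativeness(whole, core)
```

A core that keeps the extremes of the whole collection, as in this example, retains the full range even though its mean shifts slightly.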
Ecogeographical CC vs Phenotypic CC
Determination of representativeness
What does ColNucleo offer?
Starting with an ELC map (from the ELCmapas tool)
[Figure: options P and C; sampling intensity 10%, 15%, 20%, …; collection sizes 1000 and 100]
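Drawing a core collection from an ELC map at a chosen sampling intensity can be pictured as stratified sampling over the ELC categories. The sketch below is an illustration under assumptions, not ColNucleo's implementation: it assumes proportional allocation (each category contributes in proportion to its size, with at least one accession), and the names `stratified_core` and `collection` are hypothetical.

```python
import random
from collections import defaultdict

def stratified_core(accessions, intensity, seed=0):
    """accessions: list of (accession_id, elc_category) pairs.
    intensity: fraction of each category to sample, in (0, 1]."""
    rng = random.Random(seed)                  # fixed seed for a repeatable draw
    by_category = defaultdict(list)
    for acc_id, category in accessions:
        by_category[category].append(acc_id)
    core = []
    for category in sorted(by_category):
        members = by_category[category]
        # Proportional allocation, never fewer than one accession per category
        n = max(1, round(len(members) * intensity))
        core.extend(rng.sample(members, n))
    return core

# Hypothetical collection: 1000 accessions spread evenly over 5 ELC categories
collection = [(f"ACC{i:04d}", i % 5) for i in range(1000)]
core = stratified_core(collection, intensity=0.10)
```

With a 10% intensity this collection of 1000 accessions yields a core of 100, and every ELC category is represented.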
Seed availability?
Ecogeographical core collection
In addition…
• Phenotypic/genotypic validation is advisable
• Apply a further stepwise strategy by selecting other types of variables (descriptors)
• Select by phenotypic/genotypic representativeness, not randomly
One or more core collections?
FIGS_R
Determination of subsets focused on traits of interest for breeders (Focused Identification of Germplasm Strategy)
Why is it so difficult to use germplasm?
• Poor visibility of the germplasm collections
• Lack of information on the preserved material
• The available information is not very useful in practice
• Limited accessibility to information
• Inaccessibility of germplasm
• Limited interest among breeders in using germplasm collections
Conflict of interests…
• Curators: representativeness
• Breeders: traits
The paradox of the use of PGR
• Breeders frequently face collections of 1000 entries or more
• They have limited capacity for testing
• Breeders use 100 or 150 entries at most to evaluate a trait of particular interest, as part of their routine activity
• Breeders need information (characterization/evaluation data) on the preserved germplasm to make use of it
• PGR curators prioritize conservation and characterize only when enough funds are available
• Very few evaluation data exist (or at least are available), which leads to almost random selections by breeders
• Funds to characterize and evaluate the germplasm are always scarce or insufficient
• Low level of use, reduced interest
• Gradual reduction of funds for characterizing/evaluating
Focused Identification of Germplasm Strategy
• Original idea from Michael Mackay (1986, 1990, 1995)
Phenotype = Genotype + Environment + (GxE)
• Identifies germplasm with a high probability of containing genetic diversity for the trait of interest
• Uses ecogeographical information to predict trait occurrence as a preliminary step to field trials, where breeders ultimately confirm the existence of the trait
• No previous characterization or field evaluation effort is required, and the number of entries delivered to breeders for evaluation is reduced
Resistance/Tolerance = Genotype + Environment + (GxE)
• Generates FIGS subcollections (≠ core collections)
Enhancing the
First approach…
[Figure: FOCUSED IDENTIFICATION OF GERMPLASM STRATEGY. Data layers (temperature, salinity score, elevation, rainfall, agro-climatic zone, disease distribution) sieve accessions based on latitude & longitude. GIS layers / ecogeographical variables applied to the germplasm give the filtered germplasm. Source: figure from Mackay (1995)]
We use expert knowledge
• Species experts
• Breeders
• Entomologists, phytopathologists
Second approach… modeling
Classification method                   AUC    Kappa   Field validation
Principal Component Regression (PCR)    0.69   0.40    ?
Partial Least Squares (PLS)             0.69   0.41    ?
Random Forest (RF)                      0.70   0.42    ?
Support Vector Machines (SVM)           0.71   0.44    ?
Artificial Neural Networks (ANN)        0.71   0.44    ?
Y = b + X1 + X2 + X3, where Y is resistance/tolerance and X1, X2, X3 are ecogeographical variables
(Genebank: ICARDA wheat collection. Trait: stem rust, Puccinia graminis)
Source: Bari et al., 2012. Focused identification of germplasm strategy (FIGS) detects wheat stem rust resistance linked to environmental variables. Genet Resour Crop Evol 59(7):1465-1481
[Figure: a pattern found in evaluated/characterized germplasm is used to predict on non-evaluated/non-characterized germplasm]
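The modeling slide reports AUC and Kappa for each classifier. As a reminder of what these two numbers measure, here is a minimal pure-Python sketch. The labels and scores are made-up illustration data, not values from Bari et al. (2012), and the 0.5 cut-off used to turn scores into predictions is an assumption.

```python
def auc(labels, scores):
    """Probability that a random positive outranks a random negative (ties count 0.5)."""
    pos = [s for l, s in zip(labels, scores) if l == 1]
    neg = [s for l, s in zip(labels, scores) if l == 0]
    wins = sum((p > n) + 0.5 * (p == n) for p in pos for n in neg)
    return wins / (len(pos) * len(neg))

def cohen_kappa(labels, predictions):
    """Agreement between observed and predicted classes, corrected for chance."""
    n = len(labels)
    observed = sum(l == p for l, p in zip(labels, predictions)) / n
    expected = sum(
        (labels.count(c) / n) * (predictions.count(c) / n) for c in (0, 1)
    )
    return (observed - expected) / (1 - expected)

# Hypothetical data: 1 = resistant, 0 = susceptible
labels = [1, 1, 1, 0, 0, 0, 0, 1]
scores = [0.9, 0.8, 0.4, 0.3, 0.6, 0.2, 0.1, 0.7]
predictions = [int(s >= 0.5) for s in scores]   # assumed 0.5 cut-off

print(auc(labels, scores))               # 0.9375
print(cohen_kappa(labels, predictions))  # 0.5
```

An AUC of 0.5 and a Kappa of 0 both correspond to chance-level prediction, which gives a baseline for reading the values in the table.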
What does FIGS_R offer?
It generates FIGS subsets via filtering
[Figure: passport data table (X, Y coordinates) → ECOGEO → ecogeographical characterization matrix (elevation, average annual temperature, edaphic organic carbon, topsoil pH, …)]
• FIGS_R characterizes the collection ecogeographically using the selected variables
• It uses up to three ecogeographical variables and performs a stepwise selection
[Figure: annual precipitation (primary variable), edaphic clay (secondary variable), slope (tertiary variable); selection intensity, with the values 40 and 4]
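A stepwise selection over a primary, a secondary and a tertiary variable can be pictured as successive filters, each applied to what survived the previous one. The sketch below works under stated assumptions: the data, the `stepwise_select` helper, and the choice to keep the lowest fraction at every step are hypothetical and do not reproduce FIGS_R internals.

```python
def stepwise_select(rows, steps):
    """rows: list of dicts, one per accession.
    steps: list of (variable, fraction_to_keep) in priority order.
    At each step, keep that fraction of the remaining rows with the lowest values."""
    selected = list(rows)
    for variable, fraction in steps:
        selected.sort(key=lambda r: r[variable])
        keep = max(1, int(len(selected) * fraction))
        selected = selected[:keep]
    return selected

# Hypothetical ecogeographical values for 40 accessions
rows = [
    {"id": f"ACC{i}", "annual_precip": 200 + 10 * i,
     "clay": (i * 7) % 40, "slope": (i * 3) % 15}
    for i in range(40)
]
subset = stepwise_select(
    rows,
    [("annual_precip", 0.5), ("clay", 0.5), ("slope", 0.4)],
)
```

Here the 40 accessions shrink to 20, then 10, then 4. The order of the variables matters, because each later filter only sees accessions that passed the earlier ones.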
• It selects entries from a range of values for each variable, or from a proportion of the distribution of values (e.g. lower 30%), in separate processes for each variable
[Figure: proportion of the distribution (40% lower, 35% higher); range (lower value, upper value)]
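The two selection modes, a value range or a proportion of the distribution, can be sketched for a single variable as follows. This is only an illustration: the function names, the `tail` parameter and the elevation values are assumptions, not FIGS_R parameters.

```python
def select_by_range(values, lower, upper):
    """Return indices of entries whose value lies within [lower, upper]."""
    return [i for i, v in enumerate(values) if lower <= v <= upper]

def select_by_proportion(values, proportion, tail="lower"):
    """Return indices of the given proportion of entries, taken from the
    lower or the upper tail of the distribution."""
    order = sorted(range(len(values)), key=lambda i: values[i],
                   reverse=(tail == "upper"))
    keep = round(len(values) * proportion)
    return sorted(order[:keep])

# Hypothetical elevation values (m) for ten accessions
elevation = [120, 950, 430, 2100, 60, 1500, 780, 310, 1250, 40]

in_range = select_by_range(elevation, 300, 1000)                  # [1, 2, 6, 7]
lowest_40 = select_by_proportion(elevation, 0.40, tail="lower")   # [0, 4, 7, 9]
highest_30 = select_by_proportion(elevation, 0.30, tail="upper")  # [3, 5, 8]
```

A range suits cases where experts know the threshold values; a proportion is useful when only the direction (low or high) is known.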
• Depending on the user's choice, it can use an ELC map to try to balance the selection of accessions, taking the fraction of the distribution from each category
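Balancing by ELC category means taking the chosen fraction of the distribution within each category rather than across the whole collection. A minimal sketch, assuming the lower tail is wanted; the `balanced_lower_fraction` helper and the data are hypothetical and only illustrate the idea.

```python
from collections import defaultdict

def balanced_lower_fraction(entries, fraction):
    """entries: list of (accession_id, elc_category, value).
    Take the lowest `fraction` of values within each ELC category."""
    groups = defaultdict(list)
    for entry in entries:
        groups[entry[1]].append(entry)
    chosen = []
    for category in sorted(groups):
        members = sorted(groups[category], key=lambda e: e[2])
        keep = max(1, round(len(members) * fraction))
        chosen.extend(members[:keep])
    return chosen

# Hypothetical annual precipitation (mm) in two ELC categories
entries = [
    ("A1", 1, 300), ("A2", 1, 320), ("A3", 1, 350), ("A4", 1, 400), ("A5", 1, 410),
    ("B1", 2, 900), ("B2", 2, 950), ("B3", 2, 980), ("B4", 2, 1000), ("B5", 2, 1100),
]
balanced = balanced_lower_fraction(entries, 0.40)
selected_ids = [e[0] for e in balanced]   # ['A1', 'A2', 'B1', 'B2']
```

Without balancing, the lowest 40% of this collection would come entirely from category 1; with balancing, both categories contribute their own driest accessions.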
• Like ColNucleo, it can take into account the availability of the germplasm indicated by the curator
One or more FIGS subsets?
Presentation4 - ColNucleo & FIGS_R tools