Predicting Aflatoxin levels: An Spatial
                            Autoregressive approach

                                  Gissele Gajate-Garrido, IFPRI


  International Food Policy Research Institute           Uniformed Services University of the Health Sciences
  International Center for the Improvement of Maize      ACDI/VOCA/Kenya Maize Development Program
and Wheat                                                Kenya Agricultural Research Institute
  International Crops Research Institute for the Semi-   Institut d’Economie Rurale
  Arid Tropics                                           The Eastern Africa Grain Council
  University of Pittsburgh
   Collecting aflatoxin information is time
    consuming and expensive.
   Sometimes we can have aflatoxin
    information from a smaller sample of
    households.
   These information could be useful to
    predict the level of aflatoxins in other
    households with similar characteristics.
A Spatial Autoregressive
Model (SAR) uses the
household characteristics
and the aflatoxin level of
people around it to predict
aflatoxin levels in each
household.
   This model gives more weight
       Aflatoxin level                  to the information of my
                                        closest “neighbors” and less to
                                        the ones that are further away.
                                       My “neighbors” information
                                        could help predict my own
Observable        Unobservable:         aflatoxin level since it could
characteristics   - Attitudes
                  - Risk aversion
                                        contain information that
                  - Motivation
                                        usually is not captured by
                                        surveys.
                                       When we estimate models
                                        there is always an error term
                                        present that represents the
                                        variation that we are unable to
                                        capture.
   There are variables such as a person’s
    determination or innate ability that could help
    predict how much time and effort they will invest
    in preventing aflatoxins in their crops.
   These variables cannot be observed or recorded in
    a survey.
   However, by capturing information about my
    peers this could help provide additional
    information about how I behave and how high is
    my aflatoxin level.
   In order to asses who is “closest” to me I use
    location variables:
     Longitude

     Latitude

     Elevation

     Slope
      ▪ (Only for the pre-harvest sample)
90%                                                            Storage
80%
70%                                                                                   63%
                                                                              74%
60%              Production
50%
                                           38%
40%
                         29%                                         27%
30%
20%
                                  6%                9%       9%
10%     6%
                 2%
0%
       Treated Improved Pesticide Fertilizer Insect Rodent Plastic Storage: Frequent Hand
      soil (lime, seed                       damage damage bags for special use of sorting
       manure,                                             storage room pestcide before
         etc.)                                                      inside     in    storage
                                                                    house storage
   We use      100%
                                        Aflatoxin variation
    data from   90%
                80%    36%
    Mali to
                70%               The inside sample prediction captures
    test the    60%               36% of the variation in prevalence
    model.      50%
                                  values.

                40%    64 %       Yet, the information of my neighbors
                                  is not useful to predict my prevalence
   We start    30%
                                  levels, only my characteristics are
    with pre-   20%
                                  relevant.
                10%
    harvest
                 0%
    data.                     My neighbors'   My characteristics   Unobservable
2.5

 2
                                                         The relationship
1.5
                                                         between
 1                              1.04 ***                 predicted and
0.5                                                      real values is
 0                                                       almost 1 to 1.
      0        1         2           3                   It is significant at
       Measured prevalence (part per billion)            1%.
         Predicted prevalence      45 degree line

Variable                             Obs        Mean Std. Dev. Min Max
Measured prevalence                  247        27.2   64.0    0.05 492.0
Predicted prevalence                 247        29.6   26.9    0.00 130.7
Kernel density estimate for Pre-harvest Aflatoxin levels
          .04




                                   76%                                                       The model is not
          .03




                                                                                             able to capture
Density




                                                                                             extremely high
          .02




                                                                                             values of
                                                                                             prevalence and in
                                                                                             general
                                     43%
          .01




                                                                                             overestimates
                                                                                             lower values.
                0




                     0 20            100             200            300          400   500
                                                 prevalence (part per billion)

                                                 Kernel density measured prevalence
                                                 Kernel density predicted prevalence
                    kernel = epanechnikov, bandwidth = 3.8288
Kernel density estimate for Main HH Pre-harvest Aflatoxin levels
                   .01
                .008
                .006
      Density




                               37%
                .004




                                                  63%
                .002

                         0




                                  0     20       50            100           150           200   250
                                              Kernel density predicted prevalence for Main HH
                             kernel = epanechnikov, bandwidth = 12.9933



Variable                                Obs Mean Std. Dev.                                             Min   Max
Predicted prevalence for main HH survey 1169 58.4  59.3                                                0.0   223.1
   Post-harvest data
    after 1 month in                Total variation in aflatoxin levels
    storage

   During storage
                          Variation                                        Variation
    not only your         explained by                                     explained by
    characteristics but   personal                                         neighbors
                          characteristics                                  aflatoxin level
    also your
    "neighbors"
    information help                        Unexplained variation = 62 %

    explain your
    aflatoxin level.          The inside sample prediction captures 38% of
                              the variation in prevalence values.
2.5

     2                                                       The
    1.5                                                      relationship
                                                             between
      1
                                      0.95 ***               predicted and
    0.5                                                      real values is
                                                             almost 1 to 1.
     0
          0             1            2               3       It is significant
              Measured prevalence (part per billion)         at 1%.
              Predicted prevalence      45 degree line

Variable                               Obs       Mean    Std. Dev.   Min   Max
Measured prevalence                    243       121.9     256.9     0.0   1911.2
Predicted prevalence                   243       129.0     130.5     0.0    778.0
   The same methodology applied to the data in Mali
    will be applied to the data in Kenya.
   Hence will be able to predict prevalence levels for
    the main household survey and use it for further
    analysis.
   Should we expect similar results?
     Different crops
      ▪ Mali –groundnuts vs. Kenya – maize
     It also depends on production and storage practices in
      Kenya.
Predicting aflatoxin levels a spatial autoregressive approach
   We have two models that can be used to
    predict aflatoxin models:
     Maxent
     SAR model

   We need to compare the strengths and
    weakness of both models.

   We can also consider introducing other
    variables to improve the predictions.
Current Partners:

Donor: Bill and Melinda Gates Foundation

Center/ Universities
         IFPRI: C. Narrod (Project lead), P. Trench(Project manager), M. Tiongco,
         D. Roy, A. Saak, R. Scott, W. Collier, M. Elias.

         CIMMYT: J. Hellin, H. DeGroote, G. Mahuku, S. Kimenju, B. Munyua

         ICRISAT: F. Waliyar, J. Ndjeunga, A. Diallo, M. Diallo, V. Reddy

         University of Pittsburgh: F. Wu, Y. Liu

         US Uniformed Health Services: J. Chamberlin, P. Masuoka, J. Grieco

Country Partners
        ACDI/VOCA: S. Collins, S. Guantai, S. Walker

         Kenya Agricultural Research Institute: S. Nzioki, C. Bett

         Institut d’Economie Rurale: B. Diarra, O. Kodio, L. Diakite

More Related Content

PDF
Stephen Friend National Heart Lung & Blood Institute 2011-07-19
PDF
Stephen Friend Molecular Imaging Program at Stanford (MIPS) 2011-08-15
PDF
Levels of aflatoxins in the Kenyan dairy value chain: How can we assess the e...
PDF
Measuring the burden of aflatoxin induced human disease
PDF
Abrines iac2009
PDF
Abrines Iac2009
PDF
Martin Bardsley: Predictive risk 2012: context
PPT
2010 corporate end user study
Stephen Friend National Heart Lung & Blood Institute 2011-07-19
Stephen Friend Molecular Imaging Program at Stanford (MIPS) 2011-08-15
Levels of aflatoxins in the Kenyan dairy value chain: How can we assess the e...
Measuring the burden of aflatoxin induced human disease
Abrines iac2009
Abrines Iac2009
Martin Bardsley: Predictive risk 2012: context
2010 corporate end user study

More from pchenevixtrench (20)

PDF
Economic importance of different maize storage structures in kenya
PDF
Findings from the cost effectiveness analysis
PDF
Communicating the findings
PDF
Exploring the scope of cost effective aflatoxin risk reduction strategies
PDF
Estimating consumer willingness to pay for aflatoxin free food
PDF
Estimating demand for aflatoxin risk reducing strategies in kenya (2)
PDF
Findings from the KAP analysis
PDF
SPS barriers as a bridge to cross
PDF
Impacts of aflatoxin contamination on livelihoods of the poor households
PDF
Prevalence of aflatoxin along the maize value chain in kenya
PDF
Kenya value chain
PDF
Aflatoxin and communication - lessons learnt from the Aflacontrol project
PDF
Groundnut value chain analysis in mali
PPT
Biocontrol of Aflatoxin in Kenya, R. Bandyopadhyay et al.
PPTX
Preliminary results showing prevalance of aflatoxin in maize in Kenya by Geor...
PPTX
Exploring the Scope of Cost Effective Aflatoxin Risk Reduction Strategies
PPTX
Alternative use of aflatoxin contaminated grain
PPT
Aflatoxins in Grains and Food Safety, Guantai, S.M., ACDI/VOCA Kenya
PPTX
Post-Harvest control of Aflatoxin, Ernest Moturi Ogwora, NCPB
PPT
Agronomic practices, post harvest handling and management to mitigate for Afl...
Economic importance of different maize storage structures in kenya
Findings from the cost effectiveness analysis
Communicating the findings
Exploring the scope of cost effective aflatoxin risk reduction strategies
Estimating consumer willingness to pay for aflatoxin free food
Estimating demand for aflatoxin risk reducing strategies in kenya (2)
Findings from the KAP analysis
SPS barriers as a bridge to cross
Impacts of aflatoxin contamination on livelihoods of the poor households
Prevalence of aflatoxin along the maize value chain in kenya
Kenya value chain
Aflatoxin and communication - lessons learnt from the Aflacontrol project
Groundnut value chain analysis in mali
Biocontrol of Aflatoxin in Kenya, R. Bandyopadhyay et al.
Preliminary results showing prevalance of aflatoxin in maize in Kenya by Geor...
Exploring the Scope of Cost Effective Aflatoxin Risk Reduction Strategies
Alternative use of aflatoxin contaminated grain
Aflatoxins in Grains and Food Safety, Guantai, S.M., ACDI/VOCA Kenya
Post-Harvest control of Aflatoxin, Ernest Moturi Ogwora, NCPB
Agronomic practices, post harvest handling and management to mitigate for Afl...
Ad

Recently uploaded (20)

PDF
sbt 2.0: go big (Scala Days 2025 edition)
PDF
Zenith AI: Advanced Artificial Intelligence
PDF
“A New Era of 3D Sensing: Transforming Industries and Creating Opportunities,...
PDF
Getting started with AI Agents and Multi-Agent Systems
PDF
CloudStack 4.21: First Look Webinar slides
PDF
1 - Historical Antecedents, Social Consideration.pdf
PPTX
Custom Battery Pack Design Considerations for Performance and Safety
PDF
How IoT Sensor Integration in 2025 is Transforming Industries Worldwide
PPTX
Chapter 5: Probability Theory and Statistics
PPTX
Benefits of Physical activity for teenagers.pptx
PDF
OpenACC and Open Hackathons Monthly Highlights July 2025
PDF
Architecture types and enterprise applications.pdf
PDF
Five Habits of High-Impact Board Members
PDF
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
PDF
STKI Israel Market Study 2025 version august
PDF
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
PDF
A contest of sentiment analysis: k-nearest neighbor versus neural network
PPTX
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
PPT
Geologic Time for studying geology for geologist
PDF
A review of recent deep learning applications in wood surface defect identifi...
sbt 2.0: go big (Scala Days 2025 edition)
Zenith AI: Advanced Artificial Intelligence
“A New Era of 3D Sensing: Transforming Industries and Creating Opportunities,...
Getting started with AI Agents and Multi-Agent Systems
CloudStack 4.21: First Look Webinar slides
1 - Historical Antecedents, Social Consideration.pdf
Custom Battery Pack Design Considerations for Performance and Safety
How IoT Sensor Integration in 2025 is Transforming Industries Worldwide
Chapter 5: Probability Theory and Statistics
Benefits of Physical activity for teenagers.pptx
OpenACC and Open Hackathons Monthly Highlights July 2025
Architecture types and enterprise applications.pdf
Five Habits of High-Impact Board Members
ENT215_Completing-a-large-scale-migration-and-modernization-with-AWS.pdf
STKI Israel Market Study 2025 version august
Produktkatalog für HOBO Datenlogger, Wetterstationen, Sensoren, Software und ...
A contest of sentiment analysis: k-nearest neighbor versus neural network
AI IN MARKETING- PRESENTED BY ANWAR KABIR 1st June 2025.pptx
Geologic Time for studying geology for geologist
A review of recent deep learning applications in wood surface defect identifi...
Ad

Predicting aflatoxin levels a spatial autoregressive approach

  • 1. Predicting Aflatoxin levels: An Spatial Autoregressive approach Gissele Gajate-Garrido, IFPRI International Food Policy Research Institute Uniformed Services University of the Health Sciences International Center for the Improvement of Maize ACDI/VOCA/Kenya Maize Development Program and Wheat Kenya Agricultural Research Institute International Crops Research Institute for the Semi- Institut d’Economie Rurale Arid Tropics The Eastern Africa Grain Council University of Pittsburgh
  • 2. Collecting aflatoxin information is time consuming and expensive.  Sometimes we can have aflatoxin information from a smaller sample of households.  These information could be useful to predict the level of aflatoxins in other households with similar characteristics.
  • 3. A Spatial Autoregressive Model (SAR) uses the household characteristics and the aflatoxin level of people around it to predict aflatoxin levels in each household.
  • 4. This model gives more weight Aflatoxin level to the information of my closest “neighbors” and less to the ones that are further away.  My “neighbors” information could help predict my own Observable Unobservable: aflatoxin level since it could characteristics - Attitudes - Risk aversion contain information that - Motivation usually is not captured by surveys.  When we estimate models there is always an error term present that represents the variation that we are unable to capture.
  • 5. There are variables such as a person’s determination or innate ability that could help predict how much time and effort they will invest in preventing aflatoxins in their crops.  These variables cannot be observed or recorded in a survey.  However, by capturing information about my peers this could help provide additional information about how I behave and how high is my aflatoxin level.
  • 6. In order to asses who is “closest” to me I use location variables:  Longitude  Latitude  Elevation  Slope ▪ (Only for the pre-harvest sample)
  • 7. 90% Storage 80% 70% 63% 74% 60% Production 50% 38% 40% 29% 27% 30% 20% 6% 9% 9% 10% 6% 2% 0% Treated Improved Pesticide Fertilizer Insect Rodent Plastic Storage: Frequent Hand soil (lime, seed damage damage bags for special use of sorting manure, storage room pestcide before etc.) inside in storage house storage
  • 8. We use 100% Aflatoxin variation data from 90% 80% 36% Mali to 70% The inside sample prediction captures test the 60% 36% of the variation in prevalence model. 50% values. 40% 64 % Yet, the information of my neighbors is not useful to predict my prevalence  We start 30% levels, only my characteristics are with pre- 20% relevant. 10% harvest 0% data. My neighbors' My characteristics Unobservable
  • 9. 2.5 2 The relationship 1.5 between 1 1.04 *** predicted and 0.5 real values is 0 almost 1 to 1. 0 1 2 3 It is significant at Measured prevalence (part per billion) 1%. Predicted prevalence 45 degree line Variable Obs Mean Std. Dev. Min Max Measured prevalence 247 27.2 64.0 0.05 492.0 Predicted prevalence 247 29.6 26.9 0.00 130.7
  • 10. Kernel density estimate for Pre-harvest Aflatoxin levels .04 76% The model is not .03 able to capture Density extremely high .02 values of prevalence and in general 43% .01 overestimates lower values. 0 0 20 100 200 300 400 500 prevalence (part per billion) Kernel density measured prevalence Kernel density predicted prevalence kernel = epanechnikov, bandwidth = 3.8288
  • 11. Kernel density estimate for Main HH Pre-harvest Aflatoxin levels .01 .008 .006 Density 37% .004 63% .002 0 0 20 50 100 150 200 250 Kernel density predicted prevalence for Main HH kernel = epanechnikov, bandwidth = 12.9933 Variable Obs Mean Std. Dev. Min Max Predicted prevalence for main HH survey 1169 58.4 59.3 0.0 223.1
  • 12. Post-harvest data after 1 month in Total variation in aflatoxin levels storage  During storage Variation Variation not only your explained by explained by characteristics but personal neighbors characteristics aflatoxin level also your "neighbors" information help Unexplained variation = 62 % explain your aflatoxin level. The inside sample prediction captures 38% of the variation in prevalence values.
  • 13. 2.5 2 The 1.5 relationship between 1 0.95 *** predicted and 0.5 real values is almost 1 to 1. 0 0 1 2 3 It is significant Measured prevalence (part per billion) at 1%. Predicted prevalence 45 degree line Variable Obs Mean Std. Dev. Min Max Measured prevalence 243 121.9 256.9 0.0 1911.2 Predicted prevalence 243 129.0 130.5 0.0 778.0
  • 14. The same methodology applied to the data in Mali will be applied to the data in Kenya.  Hence will be able to predict prevalence levels for the main household survey and use it for further analysis.  Should we expect similar results?  Different crops ▪ Mali –groundnuts vs. Kenya – maize  It also depends on production and storage practices in Kenya.
  • 16. We have two models that can be used to predict aflatoxin models:  Maxent  SAR model  We need to compare the strengths and weakness of both models.  We can also consider introducing other variables to improve the predictions.
  • 17. Current Partners: Donor: Bill and Melinda Gates Foundation Center/ Universities IFPRI: C. Narrod (Project lead), P. Trench(Project manager), M. Tiongco, D. Roy, A. Saak, R. Scott, W. Collier, M. Elias. CIMMYT: J. Hellin, H. DeGroote, G. Mahuku, S. Kimenju, B. Munyua ICRISAT: F. Waliyar, J. Ndjeunga, A. Diallo, M. Diallo, V. Reddy University of Pittsburgh: F. Wu, Y. Liu US Uniformed Health Services: J. Chamberlin, P. Masuoka, J. Grieco Country Partners ACDI/VOCA: S. Collins, S. Guantai, S. Walker Kenya Agricultural Research Institute: S. Nzioki, C. Bett Institut d’Economie Rurale: B. Diarra, O. Kodio, L. Diakite