SlideShare a Scribd company logo
International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019
DOI: 10.5121/ijaia.2019.10603 33
MODELING THE CHLOROPHYLL-A FROM SEA
SURFACE REFLECTANCE IN WEST AFRICA BY DEEP
LEARNING METHODS: A COMPARISON OF MULTIPLE
ALGORITHMS
Daouda DIOUF and Djibril Seck
Laboratoire de Traitement de l’Information (LTI) – ESP –
Université Cheikh Anta Diop de Dakar
BP: 5085 Dakar-Fann (Sénégal)
ABSTRACT
Deep learning provide successful applications in many fields. Recently, machines learning are involved for
oceans remote sensing applications. In this study, we use and compare about eight (8) deep learning estimators
for retrieval of a mainly pigment of phytoplankton. Depending on the water case and the multiple instruments
simultaneously observing the earth on a variety of platforms, several algorithm are used to estimate the
chlolophyll-a from marine reflectance.By using a long-term multi-sensor time-series of satellite ocean-colour
data, as MODIS, SeaWifs, VIIRS, MERIS, etc…, we make a unique deep network model able to establish a
relationship between sea surface reflectance and chlorophyll-a from any measurement satellite sensor over West
Africa. These data fusion take into account the bias between case water and instruments.We construct several
chlorophyll-a concentration prediction deep learning based models, compare them and therefore use the best for
our study. Results obtained for accuracy training and test are quite good. The mean absolute error are very low
and vary between 0,07 to 0,13 mg/m3
.
KEYWORDS
deep learning estimators; remote sensing; chlorophyll-a
1. INTRODUCTION
The sensor ocean color provides measures ρtoa multispectral Top Of Atmosphere (TOA) reflectance (λ)
of ocean-atmosphere system in the visible and near infrared since many decades.
Reflectance ρcor measured by the radiometer, corrected to Rayleigh scattering contribution, specular
reflection and absorption gas is expressed as follows:
ρcor (λ) = ρA (λ) + t.ρw (λ)
where ρA is the atmospheric contribution and ρw the contribution due to the ocean, t the atmospheric
transmittance.
Atmospheric correction algorithm can be used [1] to estimate ρA and and determined chlorophyll-a
from the remaining ρw.
Phytoplankton are important to marine ecosystems. Its play great role in the food web and
biogeochemical cycles. Chlorophyll represents mainly the phytoplankton.
International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019
34
To retrieved chlorophyll-a concentration from ρw, SeaWifS sensor use OC4V4 algorithm that compute
the rapports of ρw at wavelengths 443nm, 490nm, 510nm and 555nm.
)
532
.
1
049
.
6
930
.
1
067
.
3
366
.
0
( 4
3
2
10 R
R
R
R
a
chl 





where ))
)
555
(
)
510
(
,
)
555
(
)
490
(
,
)
555
(
)
443
(
(max(
log10
w
w
w
w
w
w
R







This mean that the maximum rapport value is taken.
The machine learning show strong computing power of classification and fitting capability to big data
and multi-feature data [2]. In many fields as image recognition [3], search engines [4], stock price
predictions [5], accurate results are obtained.
Machine learning found many application on earth remote sensing [6] especially for ocean data
products.
Due to the non-linearity and complexity of the measurements made by ocean colour sensors, and
taking advantage of the high-dimensional data reduction technique for the construction of high-
dimensional predictors in input-output models of deep learning, we aim, with these last ones, to
construct and optimize a chlorophyll-a concentration prediction model.
2. DATASET
Data are from ESA Ocean Colour Climate Change Initiative (Ocean_Colour_cci): Version 3.1 Data.
This collection contains version 3.1 datasets produced by the Ocean Colour project of the ESA
Climate Change Inititative (CCI). The Ocean Colour CCI is producing long-term multi-sensor time-
series of satellite ocean-colour data with a particular focus for use in climate studies.
Data products being produced include: phytoplankton chlorophyll-a concentration; remote-sensing
reflectance at six wavelengths (412 nm, 443 nm, 490 nm, 510 nm, 555 nm, 670 nm)
Datasets are 5 days composite images of year 2014 and are taken in an area off the West Africa,
between 6°N and 30°N and - 34°W and -8°W. This area is very important. It contains the senegalo-
mauritanian upwelling zone.
Datasets of year 2014 are about 26 686 250 pixels. We random these datasets to avoid the overfitting
and take the 5 % for train data and 1 % for test data which is used to test the prediction accuracy of the
model.
We use also independent dataset from MODIS sensor for validation.
3. METHODS
We implemented ours models in Keras and sklearn libraries and we used many deep learning regressor
methods to establish deep network relationship between sea surface reflectance. The aims is, first to
shown the feasibility because of its multivalued of ocean color data, and second, to choose the best
one and further explored it and their hyperparameters tuned. For each model, input are the six spectral
sea surface reflectance and output is the chlorophyll-a.
International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019
35
3.1. Generalized Linear Models
Linear models are simple way to predict output using a linear function of input features.
The spectral sea surface reflectance represents the input features and are notes by  
n
x
x
x
X ,...
, 2
1
 ,
output by y and therefore ŷ the predicted output.
Linear Regression: The library of sklearn allow us to parameterize a linear regression. It fits a linear
model with coefficients w=(w1...wn) to minimize the residual sum of squares between the spectral sea
surface reflectance X, and the chlorophyll-a concentration ŷ by the linear approximation:
n
n x
w
x
w
x
w
w
X
w
y 



 ...
)
,
(
ˆ 2
2
1
1
0
Linear regression optimization is to minimize the cost function written as:
  
  




M
i
M
i
n
j
ij
j
i
i
i x
w
y
y
y
0 0 0
2
2
)
(
)
ˆ
(
Ridge regression: The cost function is altered by imposing a penalty equivalent to square of the
magnitude of the coefficients. The cost function to minimize is:
   
   





M
i
M
i
n
j
n
j
j
ij
j
i
i
i w
x
w
y
y
y
0 0 0 0
2
2
2
)
(
)
ˆ
( 
Ridge Regression is an optimization of Ordinary Least Squares Regression.
3.2. Generalized Ensemble Methods
Ensemble methods are algorithms that combine multiple algorithms into a single predictive model in
order to decrease variance, decrease bias, or improve predictions. In others words, they are a sets of
machine learning techniques whose decisions are combined to improve the performances of the overall
system.
Bagging regressor: Bagging methods are aggregation methods. The bagging approach consists of
trying to reduce the dependency between the estimators that are aggregated by building them on
bootstrap samples. The algorithm is simple to implement: it is necessary to build n estimators on
bootstrap samples and to aggregate them.
Decision trees regressor: Classification and regression trees models, or CART models, were
introduced by [7]. A top down approach is applied to dataset. The complexity of the model is managed
by two parameters: max_depth, which determines the max number of leaves in the trees, and the
minimum number min_samples_split of dataset required to search for a dichotomy.
Random forest regressor: A random forest is only an aggregation trees dependent on random
variables. For example, bagging trees (building trees on bootstrap samples) defines a random forest.
The Random Forest allows to improve the predictive accuracy and to control over-fitting. [8].
More than the number of trees n_estimators, the parameter to be optimized is the number of variables
randomly drawn for the search for the optimal division of a node: max_features. Maximizing the
max_features parameter can be achieved by minimizing the out-of-bag forecast error.
International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019
36
Extra Trees regressor: This class implements a meta-estimator that fits a number of randomized
decision trees on various sub-samples of the dataset and uses averaging to improve the predictive
accuracy and control over-fitting.
3.3. Support Vector Regressor Model
SVR algorithm use RBF as a kernel function. SVR minimizes the generalization error bound so as to
achieve generalized, instead of minimizing the observed training error [9].
3.4. K-Neighbors Regressor Model
The k-nearest neighbors algorithm (k-NN) is a non-parametric. For regression, the output is based on
the mean or median of the k-nearest neighbors in the feature space. The parameter to optimize to
control the complexity of the model is the number of neighbor K.
4. RESULTS
In this section, we verify the generalization capability of the constructed models. Therefore a set of
data were used for prediction experiments. Results are shown on table1. The ensemble models
regressor are more quite fitting data with respectively test accuracy and mean absolute error (mae)
predictions of 96,04% and 0,09 for bagging; 96,05% and 0,09 for random forest; 96,46% and 0,07 for
extra tree and 93,78% and 0,13 for decision trees.
The linear model give us a test accuracy of about 76,06 %. We get a mae prediction of about 0,43. The
K-neighbor model have a mae prediction of 0,09 and a test accuracy of 95,08.
The support vector and the ridge gave least good results with respectively test accuracy and mae
predictions of 5,05% and 0,67; 24,54% and 0,8.
The distribution with a kernel density estimate on figure below show it clearly.
Tab.1: Test of accuracy and error prediction
Regression Mean absolute error (mg/m3
) Accuracy (%)
Linear 0.43 76,06
Ridge 0,8 24,54
SVR 0,67 5,05
K-Neighbor 0,09 95,08
Extra Tree 0,07 96,46
Random Forest 0,09 96,05
Decision Trees 0,13 93,78
Bagging 0,09 96,04
With regard to Table 1, we find that all ensemble models regression work well with larger prediction
values.
In figure 1, we show kernel density estimation (KDE) to visualize frequency test data predicted by
each method. A KDE is used to get a smooth estimation of the probability density function. This curve
is estimated from the data, and the most widely used kernel is a Gaussian kernel. This is particularly
useful in looking for a cluster of analyses in spectra of data. We noted that extra tree, random forest,
decision trees and bagging method are more robust. For the following of this paper, we will work with
the extra trees regression model to predict the target.
International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019
37
Fig. 1: Distribution with a kernel density estimation with a Gaussian kernel and a data set with 266862 sample
points from a combined normal density
International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019
38
In figure 2 and figure 3, we plot the predicted chl-a concentration with extra trees model and the real
values measured respectively for the average of period from January 16, to January 30, 2014 and for
the average of period from from March 11, 2014, to March 30, 2014. For average value, we apply the
algorithms to each daily image and average this daily estimate for the climatology period under study.
The dataset we compare did not participate in learning phase.
The two images are very similar. Indeed, chl-a estimated for this average is equal to 95,05% with a
mae of 0,09 mg/m3
. We see that an abundance chlorophyll-a pattern is observed near the shore and this
rapidly decrease offshore. This is consistent with the literature because the upwelling area runs along
the West African coast from Guinea to Mauritania. [10] and more recently [11], [12] demonstrate that
the upwelling intensity is maximum in March–April.
Fig. 2: The average of estimate chlorophyll-a concentration from January 16, 2014, to January 30, 2014 in
mg/m3
for (left) the true CCI value and (right) Extra trees prediction value. The relative error between them is
represented in bottom.
International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019
39
Fig. 3: The average of estimate chlorophyll-a concentration from March 11, 2014, to March 30, 2014 in mg/m3
for (left) the true CCI value and (right) Extra trees prediction value. The relative error between them is
represented in bottom.
By using independent dataset, comparison can be made with the chl-a predicted using from spectral
reflectance of MODIS sensor with the model and the standard chl-a estimate with standard algorithm,
OC3V3. This comparison shows that both methods bring out the patterns of chl-a. However the
intensity of is stronger in the standard retrieval. Figure 4.
This result is very significant because it mean that according to whether we use data from several
sensors, such as those used to build the different models of this work, or data from a single sensor
(MODIS sensor, figure 4), the modeling capacity remains good.
Fig.4: The average of estimate chlorophyll-a concentration from January 1, 2015, to January 16, 2015 in mg/m3
for (left) the standard MODIS value and (right) Extra trees prediction value.
International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019
40
5. CONCLUSION
The high-dimensional predictors in input-output models offered by deep learning demonstrate in the
work the effectiveness of chlorophyll-a retrieval from sea surface reflectance.
High accuracy are obtained on both the training and test dataset with a low mean absolute error of 0,09
mg/m3
and correlation coefficient higher than 92%. The extra tree regression was the model we used.
Retrievals values of chlorophyll-a are in consistence with upwelling phenomena denoted on this area.
Comparison we independent value have shown satisfactory results.
REFERENCES
[1] D. Diouf, A. Niang, J. Brajard, M. Crepon, and S. Thiria, “Retrieving aerosol characteristics and
sea‐surface chlorophyll from satellite ocean color multi‐spectral sensors using a neural‐variational
method”, Remote Sens. Environ., 130, 2013: 74– 86.
[2] G.E. Hinton, .S Osindero, Y.W. The, “A fast learning algorithm for deep belief nets. Neural
computation”, 2006, 18(7): 1527–1554. https://doi.org/10.1162/neco.2006.18.7.1527 PMID: 16764513
[3] K. He, X. Zhang, S. Ren, J. Sun, “Deep residual learning for image recognition”. Proceedings of the
IEEE conference on computer vision and pattern recognition. 2016: 770–778.
[4] R. Collobert, J. A. Weston, “unified architecture for natural language processing: Deep neural networks
with multitask learning”. Proceedings of the 25th international conference on Machine learning. ACM,
2008: 160–167.
[5] R. Akita, A. Yoshihara, T. Matsubara, K. Uehara, “Deep learning for stock prediction using numerical
and textual information.” Computer and Information Science (ICIS), 2016 IEEE/ACIS 15th International
Conference on. IEEE, 2016: 1–6.
[6] D.J. Lary , G.K. Zewdie, X. Liu, D. Wu, E. Levetin, R.J. Allee, N. Malakar, A. Walker, H. Mussa, A.
Mannino, D. Aurin “Machine Learning Applications for Earth Observation”, In book: Earth Observation
Open Science and Innovation, ISSI Scientific Report Series 15, P.-P. Mathieu, C. Aubrecht (eds.),
https://doi.org/10.1007/978-3-319-65633-5_8
[7] L. Breiman, J.H. Friedman, R.A Olshen and C. J. Stone “Classification and Regression Trees”, 1984,
Wadsworth International Group, Belmont, CA.
[8] L. Breiman, “Random forests”, Machine Learning 45 (2001), 5–32.
[9] D. Basak, S. Pal and D. C. Patranabis, “Support Vector Regression”, Neural Information Processing –
Letters and Reviews, Vol. 11, No. 10, October 2007
[10] Demarcq, H., and Faure, V. ( 2000), “Coastal upwelling and associated retention indices from satellite
SST. Application to Octopus vulgaris recruitment”, Oceanogr. Acta, 23, 391– 407.
[11] Sawadogo, S., Brajard, J., Niang, A., Lathuilière, C., Crepon, M., and Thiria, S. ( 2009), “Analysis of the
Senegalo‐Mauritanian upwelling by processing satellite remote sensing observations with topological
maps”, in Proceedings IEEE International Joint Conference on Neural Networks, IJCNN, pp. 2826–2832,
Atlanta, USA
[12] Farikou, O., Sawadogo, S., Niang, A., Brajard, J., Mejia, C., Crépon, M., and Thiria, S. ( 2013),
“Multivariate analysis of the Senegalo‐Mauritanian area by merging satellite remote sensing ocean color
and SST observations”, J. Environ. Earth Sci., 5( 12), 756– 768.

More Related Content

PDF
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
PPT
IGARSS_Presentation_Rodrigo_Jose_Pisani.ppt
PPT
IGARSS 2011.ppt
PPTX
IGARSS_2011_GALLOZA.pptx
PPTX
An Assessment of Pan-sharpening Algorithms for Mapping Mangrove Ecosystems: A...
PDF
NIR Reflectance Spectroscopy in Soil Surveys oriented Toward Monitoring in Me...
 
PDF
Nonlinear filtering approaches to field mapping by sampling using mobile sensors
PDF
Analysis of Soil in the Field using portable FTIR - A H Jean Robertson, H Rac...
 
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
IGARSS_Presentation_Rodrigo_Jose_Pisani.ppt
IGARSS 2011.ppt
IGARSS_2011_GALLOZA.pptx
An Assessment of Pan-sharpening Algorithms for Mapping Mangrove Ecosystems: A...
NIR Reflectance Spectroscopy in Soil Surveys oriented Toward Monitoring in Me...
 
Nonlinear filtering approaches to field mapping by sampling using mobile sensors
Analysis of Soil in the Field using portable FTIR - A H Jean Robertson, H Rac...
 

What's hot (19)

PPTX
Copernicus Land Moniotring Service Portfolio
PDF
LiDAR work
PDF
Performance improvement of a Rainfall Prediction Model using Particle Swarm O...
PPTX
Barber_TU2.T03_hk_mb_casa_fg_hk_FINAL.pptx
PDF
Prediction of soil properties with NIR data and site descriptors using prepro...
 
PPTX
201107IGARSS_OHKI.pptx
PDF
Poster Roseanne Clement
PDF
Global Sensitivity Analysis for the Calibration of a Fully-distributed Hydrol...
PPTX
MONITORING FOREST MANAGEMENT ACTIVTIES USING AIRBORNE LIDAR AND ALOS PALSAR.pptx
PPTX
Radiometric Calibration of Digital Images
PPTX
Uniform geometrical theory of diffraction (User body shadowing)
PPT
L16 radiation shielding
PPT
2005-12-05 Aerosol Characterization Using the SeaWiFS Sensor and Surface Data
PPTX
GENERATING FINE RESOLUTION LEAF AREA INDEX MAPS FOR BOREAL FORESTS OF FINLAND...
PDF
Modelling Receptor Optimization A&WMA 2011 Florida
PPSX
Sentinel 2
PDF
Leaf Area Index (LAI) in the quantification of vegetation disturbance in Iris...
PDF
Radar reflectance model for the extraction of height from shape from shading ...
PPTX
L 15 radiation shielding principles. ppt
Copernicus Land Moniotring Service Portfolio
LiDAR work
Performance improvement of a Rainfall Prediction Model using Particle Swarm O...
Barber_TU2.T03_hk_mb_casa_fg_hk_FINAL.pptx
Prediction of soil properties with NIR data and site descriptors using prepro...
 
201107IGARSS_OHKI.pptx
Poster Roseanne Clement
Global Sensitivity Analysis for the Calibration of a Fully-distributed Hydrol...
MONITORING FOREST MANAGEMENT ACTIVTIES USING AIRBORNE LIDAR AND ALOS PALSAR.pptx
Radiometric Calibration of Digital Images
Uniform geometrical theory of diffraction (User body shadowing)
L16 radiation shielding
2005-12-05 Aerosol Characterization Using the SeaWiFS Sensor and Surface Data
GENERATING FINE RESOLUTION LEAF AREA INDEX MAPS FOR BOREAL FORESTS OF FINLAND...
Modelling Receptor Optimization A&WMA 2011 Florida
Sentinel 2
Leaf Area Index (LAI) in the quantification of vegetation disturbance in Iris...
Radar reflectance model for the extraction of height from shape from shading ...
L 15 radiation shielding principles. ppt
Ad

Similar to Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Deep Learning Methods: A Comparison of Multiple Algorithms (20)

PDF
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
PDF
AI Final report 1.pdf
PPT
Presen_IGARSS11_Jamet.ppt
PDF
A detailed analysis of the supervised machine Learning Algorithms
PDF
Morton john canty image analysis and pattern recognition for remote sensing...
PDF
Prediction of Air Quality Index using Random Forest Algorithm
PDF
ElliottHarrington_Poster_AGU-2013
PPTX
Statistical learning approach for estimating water quality parameters
PPT
MACHINE LEARNING FOR SATELLITE-GUIDED WATER QUALITY MONITORING
PDF
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm
PPTX
University of Victoria Talk - Metocean analysis and Machine Learning for Impr...
PDF
Application of extreme learning machine for estimating solar radiation from s...
PPTX
Rapid phenotyping of prawn biochemical attributes using hyperspectral imaging
PPTX
Rapid phenotyping of prawn biochemical attributes using hyperspectral imaging
PDF
Machine learning methods in the environmental sciences Hsieh W.W.
PDF
Optimized Neural Network for Classification of Multispectral Images
PDF
Deep learning and machine learning classification technique for integrated fo...
PDF
Calculation of solar radiation by using regression methods
PDF
book.pdf
PDF
Supervised learning (2)
MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEE...
AI Final report 1.pdf
Presen_IGARSS11_Jamet.ppt
A detailed analysis of the supervised machine Learning Algorithms
Morton john canty image analysis and pattern recognition for remote sensing...
Prediction of Air Quality Index using Random Forest Algorithm
ElliottHarrington_Poster_AGU-2013
Statistical learning approach for estimating water quality parameters
MACHINE LEARNING FOR SATELLITE-GUIDED WATER QUALITY MONITORING
Water Quality Index Calculation of River Ganga using Decision Tree Algorithm
University of Victoria Talk - Metocean analysis and Machine Learning for Impr...
Application of extreme learning machine for estimating solar radiation from s...
Rapid phenotyping of prawn biochemical attributes using hyperspectral imaging
Rapid phenotyping of prawn biochemical attributes using hyperspectral imaging
Machine learning methods in the environmental sciences Hsieh W.W.
Optimized Neural Network for Classification of Multispectral Images
Deep learning and machine learning classification technique for integrated fo...
Calculation of solar radiation by using regression methods
book.pdf
Supervised learning (2)
Ad

More from gerogepatton (20)

PDF
International Journal of Artificial Intelligence & Applications (IJAIA)
PDF
International Journal of Artificial Intelligence & Applications (IJAIA)
PDF
Performance Evaluation of Block-Sized Algorithms for Majority Vote in Facial ...
PDF
3rd International Conference on Artificial Intelligence and IoT (AIIoT 2025)
PDF
International Journal of Artificial Intelligence & Applications (IJAIA)
PDF
3rd International Conference on Artificial Intelligence and IoT (AIIoT 2025)
PDF
Augmented and Synthetic Data in Artificial Intelligence
PDF
3rd International Conference on AI, Data Mining and Data Science (AIDD 2025)
PDF
July 2025 - Top 10 Read Articles in Artificial Intelligence and Applications ...
PDF
6th International Conference on Natural Language Processing and Computational...
PDF
From Insight to Impact: The Evolution of Data-Driven Decision Making in the A...
PDF
6th International Conference on Artificial Intelligence and Machine Learning ...
PDF
3rd International Conference on Artificial Intelligence and IoT (AIIoT 2025)
PDF
International Journal of Artificial Intelligence & Applications (IJAIA)
PDF
AI-Driven Vulnerability Analysis in Smart Contracts: Trends, Challenges and F...
PDF
International Journal of Artificial Intelligence & Applications (IJAIA)
PDF
6th International Conference on Artificial Intelligence and Machine Learning ...
PDF
A Thorough Introduction to Multimodal Machine Translation
PDF
International Journal of Artificial Intelligence & Applications (IJAIA)
PDF
6th International Conference on Advanced Machine Learning (AMLA 2025)
International Journal of Artificial Intelligence & Applications (IJAIA)
International Journal of Artificial Intelligence & Applications (IJAIA)
Performance Evaluation of Block-Sized Algorithms for Majority Vote in Facial ...
3rd International Conference on Artificial Intelligence and IoT (AIIoT 2025)
International Journal of Artificial Intelligence & Applications (IJAIA)
3rd International Conference on Artificial Intelligence and IoT (AIIoT 2025)
Augmented and Synthetic Data in Artificial Intelligence
3rd International Conference on AI, Data Mining and Data Science (AIDD 2025)
July 2025 - Top 10 Read Articles in Artificial Intelligence and Applications ...
6th International Conference on Natural Language Processing and Computational...
From Insight to Impact: The Evolution of Data-Driven Decision Making in the A...
6th International Conference on Artificial Intelligence and Machine Learning ...
3rd International Conference on Artificial Intelligence and IoT (AIIoT 2025)
International Journal of Artificial Intelligence & Applications (IJAIA)
AI-Driven Vulnerability Analysis in Smart Contracts: Trends, Challenges and F...
International Journal of Artificial Intelligence & Applications (IJAIA)
6th International Conference on Artificial Intelligence and Machine Learning ...
A Thorough Introduction to Multimodal Machine Translation
International Journal of Artificial Intelligence & Applications (IJAIA)
6th International Conference on Advanced Machine Learning (AMLA 2025)

Recently uploaded (20)

PDF
WOOl fibre morphology and structure.pdf for textiles
PDF
A comparative study of natural language inference in Swahili using monolingua...
PPTX
cloud_computing_Infrastucture_as_cloud_p
PDF
NewMind AI Weekly Chronicles - August'25-Week II
PPTX
OMC Textile Division Presentation 2021.pptx
PPTX
A Presentation on Touch Screen Technology
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
1 - Historical Antecedents, Social Consideration.pdf
PDF
Web App vs Mobile App What Should You Build First.pdf
PDF
Unlocking AI with Model Context Protocol (MCP)
PDF
Encapsulation theory and applications.pdf
PDF
Hindi spoken digit analysis for native and non-native speakers
PPTX
1. Introduction to Computer Programming.pptx
PDF
DP Operators-handbook-extract for the Mautical Institute
PPTX
Programs and apps: productivity, graphics, security and other tools
PDF
Accuracy of neural networks in brain wave diagnosis of schizophrenia
PPTX
Digital-Transformation-Roadmap-for-Companies.pptx
PPTX
TLE Review Electricity (Electricity).pptx
PPTX
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
PDF
August Patch Tuesday
WOOl fibre morphology and structure.pdf for textiles
A comparative study of natural language inference in Swahili using monolingua...
cloud_computing_Infrastucture_as_cloud_p
NewMind AI Weekly Chronicles - August'25-Week II
OMC Textile Division Presentation 2021.pptx
A Presentation on Touch Screen Technology
Agricultural_Statistics_at_a_Glance_2022_0.pdf
1 - Historical Antecedents, Social Consideration.pdf
Web App vs Mobile App What Should You Build First.pdf
Unlocking AI with Model Context Protocol (MCP)
Encapsulation theory and applications.pdf
Hindi spoken digit analysis for native and non-native speakers
1. Introduction to Computer Programming.pptx
DP Operators-handbook-extract for the Mautical Institute
Programs and apps: productivity, graphics, security and other tools
Accuracy of neural networks in brain wave diagnosis of schizophrenia
Digital-Transformation-Roadmap-for-Companies.pptx
TLE Review Electricity (Electricity).pptx
TechTalks-8-2019-Service-Management-ITIL-Refresh-ITIL-4-Framework-Supports-Ou...
August Patch Tuesday

Modeling the Chlorophyll-a from Sea Surface Reflectance in West Africa by Deep Learning Methods: A Comparison of Multiple Algorithms

  • 1. International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019 DOI: 10.5121/ijaia.2019.10603 33 MODELING THE CHLOROPHYLL-A FROM SEA SURFACE REFLECTANCE IN WEST AFRICA BY DEEP LEARNING METHODS: A COMPARISON OF MULTIPLE ALGORITHMS Daouda DIOUF and Djibril Seck Laboratoire de Traitement de l’Information (LTI) – ESP – Université Cheikh Anta Diop de Dakar BP: 5085 Dakar-Fann (Sénégal) ABSTRACT Deep learning provide successful applications in many fields. Recently, machines learning are involved for oceans remote sensing applications. In this study, we use and compare about eight (8) deep learning estimators for retrieval of a mainly pigment of phytoplankton. Depending on the water case and the multiple instruments simultaneously observing the earth on a variety of platforms, several algorithm are used to estimate the chlolophyll-a from marine reflectance.By using a long-term multi-sensor time-series of satellite ocean-colour data, as MODIS, SeaWifs, VIIRS, MERIS, etc…, we make a unique deep network model able to establish a relationship between sea surface reflectance and chlorophyll-a from any measurement satellite sensor over West Africa. These data fusion take into account the bias between case water and instruments.We construct several chlorophyll-a concentration prediction deep learning based models, compare them and therefore use the best for our study. Results obtained for accuracy training and test are quite good. The mean absolute error are very low and vary between 0,07 to 0,13 mg/m3 . KEYWORDS deep learning estimators; remote sensing; chlorophyll-a 1. INTRODUCTION The sensor ocean color provides measures ρtoa multispectral Top Of Atmosphere (TOA) reflectance (λ) of ocean-atmosphere system in the visible and near infrared since many decades. Reflectance ρcor measured by the radiometer, corrected to Rayleigh scattering contribution, specular reflection and absorption gas is expressed as follows: ρcor (λ) = ρA (λ) + t.ρw (λ) where ρA is the atmospheric contribution and ρw the contribution due to the ocean, t the atmospheric transmittance. Atmospheric correction algorithm can be used [1] to estimate ρA and and determined chlorophyll-a from the remaining ρw. Phytoplankton are important to marine ecosystems. Its play great role in the food web and biogeochemical cycles. Chlorophyll represents mainly the phytoplankton.
  • 2. International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019 34 To retrieved chlorophyll-a concentration from ρw, SeaWifS sensor use OC4V4 algorithm that compute the rapports of ρw at wavelengths 443nm, 490nm, 510nm and 555nm. ) 532 . 1 049 . 6 930 . 1 067 . 3 366 . 0 ( 4 3 2 10 R R R R a chl       where )) ) 555 ( ) 510 ( , ) 555 ( ) 490 ( , ) 555 ( ) 443 ( (max( log10 w w w w w w R        This mean that the maximum rapport value is taken. The machine learning show strong computing power of classification and fitting capability to big data and multi-feature data [2]. In many fields as image recognition [3], search engines [4], stock price predictions [5], accurate results are obtained. Machine learning found many application on earth remote sensing [6] especially for ocean data products. Due to the non-linearity and complexity of the measurements made by ocean colour sensors, and taking advantage of the high-dimensional data reduction technique for the construction of high- dimensional predictors in input-output models of deep learning, we aim, with these last ones, to construct and optimize a chlorophyll-a concentration prediction model. 2. DATASET Data are from ESA Ocean Colour Climate Change Initiative (Ocean_Colour_cci): Version 3.1 Data. This collection contains version 3.1 datasets produced by the Ocean Colour project of the ESA Climate Change Inititative (CCI). The Ocean Colour CCI is producing long-term multi-sensor time- series of satellite ocean-colour data with a particular focus for use in climate studies. Data products being produced include: phytoplankton chlorophyll-a concentration; remote-sensing reflectance at six wavelengths (412 nm, 443 nm, 490 nm, 510 nm, 555 nm, 670 nm) Datasets are 5 days composite images of year 2014 and are taken in an area off the West Africa, between 6°N and 30°N and - 34°W and -8°W. This area is very important. It contains the senegalo- mauritanian upwelling zone. Datasets of year 2014 are about 26 686 250 pixels. We random these datasets to avoid the overfitting and take the 5 % for train data and 1 % for test data which is used to test the prediction accuracy of the model. We use also independent dataset from MODIS sensor for validation. 3. METHODS We implemented ours models in Keras and sklearn libraries and we used many deep learning regressor methods to establish deep network relationship between sea surface reflectance. The aims is, first to shown the feasibility because of its multivalued of ocean color data, and second, to choose the best one and further explored it and their hyperparameters tuned. For each model, input are the six spectral sea surface reflectance and output is the chlorophyll-a.
  • 3. International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019 35 3.1. Generalized Linear Models Linear models are simple way to predict output using a linear function of input features. The spectral sea surface reflectance represents the input features and are notes by   n x x x X ,... , 2 1  , output by y and therefore ŷ the predicted output. Linear Regression: The library of sklearn allow us to parameterize a linear regression. It fits a linear model with coefficients w=(w1...wn) to minimize the residual sum of squares between the spectral sea surface reflectance X, and the chlorophyll-a concentration ŷ by the linear approximation: n n x w x w x w w X w y      ... ) , ( ˆ 2 2 1 1 0 Linear regression optimization is to minimize the cost function written as:           M i M i n j ij j i i i x w y y y 0 0 0 2 2 ) ( ) ˆ ( Ridge regression: The cost function is altered by imposing a penalty equivalent to square of the magnitude of the coefficients. The cost function to minimize is:              M i M i n j n j j ij j i i i w x w y y y 0 0 0 0 2 2 2 ) ( ) ˆ (  Ridge Regression is an optimization of Ordinary Least Squares Regression. 3.2. Generalized Ensemble Methods Ensemble methods are algorithms that combine multiple algorithms into a single predictive model in order to decrease variance, decrease bias, or improve predictions. In others words, they are a sets of machine learning techniques whose decisions are combined to improve the performances of the overall system. Bagging regressor: Bagging methods are aggregation methods. The bagging approach consists of trying to reduce the dependency between the estimators that are aggregated by building them on bootstrap samples. The algorithm is simple to implement: it is necessary to build n estimators on bootstrap samples and to aggregate them. Decision trees regressor: Classification and regression trees models, or CART models, were introduced by [7]. A top down approach is applied to dataset. The complexity of the model is managed by two parameters: max_depth, which determines the max number of leaves in the trees, and the minimum number min_samples_split of dataset required to search for a dichotomy. Random forest regressor: A random forest is only an aggregation trees dependent on random variables. For example, bagging trees (building trees on bootstrap samples) defines a random forest. The Random Forest allows to improve the predictive accuracy and to control over-fitting. [8]. More than the number of trees n_estimators, the parameter to be optimized is the number of variables randomly drawn for the search for the optimal division of a node: max_features. Maximizing the max_features parameter can be achieved by minimizing the out-of-bag forecast error.
  • 4. International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019 36 Extra Trees regressor: This class implements a meta-estimator that fits a number of randomized decision trees on various sub-samples of the dataset and uses averaging to improve the predictive accuracy and control over-fitting. 3.3. Support Vector Regressor Model SVR algorithm use RBF as a kernel function. SVR minimizes the generalization error bound so as to achieve generalized, instead of minimizing the observed training error [9]. 3.4. K-Neighbors Regressor Model The k-nearest neighbors algorithm (k-NN) is a non-parametric. For regression, the output is based on the mean or median of the k-nearest neighbors in the feature space. The parameter to optimize to control the complexity of the model is the number of neighbor K. 4. RESULTS In this section, we verify the generalization capability of the constructed models. Therefore a set of data were used for prediction experiments. Results are shown on table1. The ensemble models regressor are more quite fitting data with respectively test accuracy and mean absolute error (mae) predictions of 96,04% and 0,09 for bagging; 96,05% and 0,09 for random forest; 96,46% and 0,07 for extra tree and 93,78% and 0,13 for decision trees. The linear model give us a test accuracy of about 76,06 %. We get a mae prediction of about 0,43. The K-neighbor model have a mae prediction of 0,09 and a test accuracy of 95,08. The support vector and the ridge gave least good results with respectively test accuracy and mae predictions of 5,05% and 0,67; 24,54% and 0,8. The distribution with a kernel density estimate on figure below show it clearly. Tab.1: Test of accuracy and error prediction Regression Mean absolute error (mg/m3 ) Accuracy (%) Linear 0.43 76,06 Ridge 0,8 24,54 SVR 0,67 5,05 K-Neighbor 0,09 95,08 Extra Tree 0,07 96,46 Random Forest 0,09 96,05 Decision Trees 0,13 93,78 Bagging 0,09 96,04 With regard to Table 1, we find that all ensemble models regression work well with larger prediction values. In figure 1, we show kernel density estimation (KDE) to visualize frequency test data predicted by each method. A KDE is used to get a smooth estimation of the probability density function. This curve is estimated from the data, and the most widely used kernel is a Gaussian kernel. This is particularly useful in looking for a cluster of analyses in spectra of data. We noted that extra tree, random forest, decision trees and bagging method are more robust. For the following of this paper, we will work with the extra trees regression model to predict the target.
  • 5. International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019 37 Fig. 1: Distribution with a kernel density estimation with a Gaussian kernel and a data set with 266862 sample points from a combined normal density
  • 6. International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019 38 In figure 2 and figure 3, we plot the predicted chl-a concentration with extra trees model and the real values measured respectively for the average of period from January 16, to January 30, 2014 and for the average of period from from March 11, 2014, to March 30, 2014. For average value, we apply the algorithms to each daily image and average this daily estimate for the climatology period under study. The dataset we compare did not participate in learning phase. The two images are very similar. Indeed, chl-a estimated for this average is equal to 95,05% with a mae of 0,09 mg/m3 . We see that an abundance chlorophyll-a pattern is observed near the shore and this rapidly decrease offshore. This is consistent with the literature because the upwelling area runs along the West African coast from Guinea to Mauritania. [10] and more recently [11], [12] demonstrate that the upwelling intensity is maximum in March–April. Fig. 2: The average of estimate chlorophyll-a concentration from January 16, 2014, to January 30, 2014 in mg/m3 for (left) the true CCI value and (right) Extra trees prediction value. The relative error between them is represented in bottom.
  • 7. International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019 39 Fig. 3: The average of estimate chlorophyll-a concentration from March 11, 2014, to March 30, 2014 in mg/m3 for (left) the true CCI value and (right) Extra trees prediction value. The relative error between them is represented in bottom. By using independent dataset, comparison can be made with the chl-a predicted using from spectral reflectance of MODIS sensor with the model and the standard chl-a estimate with standard algorithm, OC3V3. This comparison shows that both methods bring out the patterns of chl-a. However the intensity of is stronger in the standard retrieval. Figure 4. This result is very significant because it mean that according to whether we use data from several sensors, such as those used to build the different models of this work, or data from a single sensor (MODIS sensor, figure 4), the modeling capacity remains good. Fig.4: The average of estimate chlorophyll-a concentration from January 1, 2015, to January 16, 2015 in mg/m3 for (left) the standard MODIS value and (right) Extra trees prediction value.
  • 8. International Journal of Artificial Intelligence & Applications (IJAIA) Vol.10, No.6, November 2019 40 5. CONCLUSION The high-dimensional predictors in input-output models offered by deep learning demonstrate in the work the effectiveness of chlorophyll-a retrieval from sea surface reflectance. High accuracy are obtained on both the training and test dataset with a low mean absolute error of 0,09 mg/m3 and correlation coefficient higher than 92%. The extra tree regression was the model we used. Retrievals values of chlorophyll-a are in consistence with upwelling phenomena denoted on this area. Comparison we independent value have shown satisfactory results. REFERENCES [1] D. Diouf, A. Niang, J. Brajard, M. Crepon, and S. Thiria, “Retrieving aerosol characteristics and sea‐surface chlorophyll from satellite ocean color multi‐spectral sensors using a neural‐variational method”, Remote Sens. Environ., 130, 2013: 74– 86. [2] G.E. Hinton, .S Osindero, Y.W. The, “A fast learning algorithm for deep belief nets. Neural computation”, 2006, 18(7): 1527–1554. https://doi.org/10.1162/neco.2006.18.7.1527 PMID: 16764513 [3] K. He, X. Zhang, S. Ren, J. Sun, “Deep residual learning for image recognition”. Proceedings of the IEEE conference on computer vision and pattern recognition. 2016: 770–778. [4] R. Collobert, J. A. Weston, “unified architecture for natural language processing: Deep neural networks with multitask learning”. Proceedings of the 25th international conference on Machine learning. ACM, 2008: 160–167. [5] R. Akita, A. Yoshihara, T. Matsubara, K. Uehara, “Deep learning for stock prediction using numerical and textual information.” Computer and Information Science (ICIS), 2016 IEEE/ACIS 15th International Conference on. IEEE, 2016: 1–6. [6] D.J. Lary , G.K. Zewdie, X. Liu, D. Wu, E. Levetin, R.J. Allee, N. Malakar, A. Walker, H. Mussa, A. Mannino, D. Aurin “Machine Learning Applications for Earth Observation”, In book: Earth Observation Open Science and Innovation, ISSI Scientific Report Series 15, P.-P. Mathieu, C. Aubrecht (eds.), https://doi.org/10.1007/978-3-319-65633-5_8 [7] L. Breiman, J.H. Friedman, R.A Olshen and C. J. Stone “Classification and Regression Trees”, 1984, Wadsworth International Group, Belmont, CA. [8] L. Breiman, “Random forests”, Machine Learning 45 (2001), 5–32. [9] D. Basak, S. Pal and D. C. Patranabis, “Support Vector Regression”, Neural Information Processing – Letters and Reviews, Vol. 11, No. 10, October 2007 [10] Demarcq, H., and Faure, V. ( 2000), “Coastal upwelling and associated retention indices from satellite SST. Application to Octopus vulgaris recruitment”, Oceanogr. Acta, 23, 391– 407. [11] Sawadogo, S., Brajard, J., Niang, A., Lathuilière, C., Crepon, M., and Thiria, S. ( 2009), “Analysis of the Senegalo‐Mauritanian upwelling by processing satellite remote sensing observations with topological maps”, in Proceedings IEEE International Joint Conference on Neural Networks, IJCNN, pp. 2826–2832, Atlanta, USA [12] Farikou, O., Sawadogo, S., Niang, A., Brajard, J., Mejia, C., Crépon, M., and Thiria, S. ( 2013), “Multivariate analysis of the Senegalo‐Mauritanian area by merging satellite remote sensing ocean color and SST observations”, J. Environ. Earth Sci., 5( 12), 756– 768.