TELKOMNIKA Telecommunication, Computing, Electronics and Control
Vol. 18, No. 5, October 2020, pp. 2525~2532
ISSN: 1693-6930, accredited First Grade by Kemenristekdikti, Decree No: 21/E/KPT/2018
DOI: 10.12928/TELKOMNIKA.v18i5.14887
Journal homepage: http://journal.uad.ac.id/index.php/TELKOMNIKA
Precipitation prediction using recurrent neural networks
and long short-term memory
Mishka Alditya Priatna1, Esmeralda C. Djamal2
1 Department of Meteorology, Institut Teknologi Bandung, Indonesia
2 Department of Informatics, Universitas Jenderal Achmad Yani, Indonesia
Article Info
Article history: Received Jun 9, 2019; Revised Apr 12, 2020; Accepted May 1, 2020
Keywords: Deep learning, Long short-term memory, Meteorology data, Precipitation prediction, Recurrent neural networks
ABSTRACT
Prediction of meteorological variables such as precipitation, temperature, wind speed, and solar radiation is beneficial for human life. Observation data for these variables have been recorded over time for more than thirty years at scattered observation stations, which provides the opportunity to map patterns into predictions. However, weather variables are highly complex, partly because they are influenced by decadal phenomena such as the El-Nino Southern Oscillation and the Indian Ocean Dipole (IOD). Weather predictions can be characterized by their duration, prediction variables, and observation stations. This research proposed precipitation prediction using recurrent neural networks and long short-term memory. Experiments were carried out by varying the prediction duration, the period used as a feature, the amount of data used, and the optimization model. The results showed that using a shorter time span as a feature gives good accuracy. In addition, weekly predictions are more accurate than monthly ones, at 85.71% compared with 83.33% on the validation data.
This is an open access article under the CC BY-SA license.
Corresponding Author:
Esmeralda Contessa Djamal,
Department of Informatics,
Universitas Jenderal Achmad Yani,
Terusan Jenderal Sudirman Cimahi St., Indonesia.
Email: esmeralda.contessa@lecture.unjani.ac.id
1. INTRODUCTION
Indonesia lies between the Indian Ocean and the Pacific Ocean and between two continents, Asia and Australia, so its climate changes often and is influenced by many factors. Other phenomena, such as the El-Nino Southern Oscillation (ENSO), add to the complexity of the environment [1]. Precipitation is a very complicated part of the climate. Its characteristics differ from one area to another, and it is affected by many factors, such as geographic, topographic, and orographic ones. Island structure and orientation have not been taken into account. As a result, precipitation distribution patterns are not uniform from one area to another on a broad scale. In addition, ENSO influences almost 70% of the change in precipitation in Indonesia, in the Maritime Continent [2]. Meanwhile, ENSO is marked by the displacement of the "warm pool", so that the cloud formation that ordinarily occurs over the Indonesian seas moves eastward to the central Pacific Ocean. This phenomenon can reduce precipitation in several parts of the Pacific area and lead to drought [3]. Because of these dynamics, precipitation cannot be determined precisely.
Precipitation is an essential phenomenon in the climate system, which is chaotic and greatly affects all aspects of life, such as the availability of water resources, urban planning, agriculture, and financial security. Over the past few years, research on precipitation prediction has grown rapidly. Many models have been developed that could improve the accuracy of precipitation prediction [4]. Meanwhile, the life cycle that forms specific patterns depends on money and time [5]. Daily rainfall is indeed random, but in the long run it follows a particular pattern that can be identified. Identifying it requires attention to many uncertain factors, such as climate zones, sunspots, tides, atmospheric circulation, and human activity [6, 7].
The Meteorology, Climatology, and Geophysics Agency (BMKG) provides climate parameter data daily, except for precipitation, which is recorded hourly at each station. The choice of prediction time frame and location is closely related to the intended use. In terms of area, some studies use a global area [8], while other studies predict for specific regions [9]. Climate predictions also differ in time frame, such as daily solar forecasting [8], hourly wind speed [9], hourly temperature [10], multi-step wind speed [11], and daily extreme climate [5]. Other research estimated rainfall over short periods of hours [12], days [13], or weeks [14]. Some research prefers monthly precipitation prediction with ENSO [15] and yearly prediction [16] to help in proper agricultural planning. This study compared weekly and monthly precipitation prediction models.
Weather forecasting attracts research attention because its applications extend across many sectors, from agriculture to flight navigation. The challenge of climate prediction is to choose the right variables and data sets and to choose representative models that can explore hidden structural patterns in a large dataset [15]. Unfortunately, this is not straightforward. Precipitation is a very complicated event because it happens randomly and depends on numerous factors, such as temperature, humidity, wind speed, and cloud pressure. Moreover, the variables that may affect precipitation are not constant, and it is not even certain how many factors influence rainfall. As a result, the input parameters of a model may not be adequate to predict precipitation precisely [17]. Climate forecasting has attracted researchers from various backgrounds because of its effect on human life worldwide. Advances in computer technology and access to large volumes of weather observation data have recently encouraged many researchers to study the patterns in large weather datasets. Machine learning can be used for weather forecasting because it makes it possible to learn the pattern of precipitation together with other variables from earlier time series. Regression problems pose challenging research questions in machine learning, including for weather data. Rainfall is a prime example because it shows unique characteristics with high volatility and chaotic patterns. Therefore, machine learning methods can outperform other methods [4].
Because the variables that drive climate may not be clearly understood, traditional linear forecasting techniques are ill-equipped to handle them and often produce unsatisfactory results. Previous research applied statistical bias correction to the output of a daily climate model in Europe [18], and this was extended to examine the relation between ECMWF and the Meteorology, Climatology, and Geophysics Agency (BMKG) [19]. That research found that the transfer function obtained from the bias correction process can be used to improve the distribution of the 2016 rainfall prediction on the island of Bali and thus obtain a better prediction. Meanwhile, some research increasingly resorts to heuristic and non-linear techniques. Such methods use neural network models [20] with machine learning, regression, and clustering. Other research used a dynamic regional combined short-term rainfall forecasting approach (DRCF) to improve the multilayer perceptron (MLP) and PCA. The study reported an accuracy of 75-92%, depending on the number of MLPs [21].
Weather data prediction has its own characteristics, which depend on the variability of the variables concerned. Rainfall is highly variable compared with solar radiation, wind speed, and temperature. This naturally affects prediction accuracy: previous studies predicting weather and temperature reported better accuracy, so that their models can be used in real time [16]. Other research was developed to predict temperature and humidity [22]. Deep learning is a recent computing approach in data mining and machine learning [6]. Neural networks with deep architectures have become a powerful tool for extracting high-level abstract features from big data. One method often used in deep learning is the convolutional neural network (CNN), which convolves the input with a fixed-size kernel. Weather data can be viewed as imagery, so it can be handled with a CNN [23]. However, for a limited image segment, a CNN will run into memory limitations, so other methods are needed. One option is LSTM for rainfall prediction [24]. A CNN can also be modified to one dimension, for example, for sunshine prediction [25] and precipitation prediction [26].
Meanwhile, RNNs are often used for time series. The distinguishing feature of RNNs is the feedback connection, which conveys information from the previous input so that it is taken into account for the following data. An RNN can learn sequential or time-varying patterns, which makes it a tool for modeling intricate weather data patterns with accurate multi-step estimates [27]. This research predicts rainfall in the Bandung area, which is the center of a basin at a height of 791 m above sea level (ASL). The highest point is in the north, at an altitude of 1050 m ASL, and the lowest point is in the south, at an elevation of 675 m ASL. The surrounding mountains form the city of Bandung into a kind of basin (the Bandung Basin), and the mountain climate significantly affects the climate of the city. Bandung is naturally quite a cool area. However, in recent years, the temperature has been increasing, and the rainy season has become longer and more intense than usual. In 2012, the highest recorded temperature in Bandung was 30.9 °C, in September, and the lowest was 17.4 °C, in July.
This paper proposes a precipitation prediction model for the Bandung region using precipitation, humidity, temperature, and solar radiation data from the preceding 36 years. The model uses RNNs with long short-term memory (LSTM) to predict precipitation. The network learns from prior climate data in consecutive time steps and produces a rainfall prediction. The input variables are minimum temperature, maximum temperature, average temperature, relative humidity, duration of sun radiation, average wind speed, maximum wind speed, and precipitation, which are used to predict rainfall in a certain period.
2. RESEARCH METHOD
2.1. Data set
Weather data were provided by the Indonesian Agency for Meteorology, Climatology, and Geophysics (BMKG) for 1981 to 2017. This research uses the Bandung city region in the analysis. The dataset consists of eight variables (minimum temperature, maximum temperature, average temperature, relative humidity, duration of sun radiation, average wind speed, maximum wind speed, and precipitation) covering 36 years (1981-2017). This research used two periods of precipitation prediction, namely weekly and monthly. The values of the climate variables in each period are the daily averages, minimums, or maximums. There are four scenarios in the experiment, combining variations in the amount of training data (10 years and five years) and the duration of the predictions (monthly and weekly), as follows:
− 10-years: monthly (432 data sets, overlap 11 months)
− 10-years: weekly (1872 data sets, overlap 50 weeks)
− 5-years: monthly (864 data sets, overlap 11 months)
− 5-years: weekly (3744 data sets, overlap 50 weeks)
For all models, 75% of the data is used for training and 25% as non-training (test) data.
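The conversion of daily observations into one value per period can be sketched as follows. This is a minimal illustration, not the authors' code: the function name `aggregate_daily`, the fixed period length of 7 days, the sample temperatures, and the choice of which reducer goes with which variable are all assumptions.

```python
from statistics import mean


def aggregate_daily(daily_values, days_per_period, reducer=mean):
    """Collapse a list of daily values into one value per period.

    Only complete periods are kept; a trailing partial period is dropped.
    """
    n_periods = len(daily_values) // days_per_period
    return [
        reducer(daily_values[p * days_per_period:(p + 1) * days_per_period])
        for p in range(n_periods)
    ]


# Hypothetical two weeks of daily maximum temperature (degrees C).
daily_tmax = [28.0, 29.5, 30.1, 27.8, 28.4, 29.0, 30.9,
              27.0, 26.5, 28.2, 29.9, 30.0, 28.8, 27.6]

weekly_mean = aggregate_daily(daily_tmax, 7)       # daily average per week
weekly_max = aggregate_daily(daily_tmax, 7, max)   # maximum per week
weekly_min = aggregate_daily(daily_tmax, 7, min)   # minimum per week
```

A monthly series could be produced the same way with a different period length, although calendar months of unequal length would need grouping by date instead of by a fixed number of days.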
2.2. Precipitation prediction model
The design of precipitation prediction using RNN and LSTM is shown in Figure 1. The prediction chooses one of ten classes, each with a specific interval. Past research used multivariate data for rainfall prediction [28] with RNN and LSTM. Climate data sometimes has missing or lost observations, so the data need to be prepared automatically before further processing. Possible solutions are interpolation from the available data, multivariate methods [29], or predicting the missing values using a refinement function [10]. Meanwhile, BMKG provides daily data, so weekly and monthly predictions require converting the daily data into weekly or monthly data. Some studies used the average value over the time frame so that it reflects its projections [30]. This research used various variables, i.e., minimum temperature, maximum temperature, average temperature, relative humidity, duration of sun radiation, average wind speed, maximum wind speed, and precipitation, which have different units. Therefore, all datasets are normalized before entering the RNN [18, 25] using (1). The normalization uses the maximum and minimum values to map the data onto a 0-1 scale.
$$x_i = \frac{y_i - \mathrm{valMin}(y)}{\mathrm{valMax}(y) - \mathrm{valMin}(y)} \tag{1}$$
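The min-max normalization in (1) can be sketched as below. The function name, the humidity readings, and the handling of a constant series are assumptions made for illustration.

```python
def min_max_normalize(values):
    """Scale values linearly so the minimum maps to 0 and the maximum to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # Assumption: a constant series has no spread, so map it to 0.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical relative humidity readings (%).
humidity = [60.0, 75.0, 90.0, 82.5]
normalized = min_max_normalize(humidity)
```

Each variable would be normalized separately, so that variables with different units end up on the same 0-1 scale.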
Based on the forecast period, the prediction is divided into two models, i.e., weekly and monthly. The results of the two models are then compared, and each can be used for its own purpose. For flood disaster management, the weekly period is more appropriate, taking into account the carrying capacity of soil absorption, while planning for the planting season can use a monthly period. RNN can adapt to flexible classes [29], whether the period is a true period or a pseudo-period. In this study, rainfall is predicted as one of 10 pre-determined classes, i.e., < 60 mm, 60-120, 120-180, 180-240, 240-300, 300-360, 360-420, 420-480, 480-540, and > 540 mm, as shown in Figure 1. After pre-processing, the second step is the RNN. The dropout layer is used to reduce the number of input neurons: with a probability of 0.5, the number of input neurons passed to the next step is 480. The third step is LSTM layer 2, which takes the dropout layer output as input and uses (2)-(7). The fourth step is the dense layer using the sigmoid function, where the final result of the previous step is entered into (1) to produce a new weight.
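Mapping a rainfall amount to one of the ten classes listed above can be sketched as follows. The function names are hypothetical, and the treatment of boundary values (for example, whether exactly 60 mm belongs to the first or the second class) is an assumption, since the text does not specify it.

```python
CLASS_WIDTH_MM = 60   # width of each interval in the class list
NUM_CLASSES = 10      # < 60, 60-120, ..., 480-540, > 540


def rainfall_class(rainfall_mm):
    """Return the class index (0-9) for a rainfall amount in mm.

    Assumption: the lower edge of each interval is inclusive, so 60 mm
    falls in class 1 and 540 mm or more falls in the last class.
    """
    if rainfall_mm < 0:
        raise ValueError("rainfall cannot be negative")
    return min(int(rainfall_mm // CLASS_WIDTH_MM), NUM_CLASSES - 1)


def class_label(index):
    """Human-readable interval for a class index."""
    if index == 0:
        return "< 60 mm"
    if index == NUM_CLASSES - 1:
        return "> 540 mm"
    return f"{index * CLASS_WIDTH_MM}-{(index + 1) * CLASS_WIDTH_MM} mm"


monthly_total = 125.3                       # hypothetical rainfall in mm
predicted_class = rainfall_class(monthly_total)
```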
2.3. Recurrent neural networks
Deep learning techniques have been successfully applied to solve many problems in climate and geoscience using massive-scale observed and modeled data [31]. One deep learning method is the RNN. Previous research trained three deep learning models, namely RNN, conditional restricted Boltzmann machine (CRBM), and CNN, using ENSO and weather datasets [15]. The best accuracy was obtained by the RNN, at up to 84%. Other research carried out precipitation forecasting using a deep belief network called DBNPF [6] and established the relationship between the characteristics of environmental factors and future precipitation. Other studies proposed predicting extreme weather using artificial neural networks and support vector machines [32].
An RNN works in a way that resembles the brain, which sends and receives information: an RNN likewise sends and receives data from one neuron to another. In general, the human brain is used to make decisions, and rational decision-making often takes the past into account. An RNN is essentially an artificial neural network that uses recurrence to exploit past data. An RNN can predict a future situation [15] and can also classify [33]. Its architecture can be used for data in the form of a sequence or list, as shown in Figure 2. An RNN is a modification of the feedforward neural network, characterized by feedback from output to input.
Figure 1. Precipitation prediction model
(Training and prediction follow the same pipeline: weekly or monthly weather data, pre-processing (missing data handling, conversion to period, normalization), LSTM layer 1 (ReLU), dropout layer (0.2), LSTM layer 2 (sigmoid), and dense layer (sigmoid). Training produces the weights; prediction outputs one of ten precipitation interval classes: <60, 60-120, 120-180, 180-240, 240-300, 300-360, 360-420, 420-480, 480-540, >540.)
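Figure 2. RNNs architecture
(Folded network with input xt, hidden state, and output yt, unrolled over the time steps x0, x1, x2, ..., xt, with hidden states h0, h1, h2, ..., outputs y0, y1, y2, ..., and the weights U, V, W repeated at every step.)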
In the architecture of RNNs, there are several connections from one neuron to the next. An RNN processes data in a time sequence, so each piece of information is related to the others. This gives the RNN a memory that retains the results of the previous step for use in the next step. However, when an RNN processes a large amount of data, it has difficulty maintaining information from earlier steps. The first hidden layer takes its weights from the input layer, and each subsequent layer receives the weights of the prior layer. The calculation of the next hidden layer then uses the corresponding input and the previous hidden layer. Meanwhile, the forecast at the output layer uses the last hidden layer. The hidden layer is calculated using the function in (2), and the output is calculated using the softmax function in (3).
$$h(t) = \tanh\big(U x(t) + W h(t-1) + b\big) \tag{2}$$
$$y(t) = \mathrm{softmax}\big(V h(t) + c\big) \tag{3}$$
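Equations (2) and (3) can be traced with a small forward pass written in plain Python. This is only a sketch: the layer sizes, weight values, and input sequence are made-up toy numbers, and the helper names are hypothetical.

```python
import math


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def softmax(scores):
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def rnn_step(x_t, h_prev, U, W, V, b, c):
    """One time step: equation (2) for h(t), then equation (3) for y(t)."""
    pre = [ux + wh + b_k
           for ux, wh, b_k in zip(matvec(U, x_t), matvec(W, h_prev), b)]
    h_t = [math.tanh(p) for p in pre]
    logits = [vh + c_k for vh, c_k in zip(matvec(V, h_t), c)]
    return h_t, softmax(logits)


def rnn_forward(sequence, U, W, V, b, c):
    """Run the recurrence over a whole sequence, starting from h = 0."""
    h = [0.0] * len(b)
    y = None
    for x_t in sequence:
        h, y = rnn_step(x_t, h, U, W, V, b, c)
    return h, y


# Hypothetical toy sizes: 2 input features, 2 hidden units, 3 output classes.
U = [[0.5, -0.3], [0.1, 0.8]]
W = [[0.2, 0.0], [0.0, 0.2]]
V = [[1.0, -1.0], [0.5, 0.5], [-1.0, 1.0]]
b = [0.0, 0.0]
c = [0.0, 0.0, 0.0]
sequence = [[0.2, 0.7], [0.4, 0.1], [0.9, 0.5]]  # normalized inputs in [0, 1]

h_last, y_last = rnn_forward(sequence, U, W, V, b, c)
```

The same U, W, and V are reused at every step, and `y_last` is a probability distribution over the output classes, so the predicted class is the index of its largest entry.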
In this study, the data consist of various sequences, such as meteorological variables, so the data need to be grouped in the input layer for the training process. However, RNN has the disadvantage of short-term memory. Therefore, for predictions that use a large amount of data, several RNN variations exist to solve the problem, among them the gated recurrent unit (GRU), backpropagation through time (BPTT), and LSTM. This research used long short-term memory (LSTM) to overcome the vanishing gradient [34].
The RNN training process is similar to neural network training using the backpropagation algorithm, with a small difference. The parameters are shared equally across every time step, so the gradient for each output depends not only on the calculation at the current time step but also on the previous ones [35]. This research used LSTM gates to overcome long-term dependence, a phenomenon that often occurs in sequential data processing, as shown in Figure 3.
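Figure 3. LSTM architecture
(Chained LSTM cells at t-1, t, and t+1; the cell state runs from Ct-1 to Ct and the state from St-1 to St, with input xt, sigmoid (σ) and tanh blocks, and pointwise × and + operations.)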
The key to LSTM is the cell state, drawn in the architecture as a horizontal line that runs from $C_{t-1}$ to $C_t$. LSTM can delete information from or add information to the cell state through structures called gates. A gate is a mechanism for passing information, which consists of a binary sigmoid function ($\sigma$) and a pointwise multiplication operation (×). The binary sigmoid function is shown in (4). The first step in LSTM is to decide which information will be discarded from the cell state; this is done by the forget gate using (5) [34].
$$f(x) = \frac{1}{1 + e^{-x}} \tag{4}$$
$$f_t = \sigma\big(W_f [h_{t-1}, x_t] + b_f\big) \tag{5}$$
where $h_{t-1}$ is the previous output and $x_t$ is the current input. The forget gate gives a value in the interval 0-1 for every cell: 1 means the information is kept, and 0 means the information is deleted. The second step is to decide which new data will be stored in the cell. This step is divided into two parts. First, the input gate determines the values that will be updated using (6). Second, a new cell candidate ($\hat{C}_t$) is calculated using (7).
$$i_t = \sigma\big(W_i [h_{t-1}, x_t] + b_i\big) \tag{6}$$
$$\hat{C}_t = \tanh\big(W_c [h_{t-1}, x_t] + b_c\big) \tag{7}$$
The cell candidate ($\hat{C}_t$) is then added to the old cell state ($C_{t-1}$) using (8): the old cell state is multiplied by the forget gate, and the cell candidate is multiplied by the input gate. Their sum is the updated cell state.
$$C_t = f_t * C_{t-1} + i_t * \hat{C}_t \tag{8}$$
The last step is the output gate, which determines the output to be produced based on the cell state. The output gate is calculated with the binary sigmoid function as shown in (9) and is then multiplied by the activation function of the updated cell state using (10). Some previous research used sigmoid and tanh [35].
$$o_t = \sigma\big(W_o [h_{t-1}, x_t] + b_o\big) \tag{9}$$
$$h_t = o_t * \tanh(C_t) \tag{10}$$
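The gate equations (5)-(10) can be followed step by step in a small plain-Python cell. This is a sketch under stated assumptions: the parameter dictionary layout, the helper `zero_params`, the use of a separate input-gate weight `W_i`, and all numeric values are illustrative choices rather than the configuration used in the experiments.

```python
import math


def sigmoid(x):
    """Binary sigmoid, equation (4)."""
    return 1.0 / (1.0 + math.exp(-x))


def affine(weights, bias, vector):
    """Compute W v + b for a matrix given as a list of rows."""
    return [sum(w * v for w, v in zip(row, vector)) + b_k
            for row, b_k in zip(weights, bias)]


def lstm_step(x_t, h_prev, c_prev, p):
    """One LSTM time step following equations (5)-(10)."""
    concat = h_prev + x_t  # [h_{t-1}, x_t] as one vector
    f_t = [sigmoid(a) for a in affine(p["W_f"], p["b_f"], concat)]        # (5)
    i_t = [sigmoid(a) for a in affine(p["W_i"], p["b_i"], concat)]        # (6)
    c_hat = [math.tanh(a) for a in affine(p["W_c"], p["b_c"], concat)]    # (7)
    c_t = [f * c + i * g for f, c, i, g in zip(f_t, c_prev, i_t, c_hat)]  # (8)
    o_t = [sigmoid(a) for a in affine(p["W_o"], p["b_o"], concat)]        # (9)
    h_t = [o * math.tanh(c) for o, c in zip(o_t, c_t)]                    # (10)
    return h_t, c_t


def zero_params(hidden, inputs):
    """Hypothetical helper: all-zero weights and biases for a tiny cell."""
    def zeros():
        return [[0.0] * (hidden + inputs) for _ in range(hidden)]
    return {"W_f": zeros(), "b_f": [0.0] * hidden,
            "W_i": zeros(), "b_i": [0.0] * hidden,
            "W_c": zeros(), "b_c": [0.0] * hidden,
            "W_o": zeros(), "b_o": [0.0] * hidden}


# With all-zero parameters every gate outputs sigmoid(0) = 0.5 and the
# candidate is tanh(0) = 0, so the old cell state is simply halved.
params = zero_params(hidden=1, inputs=2)
h1, c1 = lstm_step([0.3, 0.6], [0.0], [1.0], params)

# A large forget bias and a very negative input bias keep the old cell state.
params["b_f"], params["b_i"] = [20.0], [-20.0]
h2, c2 = lstm_step([0.3, 0.6], [0.0], [1.0], params)
```

The second call illustrates the role of the forget gate: when it is close to 1 and the input gate is close to 0, the cell state passes through almost unchanged, which is how the cell can carry information over long sequences.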
3. RESULTS AND ANALYSIS
Experiments on the precipitation prediction models were carried out with variations in the interval of training data (10 years and five years) and the prediction time (monthly and weekly), with the RNN configuration in Table 1. To obtain the optimal prediction, the dataset interval and the prediction duration were varied. The experiment also tested the accuracy of two optimization models, adaptive moment estimation (Adam) and stochastic gradient descent (SGD). Accuracy is calculated by comparing the output class with the actual class label. Weather data with intervals of ten years and five years were used, and both were tested for weekly and monthly prediction. The resulting accuracy is shown in Table 2, in Figure 4 for the 10-years dataset, and in Figure 5 for the 5-years dataset. The simulation used the two optimizers (Adam and SGD) to correct the weights over 200 epochs.
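The accuracy measure described above, the share of samples whose predicted class equals the actual class, can be sketched as follows. The function name and the sample class indices are hypothetical.

```python
def accuracy_percent(predicted, actual):
    """Percentage of samples whose predicted class matches the actual class."""
    if len(predicted) != len(actual) or not actual:
        raise ValueError("inputs must be non-empty and of equal length")
    correct = sum(1 for p, a in zip(predicted, actual) if p == a)
    return 100.0 * correct / len(actual)


# Hypothetical class indices (0-9) for five test periods.
predicted_classes = [0, 3, 3, 9, 1]
actual_classes = [0, 3, 4, 9, 1]
score = accuracy_percent(predicted_classes, actual_classes)
```

Here four of the five predictions match, so `score` is 80.0. Note that a prediction in a neighboring interval counts as wrong under this measure, just like one that is far off.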
Table 1. RNN configuration
Configuration    10-years              5-years
                 weekly    monthly     weekly    monthly
Dataset          1872      432         3744      864
Input            4160      960         2080      480
Hidden           4160      64          2080      64
Dropout          0.2       0.2         0.2       0.2
Dense            13        13          13        13
Output layer     10        10          10        10
Table 2. Accuracy (%) of precipitation prediction by dataset and duration
         10-years (432 data set)                5-years (864 data set)
         Weekly            Monthly              Weekly            Monthly
Model    Training  Test    Training  Test       Training  Test    Training  Test
Adam     99.21     80.95   96.60     73.25      99.60     85.71   99.21     83.33
SGD      97.22     65.07   91.66     61.42      96.42     79.36   92.85     75.21
Figure 4. The accuracy of 10 years (432 data set): (a) weekly, (b) monthly
Figure 5. The accuracy of 5 years (864 data set): (a) weekly, (b) monthly
This study looks for representative data sets that capture the patterns of rainfall occurrence. Table 2 shows that an interval of 5 years is enough to provide better accuracy for weather prediction. Using shorter time ranges results in more variations of training data, which provides better accuracy. Based on Figure 5 and Table 2, the 5-year data sets give an accuracy of 83.33% for monthly and 85.71% for weekly prediction, while the 10-year data sets in Figure 4 give 73.25% for monthly and 80.95% for weekly prediction. This result strengthens the hypothesis that the amount of training data is more dominant than the number of variables processed. Regarding the prediction periods in Table 2, weekly predictions have better accuracy (85.71%), given that the day-to-day variation of meteorological data within a week is not too large. Meanwhile, daily meteorology varies more within a month, so combining the days of a month produces a higher deviation. However, the model used to predict rain for a month is more robust when the available weather data is incomplete.
Precipitation prediction depends on the training data sequence, configuration, method, and forecast period. In this study, Adam provided better accuracy than SGD, considering that Adam uses aggregate data in training, whereas SGD uses only one or several randomly selected parts, which may not represent all data in each class and may therefore lead to a local minimum. The weekly forecast offers higher accuracy than the monthly forecast because meteorological data within a week are more homogeneous than within a month. The five-year interval provides better accuracy than the ten-year interval, considering that it yields more training data sets and hence more variation. The best test accuracy was 85.71%, obtained for weekly prediction using RNN and LSTM with the five-year data sets (Table 2). These results are comparable with those of MLP, at 75-92% [21], and of RNN with heuristic optimization, at 59-84.6% [15].
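For reference, the two optimizers compared in Table 2 differ in how a gradient is turned into a weight update. The scalar sketch below uses the commonly published update rules; the learning rates, the Adam constants, and the toy loss are assumptions and are not the settings used in the experiments.

```python
import math


def sgd_step(theta, grad, lr=0.01):
    """Plain SGD: move against the gradient, scaled by the learning rate."""
    return theta - lr * grad


def adam_step(theta, grad, m, v, t, lr=0.001, beta1=0.9, beta2=0.999, eps=1e-8):
    """One Adam update with bias-corrected first and second moment estimates."""
    m = beta1 * m + (1.0 - beta1) * grad          # running mean of gradients
    v = beta2 * v + (1.0 - beta2) * grad * grad   # running mean of squared gradients
    m_hat = m / (1.0 - beta1 ** t)                # bias correction, t starts at 1
    v_hat = v / (1.0 - beta2 ** t)
    theta = theta - lr * m_hat / (math.sqrt(v_hat) + eps)
    return theta, m, v


def loss_grad(theta):
    """Gradient of the toy loss (theta - 3)^2."""
    return 2.0 * (theta - 3.0)


# First update from theta = 0 with each optimizer.
theta_sgd = sgd_step(0.0, loss_grad(0.0))
theta_adam, m, v = adam_step(0.0, loss_grad(0.0), m=0.0, v=0.0, t=1)
```

In this sketch the SGD step is proportional to the gradient, while the first Adam step has a size close to the learning rate regardless of the gradient magnitude, because the update is normalized by the running estimate of the squared gradient.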
4. CONCLUSION
In this work, we studied how multiple weather variables can be used to predict monthly rainfall. We exploited recurrent neural networks, which gave an accuracy of 85.71% on test data. The research showed that five years of weather data can be used to predict precipitation weekly and monthly. Weather data from the last five years can predict rainfall for a month of the following year. However, weekly predictions have higher accuracy. The experimental results also show that a large number of data sets can improve accuracy. In the future, the proposed approach should be compared with the results of a public weather forecast center to demonstrate the effectiveness of the model, and the output should be predicted as a value, improving on the output classes of this study. Given the current uncertainty of rainfall, predictions should not only pay attention to past data patterns but also consider extreme phenomena such as El Nino as additional features.
REFERENCES
[1] R. Hidayat, M. Juniarti, and U. Marufah, “Impact of La Niña and La Niña Modoki on Indonesia Rainfall Variability,”
Earth and Environmental Science, vol. 149, no. 1, pp. 217-222, 2017.
[2] Supari, F. Tangang, E. Salimun, E. Aldrian, A. Sopaheluwakan, and L. Juneng, “ENSO Modulation of Seasonal
Rainfall and Extremes in Indonesia,” Climate Dynamics, vol. 51, no. 7, pp. 1-22, 2017.
[3] E. Mulyana, “Relationship between ENSO and Rainfall Variations in Indonesia,” Jurnal Sains & Teknologi
Modifikasi Cuaca, vol. 3, pp. 1-4, 2002.
[4] S. Cramer, M. Kampouridis, et al., “An Extensive Evaluation of Seven Machine Learning Methods for Rainfall
Prediction in Weather Derivatives,” Expert Systems with Applications, vol. 85, pp. 169-181, 2017.
[5] Y. Liu, E. Racah, J. Correa, et al., “Application of Deep Convolutional Neural Networks for Detecting Extreme
Weather in Climate Datasets,” International Conference on Advances in Big Data Analytics, pp. 81-88, 2016.
[6] P. Zhang, L. Zhang, et al., “A Deep-Learning Based Precipitation Forecasting Approach Using Multiple
Environmental Factors,” 2017 IEEE 6th International Congress on Big Data, BigData Congress, pp. 193-200, 2017.
[7] N. Sinha, B. Purkayastha, and L. Marbaniang, “Weather Prediction by Recurrent Neural Network Dynamics,”
International Journal Intelligent Engineering Informatics, vol. 2, no. 2/3, pp. 17-80, 2014.
[8] S. G. Gouda, Z. Hussein, S. Luo, and Q. Yuan, “Model selection for accurate daily global solar radiation prediction
in China,” Journal of Cleaner Production, vol. 221, pp. 132-144, 2019.
[9] I. Tanaka and H. Ohmori, “Method Selection in Different Regions for Short-Term Wind Speed Prediction in Japan,”
SICE Annual Conference, vol. 2, pp. 189-194, 2015.
[10] H. K. Kim, “Temperature Prediction Using the Missing Data Refinement Model Based on a Long Short-Term
Memory Neural Network,” Atmosphere, vol. 10, no. 11, pp. 1-16, 2019.
[11] F. Li, G. Ren, and J. Lee, “Multi-step wind speed prediction based on turbulence intensity and hybrid deep neural
networks,” Energy Conversion and Management, vol. 186, pp. 306-322, 2019.
[12] S. Moon, Y. Kim, Y. Hee, and B. Moon, “Application of Machine Learning to an Early Warning System for Very
Short-Term Heavy Rainfall,” Journal of Hydrology, vol. 568, pp. 1042–1054, 2019.
[13] S. Yuan, X. Luo, B. Mu, J. Li, and G. Dai, “Prediction of North Atlantic Oscillation Index with Convolutional LSTM
Based on Ensemble Empirical Mode Decomposition,” Atmosphere, vol. 10, no. 252, pp. 2-13, 2019.
[14] E. P. Prasetya and E. C. Djamal, “Rainfall Forecasting for the Natural Disasters Preparation Using Recurrent Neural
Networks,” 2019 International Conference on Electrical Engineering and Informatics (ICEEI), pp. 52-57, 2019.
[15] A. G. Salman, B. Kanigoro, and Y. Heryadi, “Weather Forecasting Using Deep Learning Techniques,” in 2015
International Conference on Advanced Computer Science and Information Systems (ICACSIS), 2015, pp. 281–285.
[16] M. Kannan, S. Prabhakaran, and P. Ramachandran, “Rainfall Forecasting Using Data Mining Technique,”
International Journal of Engineering and Technology, vol. 2, no. 6, pp. 397-400, 2010.
[17] S. K. Mohapatra, A. Upadhyay, and C. Gola, “Rainfall Prediction based on 100 years of Meteorological Data,” International
Conference on Computing and Communication Technologies for Smart Nation (IC3TSN), pp. 162-166, 2017.
[18] C. Piani, J. O. Haerter, and E. Coppola, “Statistical bias correction for daily precipitation in regional climate models
over Europe,” Theoretical and Applied Climatology, vol. 99, no. 1–2, pp. 187-192, 2010.
[19] D. Lealdi, S. Nurdiati, and A Sopaheluwakan, “Statistical Bias Correction Modelling for Seasonal Rainfall Forecast
for the Case of Bali Island,” Journal of Physics, vol. 1008, no. 1, pp. 1-10, 2018.
[20] B. K. Rani and A. Govardhan, “Rainfall Prediction Using Data Mining Techniques a Survey,” The Second
International Conference on Information Technology Convergence and Services, pp. 23-30, 2013.
[21] P. Zhang, Y. Jia, J. Gao, W. Song, and H. Leung, “Short-term Rainfall Forecasting Using Multilayer Perceptron,”
IEEE Transactions on Big Data, vol. 6, no. 1, pp. 93-106, 2018.
[22] M. A. Zaytar and C. El Amrani, “Sequence to Sequence Weather Forecasting with Long Short-Term Memory
Recurrent Neural Networks,” International Journal of Computer Applications, vol. 143, no. 11, pp. 7-11, 2016.
[23] B. Zhao, X. Li, X. Lu, and Z. Wang, “A CNN-RNN Architecture for Multi-Label Weather Recognition,”
Neurocomputing, vol. 322, pp. 45-57, 2018.
[24] X. Shi, Z. Chen, and H. Wang, “Convolutional LSTM Network : A Machine Learning Approach for Precipitation
Nowcasting,” Advances in Neural Information Processing Systems, vol. 28, pp. 802-810, 2015.
[25] A. Mulyadi and E. C. Djamal, “Sunshine Duration Prediction Using 1D Convolutional Neural Networks,” 2019 6th
International Conference on Instrumentation, Control, and Automation (ICA), 2019.
[26] M. Qiu, P. Zhao, K. Zhang, J. Huang, X. Shi, and X. Wang, “A short-Term Rainfall Prediction Model Using
Multi-task Convolutional Neural Networks,” International Conference on Data Mining, pp. 395-404, 2017.
[27] F. J. Chang, P. A. Chen, Y. R. Lu, E. Huang, and K. Y. Chang, “Real-time Multi-Step-Ahead Water Level Forecasting
by Recurrent Neural Networks for Urban Flood Control,” Journal of Hydrology, vol. 517, pp. 836-846, 2014.
[28] Y. Liu, C. Gong, L. Yang, and Y. Chen, “DSTP-RNN: a dual-stage two-phase attention-based recurrent neural
network for long-term and multivariate time series prediction,” Expert Systems With Applications, vol. 143, 2020.
[29] Yagmur Gizem Cinar, H. Mirisaeea, P. Goswami, E. Gaussiera, and A.-A. Bachir, “Period-aware Content Attention
RNNs for Time Series Forecasting with Missing Values,” Neurocomputing, vol. 312, pp. 177-186, 2018.
[30] F. R. Ningsih and E. C. Djamal, “Wind Speed Forecasting Using Recurrent Neural Networks and Long Short Term
Memory,” 6th International Conference on Instrumentation, Control, and Automation (ICA), pp. 137-141, 2019.
[31] S. Kim, S. Ames, J. Lee, C. Zhang, and A. C. Wilson, “Resolution Reconstruction of Climate Data with Pixel
Recursive Model,” International Conference on Data Mining Workshops, pp. 313-321, 2017.
[32] P. A. Chen, L. C. Chang, and F. J. Chang, “Reinforced recurrent neural networks for multi-step-ahead flood
forecasts,” Journal of Hydrology, vol. 497, pp. 71-79, 2013.
[33] Y. Luo, “Recurrent Neural Networks for Classifying Relations in Clinical Notes,” Journal of Biomedical Informatics,
vol. 72, pp. 85-95, 2017.
[34] S. Alhagry, A. A. Fahmy, and R. A. El-Khoribi, “Emotion Recognition based on EEG using LSTM Recurrent Neural
Network,” International Journal of Advanced Computer Science and Applications, vol. 8, no. 10, pp. 356-358, 2017.
[35] S. Poornima and M. Pushpalatha, “Prediction of Rainfall Using Intensified LSTM Based Recurrent Neural Network
with Weighted Linear Units,” Atmosphere, vol. 10, no. 11, pp. 1-18, 2019.
BIOGRAPHIES OF AUTHORS
Mishka Alditya Priatna received his bachelor's degree from the Department of Meteorology,
Institut Teknologi Bandung, in 2019.
Email: mishka.alditya@gmail.com
Esmeralda Contessa Djamal received a Bachelor's degree in Engineering Physics from Institut
Teknologi Bandung in 1994 and a Master's degree in Instrument and Control from Institut
Teknologi Bandung in 1998. She completed her doctoral program at Institut Teknologi Bandung
in 2005 and has continued research on EEG classification since her Ph.D. dissertation. She is
a lecturer in the Informatics Department, Universitas Jenderal Achmad Yani.
Email: esmeralda.contessa@lecture.unjani.ac.id

Precipitation prediction using recurrent neural networks and long short-term memory

  • 1. TELKOMNIKA Telecommunication, Computing, Electronics and Control Vol. 18, No. 5, October 2020, pp. 2525~2532 ISSN: 1693-6930, accredited First Grade by Kemenristekdikti, Decree No: 21/E/KPT/2018 DOI: 10.12928/TELKOMNIKA.v18i5.14887  2525 Journal homepage: http://journal.uad.ac.id/index.php/TELKOMNIKA Precipitation prediction using recurrent neural networks and long short-term memory Mishka Alditya Priatna1 , Esmeralda C. Djamal2 1 Department of Meteorology, Institut Teknologi Bandung, Indonesia 2 Department of Informatics, Universitas Jenderal Achmad Yani, Indonesia Article Info ABSTRACT Article history: Received Jun 9, 2019 Revised Apr 12, 2020 Accepted May 1, 2020 Prediction of meteorological variables such as precipitation, temperature, wind speed, and solar radiation is beneficial for human life. The variable observations data is available from time to time for more than thirty years, scattered each observation station makes the opportunity to map patterns into predictions. However, the complexity of weather variables is very high, one of which is influenced by Decadal phenomena such as El-Nino Southern Oscillation and IOD. Weather predictions can be reviewed for the duration, prediction variables, and observation stations. This research proposed precipitation prediction using recurrent neural networks and long short-term memory. Experiments were carried out using the prediction duration factor, the period as a feature and the amount of data set used, and the optimization model. The results showed that the time-lapse as a shorter feature gives good accuracy. Also, the duration of weekly predictions provides more accuracy than monthly, which is 85.71% compared to 83.33% of the validation data. Keywords: Deep learning Long short-term memory Meteorology data Precipitation prediction Recurrent neural networks This is an open access article under the CC BY-SA license. 
Corresponding Author: Esmeralda Contessa Djamal, Department of Informatics, Universitas Jenderal Achmad Yani, Terusan Jenderal Sudirman Cimahi St., Indonesia. Email: esmeralda.contessa@lecture.unjani.ac.id 1. INTRODUCTION Indonesia is between the Indian Ocean and the Pacific Ocean and two continents, especially Asia and Australia, so the climate often changes and is influenced by many factors. Other phenomena, such as El-Nino Southern Oscillation (ENSO), add to the complexity of the environment [1]. Precipitation is a part of the climate that has very complicated. Its characteristic in every area is indeed different from one another. It is affected by many factors like geographic, topographic, and monographic. Island structure and orientation have not been calculated. As a result, the precipitation distribution patterns are not prevalent in each area with others in a broad scope. Besides that, ENSO influences almost 70% of the change of precipitation in Indonesia in the Maritime Continent [2]. In meanwhile, other parts of the world marked by the displacement of "warm pool" and cloud formation ordinary occurred in Indonesia sea go easterly to the middle of the Pacific Ocean. This phenomenon can degrade the precipitation in several parts in the Pacific area so that it runs into drought [3]. With its dynamism, precipitation could not be discovered precisely. Precipitation is an essential phenomenon in the climate system, which is chaotic and greatly affects all aspects of life, such as the availability of water resources, urban planning, agriculture, and financial security. Over the past few years, research on precipitation prediction is overgrowing. Many models have been could improve the accuracy of precipitation prediction [4]. Meanwhile, the life cycle that forms specific patterns
  • 2.  ISSN: 1693-6930 TELKOMNIKA Telecommun Comput El Control, Vol. 18, No. 5, October 2020: 2525 - 2532 2526 depends on money and time [5]. Indeed, daily rainfall is random. But in the long run, have a particular model that can be identified. It's just also paying attention to many factors that are uncertain, such as climate zones, sunspots, tides, atmospheric circulation, and other human activity factors [6, 7]. Climate parameter data provided by the Meteorology, Climatology, and Geophysics Agency (BMKG) daily, except for precipitation produced every hour of each station. The use of time and prediction location area, closely related to the use of usage. Based on the territory of use, some studies use a global area [8] while other studies are predictive of specific regions [9]. Meanwhile, climate prediction has a time frame, such as daily [8] of solar forecasting, hours of wind speed [9], hours of temperature [10], multi-step time of wind speed [11], extreme climate every day [5]. Other research estimated rainfall in a short time or hours [12], days [13], weeks [14]. Some research prefers precipitation prediction in periods monthly with ENSO [15] and yearly [16] to help in proper agricultural planning. This study compared the estimated weekly and monthly precipitation models. Weather forecasting is a new research problem because its application is extensive in many sectors, such as agriculture to flight navigation. The challenge of climate prediction is to choose the right variables and data sets and to choose representative models to be able to explore hidden structural patterns in a large dataset [15]. Unfortunately, it is not a convenient case. Precipitation is a very complicated event because it happens randomly and depends on numerous factors like temperature, humidity, wind speed, and cloud pressure. 
Besides that, the dependent variables which are possible to affect precipitation are not constant even it cannot be sure how many factors may impact the rainfall. It makes the input parameters to the model may not adequate to predict precipitation precisely [17]. Climate forecasting made attention to many researchers of various backgrounds due to its effect on global human life. The support of computer technology and accessibility to obtain big data of weather observation recently made many researchers are encouraged to learn more about the pattern in the large dataset of weather prediction. It can predict weather forecasting using machine learning. The method makes possible learning the pattern of precipitation with other variables in time series before. Regression problems provide some challenging research in the field of machine learning, including weather data. Rainfall is a prime example because it shows unique characteristics with high volatility and chaotic patterns. Therefore the machine learning method can outperform other methods [4]. Prediction variables of climate which may not be clearly understood, traditional linear forecasting techniques are ill-equipped to handle, often producing unsatisfactory results. Previous research using statistical bias correction on the output of the daily climate model in Europe [18] improved to see the relation between ECMWF and the Meteorology, Climatology, and Geophysics Agency (BMKG) [19]. The research resulted in the value of the transfer function formed from the bias correction process can be used to improve the distribution of the 2016 rainfall prediction on the island of Bali, to obtain a better prediction. In meanwhile, some research increasingly resorts to techniques that are heuristic and non-linear. Such methods use neural network models [20] with machine-learning, regression, and clustering. 
Other research used dynamic regional combined short-term rainfall forecasting approach (DRCF) to improve multilayer perceptron and PCA. The study gave accuracy 75-92% but depended on the number of MLP [21]. Weather data prediction has its characteristics, which depend on the variability of these variables. Rainfall is very variable compared to solar radiation, wind speed, and temperature. Of course, this condition has an impact on prediction accuracy, as previous studies predicting weather and temperature provide better accuracy than real ones so that they can be used in real-time [16]. Other research developed to predict temperature and humidity [22]. Deep learning is new computing in data mining and machine learning [6]. A neural network with deep architectures has become a kind of powerful tool to retrieve the high-level abstract features of big data. Some methods that are often used in deep learning are convolutional neural network (CNN), which convolutes with a fixed size kernel. Weather data can be viewed as imagery, so it can be resolved with CNN [23]. However, for a limited image segment, it indeed will collide with memory limitations, so other methods are needed. One way that can be used is LSTM for rainfall prediction [24]. Using CNN can be modified in one dimension, for example, for sunlight prediction [25], and predict precipitation [26]. Meanwhile, for time series often use RNN. The uniqueness of the RNNs is the feedback connection, which conveys interference information at the previous input that will be accommodated to the following facts. The RNN can study sequential or time-varying patterns so that it is tools in modeling intricate weather data patterns with accurate multi-step estimates [27]. This research predicts rainfall in the Bandung area, which is the center of the basin, which has a height of 791m above sea level (ASL). 
The highest point is in the North with an altitude of 1050 m above sea level, and the lowest point is in the south with an elevation of 675 m above sea level. The area surrounded by mountains forms the city of Bandung into a kind of basin (Bandung Basin). The surrounding mountain climate significantly affects the Bandung city climate. However, in recent years, the temperature has been increasing, and the rainy season becomes more prolonged than usual. In past years, the rainy season is more intensive happening in Bandung. Naturally, Bandung is quite a cool area. During
  • 3. TELKOMNIKA Telecommun Comput El Control  Precipitation prediction using recurrent neural networks and… (Mishka Alditya Priatna) 2527 the year 2012, recorded that the highest temperature in Bandung reached 30.9◦C, which occurred in September, and the lowest temperature in Bandung in 2012 was 17.4◦C that happened in July. This paper proposed a precipitation prediction model of the Bandung region using precipitation, humidity, temperature, solar radiation of 36-years before. The model using RNNs with long short-term memory (LSTM) to predict precipitation. The uniqueness of the RNNs is the feedback connection, which conveys interference information at the previous input that will be accommodated to the next data. Machine learning used this model Network based on consecutive time with prior climate data, so produced rainfall prediction. The input variables of machine learning are minimum temperature, maximum temperature, average temperature, relative humidity, duration of sun radiation, average wind speed, maximum wind speed, and precipitation to predict rainfall in a certain period. 2. RESEARCH METHOD 2.1. Data set Weather data provided by the Indonesian Agency for Meteorological, Climatological, and (BMKG) from 1981 to 2017. This research using the Bandung city region in the analysis. Datasets provided consists of eight variables (minimum temperature, maximum temperature, average temperature, relative humidity, duration of sun radiation, average wind speed, maximum wind speed, and precipitation) of 36 years (1981-2017). This research used two periods of precipitation prediction, mainly weekly and monthly configurations. The values of climate variables in that period are the daily averages, minimum or maximum. 
There are four scenarios in the experiment, particularly, variations in the amount of training data (10 years and five years), and the duration of the predictions (monthly and weekly) with details: − 10-years: annual (432 data sets, overlap 11 months) − 10-years: monthly (1872 data sets, overlap 50 weeks) − 5-years: annual (864 data sets, overlap 11 months) − 5-years: monthly (3744 data sets, overlap 50 weeks) All of the models, 75% is used for training data, and 25% is used for non-training or test data. 2.2. Precipitation prediction model The design of precipitation prediction using RNN and LSTM is shown in Figure 1. Prediction chooses one of ten classes with a specific interval. Past research used multivariates for rainfall prediction [28] with RNN and LSTM. Sometimes, climate data has missing or lost observation. Therefore, the data need to prepare automatically before the next process. Some solution is an interpolation of some available data, multivariate [29], or predict the missing using refinement function [10]. Meanwhile, BMKG provides daily data. Therefore, weekly and monthly predictions need to convert daily or weekly data. Some studies used the average value in this time frame so that it reflects its projections [30]. This research used various variables, i.e., minimum temperature, maximum temperature, average temperature, relative humidity, duration of sun radiation, average wind speed, maximum wind speed, and precipitation, which have different units. Therefore, all datasets before entering the RNN are normalized earlier [18, 25] using (1). The normalization takes the maximum and minimum values on the 0-1 scale. 𝑥𝑖 = 𝑦−𝑣𝑎𝑙𝑀𝑖𝑛( ) 𝑣𝑎𝑙𝑀𝑎𝑥( )−𝑣𝑎𝑙𝑀𝑖𝑛( ) (1) Based on the forecast period, it is divided into two models, i.e., weekly and monthly. The results of the study are then compared and can be utilized for their individual needs. 
When used for flood disaster management, the selection of the week is more appropriate, taking into account the carrying capacity of soil absorption. While the range of the planting season can use a monthly period, RNN can adapt to flexible classes [29]. It can true period or pseudo period. In the meantime, we used rainfall predictions with a certain range of 10 pre-determined classes, i.e. < 60 mm, 60-120, 120-180, 180-240, 240-300, 300-360, 360-420, 420-480, 480-540, and > 540 mm as shown in Figure 1. Then go to the second step with RNN. The dropout layers used to minimize the number of input neurons 0.5 probability that input neuron to the next step is 480. The third step is LSTM layer 2, with the input dropout layer using (2)-(7). The fourth step is the dense layer using the sigmoid function, where the final result from the previous is entered into (1) to produce a new weight. 2.3. Recurrent neural networks Deep learning techniques have been successfully applied to solve many problems in climate and geoscience using massive-scaled observed and modeled data [31]. One method of deep learning is RNN. Previous research proposed training three models of deep-learning: RNN, conditional restricted boltzmann machine (CRBM), and CNN using ENSO and Weather dataset [15]. The best accuracy was RNN until 84% of
  • 4.  ISSN: 1693-6930 TELKOMNIKA Telecommun Comput El Control, Vol. 18, No. 5, October 2020: 2525 - 2532 2528 precipitation forecasting using a deep belief network called DBNPF [6]. The research established the characteristics of environmental factors and future precipitation. Other studies proposed to predict the extreme weather using Artificial Neural Network and Support Vector Machines [32]. RNN works resemble the workings of the brain, such as sending and receiving information. Similarly, the work of the RNN brain can also send and receive data from a neuron to another neuron. In general, the human brain is used to make decisions, and in the process of making a rational decision, often takes into account the past. RNN is essentially an artificial neural network that uses recurrence by utilizing past data. RNN can predict a situation in the future [15], and can also classify [33]. RNN has an architecture that can be used for data in the form of sequence or list, as shown in Figure 2. RNN is a modification of the Feedforward Neural Network with the characteristic of using feedback from output to input. Weather Training Data Weight Training using Recurrent Neural Networks LSTM Layer 1 (Relu) Dropout Layer (0.2) LSTM Layer 2 (Sigmoid) Dense Layer (sigmoid) Weather Data Weekly or monthly Prediction Rainfall using Recurrent Neural Networks LSTM Layer 1 (Relu) Dropout Layer (0.2) LSTM Layer 2 (Sigmoid) Dense Layer (sigmoid) <60 60-120 120-180 180-240 240-300 300-360 360-420 420-480 480-540 >540 Pre-Processing Missing data handling Convert to periode Normalize Pre-Processing Missing data handling Convert to periode Normalize Interval precipitation prediction (ten classes) Figure 1. Precipitation prediction model yt h1 U V W = xt y0 h0 U V W x0 y1 h1 U V W x1 y2 h2 U V W x2 ... yt-1 ht U V W xt-1 yt Figure 2. RNNs architecture In the Architecture of RNNs, there are several connections from one neuron to one of the next neurons. RNN is processed in a sequence of time. 
So that each information has a relationship with one another. This way makes the RNN has a memory that functions to remember the results of the previous process that will be
  • 5. TELKOMNIKA Telecommun Comput El Control  Precipitation prediction using recurrent neural networks and… (Mishka Alditya Priatna) 2529 used in the next process. However, when the RNN processes quite a lot of data, it has difficulty in maintaining information from the previous steps. The first hidden layer has the weight obtained from the input layer, and at each layer will receive the weight of the prior layer. Then, the calculation of the next hidden layer uses the appropriate entity in the previous input and hidden layer. Meanwhile, the forecast for the output layer uses the last hidden layer. For the process of calculating the hidden layer using functions that can be seen in (2) and the calculation of output using softmax function which can be seen in (3). ℎ(𝑡) = tanh(𝑈𝑥(𝑡) + 𝑊ℎ(𝑡 − 1) + 𝑏) (2) 𝑦(𝑡) = 𝑠𝑜𝑓𝑡 𝑚𝑎𝑥(𝑉ℎ(𝑡) + 𝑐) (3) In this study, the data collection that was processed consists of various kinds of data sequences such as meteorology variables, so it is necessary to group data in the input layer to provide the training process. However, RNN has the disadvantage of short-term memory. Therefore, in making predictions using quite a lot of data, RNN has several variations to solve the problem. The gate is the gated recurrent unit (GRU), backpropagation through time (BPTT), and LSTM. This research used long short-term memory (LSTM) to overcome the vanishing gradient [34]. The RNNs training process is similar to neural network training using a backpropagation algorithm but with few cycles. The parameter that shared equally in every time step, so gradient for each output, does not only depend on a calculation from the current time step but also the previous one [35]. This research used LSTM gates to overcome the dependence of long-term process, which is often phenomena that occurred in sequential data processing, as shown in Figure 3. LSTM LSTM x + x x tanh tanhσσ σ Ct -1 St -1 t -1 t +1 Ct St x t St t Figure 3. 
LSTM architecture The key from LSTM is cell state with architecture marked by a horizontal line that flows from Ct-1 until Ct. LSTM can delete or add information to the cell state set by the structure, which is called the gate. Gate is a method to pass the information, which consists of biner sigmoid function () and multiplication operation with x. Biner Sigmoid function is as shown in (4). The first step in LSTM is deciding what information will be disposed of from the cell state called forget gate used (5) [34]. 𝑓(𝑥) = 1 1+𝑒−𝑥 (4) 𝑓𝑡 = 𝜎(𝑊𝑓[ℎ 𝑡−1, 𝑥𝑡] + 𝑏𝑓) (5) where ht-1 and xt values, are in 0-1 interval every cell. Use of 1, which represents this information, is kept, and 0, which expresses this information is deleted. The second is deciding recent data stored in the cell. This step is divided into two parts. First, Input gate will determine values that will be updated using (6) and calculation for recent cell candidate (𝐶̂𝑡) which; 𝑖𝑡 = 𝜎(𝑊𝑓[ℎ 𝑡−1, 𝑥𝑡] + 𝑏𝑖) (6) 𝐶̂𝑡 = 𝑡𝑎𝑛ℎ(𝑊𝑐[ℎ 𝑡−1, 𝑥𝑡] + 𝑏𝑐) (7) There will be added to the old cell state (Ct) with a cell (𝐶̂𝑡) candidate using (8). The former cell state multiplicated by forgetting state. And cell candidate multiplicated with the input gate. It updated the cell state. tiCfC Ctttt ~ 1 ** += − (8)
The last step is the output gate, which determines the output that will be produced based on the updated cell state. The gate is calculated with the binary sigmoid function as shown in (9), and its result is then multiplied by the activation function of the updated cell state using (10). Some previous research used sigmoid and tanh [35].

$$o_t = \sigma(W_o [h_{t-1}, x_t] + b_o) \tag{9}$$

$$h_t = o_t * \tanh(C_t) \tag{10}$$

3. RESULTS AND ANALYSIS

Experiments on the precipitation prediction models were carried out with variations in the interval of training data (ten years and five years) and in the prediction time (monthly and weekly), with the RNN configuration in Table 1. The dataset interval and the prediction duration were varied to find the optimal prediction. The experiments also compared the accuracy of two optimization models, adaptive moment estimation (Adam) and stochastic gradient descent (SGD). Accuracy is calculated by comparing the output class against the actual class label.

This research used two predictive models, one trained on ten years of weather data and one on five years. Both models were tested weekly and monthly. The accuracy obtained is shown in Table 2, in Figure 4 for the 10-year dataset, and in Figure 5 for the 5-year dataset. The simulation used the two optimizer models (Adam and SGD) to correct the weights over 200 epochs.

Table 1. RNN configuration

| Configuration | 10-years, weekly | 10-years, monthly | 5-years, weekly | 5-years, monthly |
|---|---|---|---|---|
| Dataset | 1872 | 432 | 3744 | 864 |
| Input | 4160 | 960 | 2080 | 480 |
| Hidden | 4160 | 64 | 2080 | 64 |
| Dropout | 0.2 | 0.2 | 0.2 | 0.2 |
| Dense | 13 | 13 | 13 | 13 |
| Output layer | 10 | 10 | 10 | 10 |

Table 2. Accuracy (%) of precipitation prediction by dataset and prediction duration

| Model | 10-years (432 data set), weekly, training | 10-years, weekly, test | 10-years, monthly, training | 10-years, monthly, test | 5-years (864 data set), weekly, training | 5-years, weekly, test | 5-years, monthly, training | 5-years, monthly, test |
|---|---|---|---|---|---|---|---|---|
| Adam | 99.21 | 80.95 | 96.60 | 73.25 | 99.60 | 85.71 | 99.21 | 83.33 |
| SGD | 97.22 | 65.07 | 91.66 | 61.42 | 96.42 | 79.36 | 92.85 | 75.21 |

Figure 4. The accuracy of the 10-year, 432 data set: (a) weekly, (b) monthly

Figure 5. The accuracy of the 5-year, 864 data set: (a) weekly, (b) monthly

This study looks for representative data sets that capture the patterns of rainfall occurrence. Table 2 shows that an interval of five years of weather data is enough to provide better accuracy. The shorter time range contains many variations of training data, which leads to better accuracy. Based on Figure 5 and Table 2, the 5-year data set provides a test accuracy of 83.33% for monthly and 85.71% for weekly prediction, while the 10-year data set in Figure 4 gave 73.25% for monthly and 80.95% for weekly prediction. This result strengthens the hypothesis that the amount of training data is more dominant than the number of variables that are processed.

Regarding the prediction periods in Table 2, weekly predictions have better accuracy (85.71%), given that the day-to-day variation of meteorological data within a week is not too large. Meanwhile, daily meteorology varies more within a month, so combining the data over a month produces a higher deviation. However, the model used to predict rain for a month is more robust when the available weather data is incomplete.

Precipitation prediction depends on the training data sequence, the configuration, the method, and the forecast period. In this study, Adam provided better accuracy than SGD, considering that Adam aggregates information during training (it keeps running averages of past gradients). In contrast, SGD updates the weights from only one or a few randomly selected samples at a time, so it is more likely to settle in a local minimum because a single update does not represent all data in each class. The weekly forecast offers higher accuracy than the monthly forecast because meteorological data within a week is more homogeneous than data within a month. Consistent with Table 2, the 5-year data set (864 data) provides better accuracy than the 10-year data set (432 data), considering it provides more variation in the training data. The best accuracy was 85.71%, obtained for weekly prediction using RNN and LSTM with the 5-year training data. These results are comparable with MLP at 75-92% [21] and with heuristically optimized RNN at 59-84.6% [15].

4. CONCLUSION

In this work, we studied how multiple weather variables can be used to predict rainfall weekly and monthly. The proposed recurrent neural network with LSTM gave an accuracy of 85.71% on test data. The research showed that five years of weather data can be used to predict precipitation weekly and monthly. Weather data from the last five years can predict rainfall for a month of the following year. However, weekly predictions have higher accuracy. The experimental results also show that a larger data set can improve accuracy. In future work, the proposed method should be compared with the results of a public weather forecast center to demonstrate the effectiveness of the model. The output could also be predicted as a value, which would improve on the output class used in this study. Because current rainfall is uncertain, predictions should not only pay attention to past data patterns but also consider extreme phenomena such as El Niño as additional features.

REFERENCES

[1] R. Hidayat, M. Juniarti, and U. Marufah, “Impact of La Niña and La Niña Modoki on Indonesia Rainfall Variability,” Earth and Environmental Science, vol. 149, no. 1, pp. 217-222, 2017.
[2] Supari, F. Tangang, E. Salimun, E. Aldrian, A. Sopaheluwakan, and L. Juneng, “ENSO Modulation of Seasonal Rainfall and Extremes in Indonesia,” Climate Dynamics, vol. 51, no. 7, pp. 1-22, 2017.
[3] E. Mulyana, “Relationship between ENSO With Variations Cheap Rain in Indonesia,” Jurnal Sains & Teknologi Modifikasi Cuaca, vol. 3, pp. 1-4, 2002.
[4] S. Cramer, M. Kampouridis, et al., “An Extensive Evaluation of Seven Machine Learning Methods for Rainfall Prediction in Weather Derivatives,” Expert Systems with Applications, vol. 85, pp. 169-181, 2017.
[5] Y. Liu, E. Racah, J. Correa, et al., “Application of Deep Convolutional Neural Networks for Detecting Extreme Weather in Climate Datasets,” International Conference on Advances in Big Data Analytics, pp. 81-88, 2016.
[6] P. Zhang, L. Zhang, et al., “A Deep-Learning Based Precipitation Forecasting Approach Using Multiple Environmental Factors,” 2017 IEEE 6th International Congress on Big Data, BigData Congress, pp. 193-200, 2017.
[7] N. Sinha, B. Purkayastha, and L. Marbaniang, “Weather Prediction by Recurrent Neural Network Dynamics,” International Journal Intelligent Engineering Informatics, vol. 2, no. 2/3, pp. 17-80, 2014.
[8] S. G. Gouda, Z. Hussein, S. Luo, and Q. Yuan, “Model selection for accurate daily global solar radiation prediction in China,” Journal of Cleaner Production, vol. 221, pp. 132-144, 2019.
[9] I. Tanaka and H. Ohmori, “Method Selection in Different Regions for Short-Term Wind Speed Prediction in Japan,” SICE Annual Conference, vol. 2, pp. 189-194, 2015.
[10] H. K. Kim, “Temperature Prediction Using the Missing Data Refinement Model Based on a Long Short-Term Memory Neural Network,” Atmosphere, vol. 10, no. 11, pp. 1-16, 2019.
[11] F. Li, G. Ren, and J. Lee, “Multi-step wind speed prediction based on turbulence intensity and hybrid deep neural networks,” Energy Conversion and Management, vol. 186, pp. 306-322, 2019.
[12] S. Moon, Y. Kim, Y. Hee, and B. Moon, “Application of Machine Learning to an Early Warning System for Very Short-Term Heavy Rainfall,” Journal of Hydrology, vol. 568, pp. 1042-1054, 2019.
[13] S. Yuan, X. Luo, B. Mu, J. Li, and G. Dai, “Prediction of North Atlantic Oscillation Index with Convolutional LSTM Based on Ensemble Empirical Mode Decomposition,” Atmosphere, vol. 10, no. 252, pp. 2-13, 2019.
[14] E. P. Prasetya and E. C. Djamal, “Rainfall Forecasting for the Natural Disasters Preparation Using Recurrent Neural Networks,” 2019 International Conference on Electrical Engineering and Informatics (ICEEI), pp. 52-57, 2019.
[15] A. G. Salman, B. Kanigoro, and Y. Heryadi, “Weather Forecasting Using Deep Learning Techniques,” 2015 International Conference on Advanced Computer Science and Information Systems (ICACSIS), pp. 281-285, 2015.
[16] M. Kannan, S. Prabhakaran, and P. Ramachandran, “Rainfall Forecasting Using Data Mining Technique,” International Journal of Engineering and Technology, vol. 2, no. 6, pp. 397-400, 2010.
[17] S. K. Mohapatra, A. Upadhyay, and C. Gola, “Rainfall Prediction based on 100 years of Meteorological Data,” International Conference on Computing and Communication Technologies for Smart Nation (IC3TSN), pp. 162-166, 2017.
[18] C. Piani, J. O. Haerter, and E. Coppola, “Statistical bias correction for daily precipitation in regional climate models over Europe,” Theoretical and Applied Climatology, vol. 99, no. 1-2, pp. 187-192, 2010.
[19] D. Lealdi, S. Nurdiati, and A. Sopaheluwakan, “Statistical Bias Correction Modelling for Seasonal Rainfall Forecast for the Case of Bali Island,” Journal of Physics, vol. 1008, no. 1, pp. 1-10, 2018.
[20] B. K. Rani and A. Govardhan, “Rainfall Prediction Using Data Mining Techniques a Survey,” The Second International Conference on Information Technology Convergence and Services, pp. 23-30, 2013.
[21] P. Zhang, Y. Jia, J. Gao, W. Song, and H. Leung, “Short-term Rainfall Forecasting Using Multilayer Perceptron,” IEEE Transactions on Big Data, vol. 6, no. 1, pp. 93-106, 2018.
[22] M. A. Zaytar and C. El Amrani, “Sequence to Sequence Weather Forecasting with Long Short-Term Memory Recurrent Neural Networks,” International Journal of Computer Applications, vol. 143, no. 11, pp. 7-11, 2016.
[23] B. Zhao, X. Li, X. Lu, and Z. Wang, “A CNN-RNN Architecture for Multi-Label Weather Recognition,” Neurocomputing, vol. 322, pp. 45-57, 2018.
[24] X. Shi, Z. Chen, and H. Wang, “Convolutional LSTM Network: A Machine Learning Approach for Precipitation Nowcasting,” Advances in Neural Information Processing Systems, vol. 28, pp. 802-810, 2015.
[25] A. Mulyadi and E. C. Djamal, “Sunshine Duration Prediction Using 1D Convolutional Neural Networks,” 2019 6th International Conference on Instrumentation, Control, and Automation (ICA), 2019.
[26] M. Qiu, P. Zhao, K. Zhang, J. Huang, X. Shi, and X. Wang, “A short-Term Rainfall Prediction Model Using Multi-task Convolutional Neural Networks,” International Conference on Data Mining, pp. 395-404, 2017.
[27] F. J. Chang, P. A. Chen, Y. R. Lu, E. Huang, and K. Y. Chang, “Real-time Multi-Step-Ahead Water Level Forecasting by Recurrent Neural Networks for Urban Flood Control,” Journal of Hydrology, vol. 517, pp. 836-846, 2014.
[28] Y. Liu, C. Gong, L. Yang, and Y. Chen, “DSTP-RNN: a dual-stage two-phase attention-based recurrent neural network for long-term and multivariate time series prediction,” Expert Systems With Applications, vol. 143, 2020.
[29] Y. G. Cinar, H. Mirisaeea, P. Goswami, E. Gaussiera, and A.-A. Bachir, “Period-aware Content Attention RNNs for Time Series Forecasting with Missing Values,” Neurocomputing, vol. 312, pp. 177-186, 2018.
[30] F. R. Ningsih and E. C. Djamal, “Wind Speed Forecasting Using Recurrent Neural Networks and Long Short Term Memory,” 6th International Conference on Instrumentation, Control, and Automation (ICA), pp. 137-141, 2019.
[31] S. Kim, S. Ames, J. Lee, C. Zhang, and A. C. Wilson, “Resolution Reconstruction of Climate Data with Pixel Recursive Model,” International Conference on Data Mining Workshops, pp. 313-321, 2017.
[32] P. A. Chen, L. C. Chang, and F. J. Chang, “Reinforced recurrent neural networks for multi-step-ahead flood forecasts,” Journal of Hydrology, vol. 497, pp. 71-79, 2013.
[33] Y. Luo, “Recurrent Neural Networks for Classifying Relations in Clinical Notes,” Journal of Biomedical Informatics, vol. 72, pp. 85-95, 2017.
[34] S. Alhagry, A. A. Fahmy, and R. A. El-Khoribi, “Emotion Recognition based on EEG using LSTM Recurrent Neural Network,” International Journal of Advanced Computer Science and Applications, vol. 8, no. 10, pp. 356-358, 2017.
[35] S. Poornima and M. Pushpalatha, “Prediction of Rainfall Using Intensified LSTM Based Recurrent Neural Network with Weighted Linear Units,” Atmosphere, vol. 10, no. 11, pp. 1-18, 2019.

BIOGRAPHIES OF AUTHORS

Mishka Alditya Priatna received his bachelor's degree from the Department of Meteorology, Institut Teknologi Bandung, in 2019. E-mail: mishka.alditya@gmail.com

Esmeralda Contessa Djamal received a Bachelor's degree in Engineering Physics from Institut Teknologi Bandung in 1994 and a Master's degree in Instrument and Control from Institut Teknologi Bandung in 1998. She finished her doctoral program at Institut Teknologi Bandung in 2005, and since her Ph.D. dissertation her research has focused on EEG classification. She is a lecturer in the Department of Informatics, Universitas Jenderal Achmad Yani. E-mail: esmeralda.contessa@lecture.unjani.ac.id