TELKOMNIKA, Vol.17, No.3, June 2019, pp.1193~1201
ISSN: 1693-6930, accredited First Grade by Kemenristekdikti, Decree No: 21/E/KPT/2018
DOI: 10.12928/TELKOMNIKA.v17i3.12233
Received July 1, 2018; Revised October 24, 2018; Accepted November 30, 2018
Enhance interval width of crime forecasting with
ARIMA model-fuzzy alpha cut
Yaya Sudarya Triana*(1), Astari Retnowardhani(2)
(1) Faculty of Computer Science, Universitas Mercu Buana, Indonesia
(2) Information Systems Management Department, BINUS Graduate Program-Master of Information Systems
Management, Bina Nusantara University, Indonesia
*Corresponding author, e-mail: yaya.sudarya@mercubuana.ac.id (1), aretnowardhani@binus.edu (2)
Abstract
Better decisions can be made with reliable data or information. The interval width of a forecast
is one of the values that can support the decision making process in crime prevention.
However, in time series forecasting, and especially with the ARIMA model, the amount of historical
data available can affect the forecasting result, including the interval width. This study proposes a
combination technique to obtain a better interval width for crime forecasting. The proposed
technique combines the ARIMA model with the Fuzzy Alpha Cut (FAC). Three alpha values are
used: 0.3, 0.5, and 0.7. The experimental results show that ARIMA-FAC with alpha=0.5 is
appropriate. Overall, the interval width of crime forecasting with ARIMA-FAC is better than the
interval width of crime forecasting with the 95% CI of the ARIMA model.
Keywords: ARIMA, decision, FAC, forecasting, interval width
Copyright © 2019 Universitas Ahmad Dahlan. All rights reserved.
1. Introduction
In the digital era, information systems play a very important role in every organization [1].
Decision makers sometimes need forecasting data or information as a basis for making a
decision. A Decision Support System (DSS) allows decision makers to make appropriate
decisions that take into account the conditions that may occur. To predict future conditions in
the decision making process, statistical techniques such as forecasting methods are used. The
parameters of these methods can be adjusted to obtain a better forecasting system than in
previous research. Forecasting is involved in decision planning to help decision makers reach a
good decision. Using forecast data, a decision maker can take into account subsequent events in
a given situation, which in turn influences the final decision [2-4].
However, forecasting results may contain some uncertainty [4-6]. Uncertainty means that
more than one outcome is consistent with our expectations [7]. To deal with this, we can
estimate a range of forecasting values, which is called an interval forecast or interval
values [4]. These ranges indicate the best and worst situations that may occur. One of the
methods that has been discussed for this purpose is the ARIMA model, one of the most
popular models in time series forecasting analysis [8-11]. This model has the advantage of
giving an accurate forecast over a short time period. ARIMA has been widely used in many
different areas, such as the social, economic, engineering, crime prevention, and foreign
exchange fields [12-17]. A good interval analysis for the ARIMA model is required to determine
precisely the worst and the best possible forecasting conditions.
The accuracy of time series forecasting is important for many decision processes.
However, ARIMA models are limited by their data requirements. They need a large amount of
historical data: at least 50 records, and preferably 100 records or more. In some situations,
however, we have to forecast future situations using few data points over a short time period.
As forecasting results, ARIMA models provide lower bound and upper bound values as well as
the forecasting values. The bounds are obtained from a confidence interval calculation. The
data limitation of ARIMA therefore affects both the forecasting results and the forecasting
interval results [12, 18, 19].
Fuzzy Alpha Cut (FAC) is also a technique for finding a range of values. Some researchers
have implemented FAC, such as [20, 21], who applied FAC to calculate the fuzzy
expected values of the possibility-probability distribution. FAC was used to evaluate
earned value in [22]. In [23], the fuzzification of the variable CP (cost of electricity
production) allows an analysis via FAC. However, the use of FAC in the calculation of
forecasting intervals, especially for the ARIMA model, has been little explored. To improve the
interval forecasting result of ARIMA, this study therefore proposes a combination of the ARIMA
model and FAC. By combining FAC with ARIMA models, better forecasting range values are
expected. The proposed combination was tested on index motorcycle crime data.
2. Research Method
2.1. ARIMA Model
The ARIMA model aims to describe the current behavior of variables in terms of their
linear relationships with historical data. It can be decomposed into two parts. First, it has an
integrated (I) component (d), which represents the amount of differencing to be performed on
the series to make it stationary. The second component is an ARMA model for the series
rendered stationary through differencing. The ARMA component is further decomposed into
AR and MA components. The autoregressive (AR) component (p) captures the correlation
between the current value of the time series and some of its past values. The moving average
(MA) component (q) represents the duration of the influence of a random shock. The
Autocorrelation Function (ACF) and Partial Autocorrelation Function (PACF) are then used to
estimate the values of p and q [15]. The ARMA (p, q) has the general form:
$$y_t = \theta_0 + \phi_1 y_{t-1} + \phi_2 y_{t-2} + \cdots + \phi_p y_{t-p} + \varepsilon_t - \theta_1 \varepsilon_{t-1} - \cdots - \theta_q \varepsilon_{t-q} \tag{1}$$
where y_t and ε_t are the actual value and random error at time period t, respectively, while
φ_i (i=1, 2,…, p) and θ_j (j=0, 1, 2,…, q) are model parameters. p and q are integers and are
often referred to as the orders of the model. The random errors, ε_t, are assumed to be
independently and identically distributed with a mean of zero and a constant variance of σ^2.
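As a minimal sketch of how equation (1) is evaluated for a single time step, the following Python function computes y_t from past values and past errors. The function name, argument names and the numbers in the usage note are illustrative assumptions, not taken from the study.

```python
def arma_value(y_hist, eps_hist, eps_t, theta0, phi, theta):
    """Evaluate equation (1) for one time step.

    y_hist[-1] is y_{t-1}, y_hist[-2] is y_{t-2}, and so on;
    eps_hist[-1] is eps_{t-1}. phi holds phi_1..phi_p and
    theta holds theta_1..theta_q (theta0 is the constant term).
    """
    # Autoregressive part: phi_1*y_{t-1} + ... + phi_p*y_{t-p}
    ar_part = sum(p * y_hist[-i] for i, p in enumerate(phi, start=1))
    # Moving average part: theta_1*eps_{t-1} + ... + theta_q*eps_{t-q}
    ma_part = sum(q * eps_hist[-j] for j, q in enumerate(theta, start=1))
    return theta0 + ar_part + eps_t - ma_part
```

For example, with hypothetical values `arma_value([10.0, 12.0], [0.5, -1.0], 0.2, 1.0, [0.5], [0.4])` evaluates an ARMA(1,1) step as 1.0 + 0.5·12.0 + 0.2 − 0.4·(−1.0).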
When one or more of the orders are zero, it is common to drop AR, I or MA from the name. For
example, an I(1) model is an ARIMA(0,1,0), and an MA(1) model is an ARIMA(0,0,1). Given a time
series of data X_t, where t is an integer index and the X_t are real numbers, an ARMA(p', q)
model is given by [24, 25]:
$$\left(1 - \sum_{i=1}^{p'} \alpha_i L^i\right) X_t = \left(1 + \sum_{i=1}^{q} \theta_i L^i\right) \varepsilon_t \tag{2}$$
where L is the lag operator, the α_i are the parameters of the autoregressive part of the model,
the θ_i are the parameters of the moving average part and the ε_t are error terms. The error terms
are generally assumed to be independent, identically distributed variables sampled from a
normal distribution with zero mean. Assume now that the polynomial in (3),
$$\left(1 - \sum_{i=1}^{p'} \alpha_i L^i\right) \tag{3}$$
has a unit root of multiplicity d. Then it can be rewritten as:
$$\left(1 - \sum_{i=1}^{p'} \alpha_i L^i\right) = \left(1 - \sum_{i=1}^{p'-d} \phi_i L^i\right) (1 - L)^d \tag{4}$$
An ARIMA (p, d, q) process expresses this polynomial factorization property with p = p' - d,
and is given by:
$$\left(1 - \sum_{i=1}^{p} \phi_i L^i\right) (1 - L)^d X_t = \left(1 + \sum_{i=1}^{q} \theta_i L^i\right) \varepsilon_t \tag{5}$$
It can thus be thought of as a particular case of an ARMA(p+d, q) process whose
autoregressive polynomial has d unit roots. For this reason every ARIMA model
with d>0 is not wide-sense stationary. ARIMA models are used for observable non-stationary
processes X_t that have some clearly identified trends:
a. Constant trend (i.e. a non-zero average) leads to d=1.
b. Linear trend (i.e. a linear growth behavior) leads to d=2.
c. Quadratic trend (i.e. a quadratic growth behavior) leads to d=3.
In these cases the ARIMA model can be viewed as a "cascade" of two models. The first
is non-stationary:
$$Y_t = (1 - L)^d X_t \tag{6}$$
while the second is wide-sense stationary:
$$\left(1 - \sum_{i=1}^{p} \phi_i L^i\right) Y_t = \left(1 + \sum_{i=1}^{q} \theta_i L^i\right) \varepsilon_t \tag{7}$$
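The differencing step in equation (6) can be sketched in a few lines of Python. This is an illustration under simple assumptions (a noise-free quadratic series, a hypothetical helper named `difference`), intended only to show how repeated application of (1 − L) removes a polynomial trend as in cases a to c above.

```python
def difference(series, d=1):
    """Apply the operator (1 - L) to the series d times, as in equation (6)."""
    out = list(series)
    for _ in range(d):
        # (1 - L) X_t = X_t - X_{t-1}; each pass shortens the series by one
        out = [curr - prev for prev, curr in zip(out, out[1:])]
    return out


# A purely quadratic trend X_t = t^2 (no noise), for illustration only
quadratic = [t * t for t in range(8)]
twice = difference(quadratic, d=2)   # constant, non-zero level remains
thrice = difference(quadratic, d=3)  # the trend is removed entirely
```

Here two differences leave a constant level and a third difference reduces the series to zero, which matches the statement that a quadratic trend leads to d=3.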
The order of the AR process is determined from the PACF graph and, similarly, the order of the
MA process is determined from the ACF graph. The patterns of the sample ACF and PACF
used to identify the model are summarized in Table 1.
Table 1. Summary of ACF and PACF Patterns
Model       ACF                    PACF
AR(p)       Dies down              Cuts off after lag p
MA(q)       Cuts off after lag q   Dies down
ARMA(p,q)   Dies down              Dies down
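To make the "cuts off" and "dies down" patterns of Table 1 concrete, the sample ACF can be computed directly from a series. The sketch below uses plain Python; the helper names are hypothetical, and the ±1.96/√n band used to flag significant lags is a common rule of thumb assumed here, not a rule stated in this paper.

```python
from math import sqrt


def sample_acf(x, max_lag):
    """Sample autocorrelation for lags 0..max_lag."""
    n = len(x)
    mean = sum(x) / n
    dev = [v - mean for v in x]
    denom = sum(d * d for d in dev)
    return [
        sum(dev[t] * dev[t - k] for t in range(k, n)) / denom
        for k in range(max_lag + 1)
    ]


def significant_lags(x, max_lag):
    """Lags whose sample ACF lies outside the approximate 95% band (assumed rule)."""
    bound = 1.96 / sqrt(len(x))
    acf = sample_acf(x, max_lag)
    return [k for k in range(1, max_lag + 1) if abs(acf[k]) > bound]
```

An ACF that has significant values only up to some lag q and none afterwards would be read as "cuts off after lag q" in Table 1; the PACF needs an additional recursion and is not shown here.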
In [8], the ARIMA procedure fits a model with a certain number of parameters and tests
the significance of those parameters, that is, whether each parameter is zero (null hypothesis,
H0) or different from zero (alternative hypothesis, H1). To test the significance of the
parameters considered in the model, the t-statistic and the P-value are used. The t-statistic is
used to determine the P-value. The significance level is set to α=0.05, which corresponds to a
95% confidence level. If the P-value is less than α, H0 is rejected. An ARIMA model whose
parameters have a P-value<α is therefore acceptable.
The next step is to check the residuals. The Ljung-Box test is used to test whether the first k
autocorrelations of the residuals are significantly different from what would be expected from a
white noise process. Using the usual significance level of α=0.05, a model passes this test if
P-value>α. Large P-values indicate that the residuals are not distinguishable from a white noise
series [19-21], [26-27].
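The Ljung-Box statistic behind this residual check can be sketched as follows, using its standard form Q = n(n+2) Σ ρ_k²/(n−k) over lags k=1..h. The function name is hypothetical, and the sketch stops at Q: turning Q into a P-value needs a chi-square distribution function, which the Python standard library does not provide, so a statistics package would normally be used for that step.

```python
def ljung_box_q(residuals, h):
    """Ljung-Box Q statistic over the first h residual autocorrelations."""
    n = len(residuals)
    mean = sum(residuals) / n
    dev = [r - mean for r in residuals]
    denom = sum(d * d for d in dev)
    total = 0.0
    for k in range(1, h + 1):
        # Sample autocorrelation of the residuals at lag k
        rho_k = sum(dev[t] * dev[t - k] for t in range(k, n)) / denom
        total += rho_k ** 2 / (n - k)
    return n * (n + 2) * total
```

A small Q (large P-value) is consistent with white-noise residuals; a large Q suggests that the fitted model has left autocorrelation in the residuals.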
2.2. Confidence Interval (CI)
The 95% CI technique is usually included in the forecasting process with an ARIMA
model [3, 12, 22, 25]. A CI is informative since it indicates, with a known degree
of confidence, the range of possible effects. In Tong, et al. [27], a (1-α) 100% CI for an unknown
parameter (e.g. the population mean) is an interval calculated from the sample data such that
(1-α) of the intervals will enclose the true parameter value. For example, a 95% CI is an
interval that encloses the true parameter with probability 0.95. This means the α value is 5% or
0.05. Suppose that {x_1,x_2,…,x_n} is a random sample drawn from a normal population with
unknown mean µ and unknown variance σ^2; then a (1-α) 100% CI for the true mean can be
constructed as follows:
$$\bar{x} \pm t_{\alpha/2} \frac{s}{\sqrt{n}} \tag{8}$$
where:
x̄ = the sample mean,
s = the sample standard deviation,
n = the sample size, and
t_(α/2) = the value corresponding to an upper-tail area of α/2 in the t distribution with n-1
degrees of freedom (df).
If the sample size n is large (n≥30), then by the Central Limit Theorem [24] x̄
will be approximately normally distributed regardless of the distribution of the sampled
population. Therefore, the CI is constructed as follows:
$$\bar{x} \pm z_{conf} \, \sigma_{\bar{x}} \tag{9}$$
where:
x̄ = the sample mean,
z_conf = a number from the standard normal table that satisfies the confidence
specification for the confidence interval,
σ_x̄ = the standard error of the mean.
The interval can also be written as separate limits:
$$x_{lo} = \bar{x} - z_{conf} \, \sigma_{\bar{x}} \tag{10}$$
and
$$x_{up} = \bar{x} + z_{conf} \, \sigma_{\bar{x}} \tag{11}$$
where:
x_lo = the lower limit value,
x_up = the upper limit value.
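The large-sample interval of equations (9) to (11) can be sketched with the Python standard library. Two assumptions are made in this illustration: the standard error is estimated as s/√n from the sample standard deviation, and the function name `z_interval` is hypothetical.

```python
from math import sqrt
from statistics import NormalDist, mean, stdev


def z_interval(sample, confidence=0.95):
    """Return (x_lo, x_up) following equations (10) and (11)."""
    n = len(sample)
    x_bar = mean(sample)
    # Standard error of the mean, estimated from the sample (assumption)
    std_err = stdev(sample) / sqrt(n)
    # z_conf: two-sided critical value of the standard normal distribution
    z_conf = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    return x_bar - z_conf * std_err, x_bar + z_conf * std_err
```

With `confidence=0.95`, z_conf is approximately 1.96, the value usually read from the standard normal table.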
2.3. Fuzzy Alpha Cut
The proposed technique combines the ARIMA model and the FAC technique, as shown in
Figure 1. This combination aims to obtain a more accurate interval forecasting result.
The process is divided into two parts. The first part uses the Box-Jenkins methodology to find
the ARIMA model. In the second part, the forecasting results and confidence intervals from the
ARIMA model are converted into triangular fuzzy numbers (TFNs), and the alpha-cut arithmetic
operation is then applied to these fuzzy values.
Figure 1. Combination Technique of ARIMA model and FAC
The membership function of the TFN is defined as:
$$\mu_{\tilde{A}}(x) = \begin{cases} 1, & x = \tilde{x}_t(n) \\ \dfrac{x - \tilde{X}_t^{lo}}{\tilde{x}_t(n) - \tilde{X}_t^{lo}}, & \tilde{X}_t^{lo} \le x \le \tilde{x}_t(n) \\ \dfrac{\tilde{X}_t^{up} - x}{\tilde{X}_t^{up} - \tilde{x}_t(n)}, & \tilde{x}_t(n) \le x \le \tilde{X}_t^{up} \\ 0, & \text{otherwise} \end{cases}$$
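The membership function above translates directly into code. The sketch below is illustrative: the function and argument names are hypothetical, with `lower`, `forecast` and `upper` standing for the lower bound, the forecast value and the upper bound of the TFN.

```python
def tfn_membership(x, lower, forecast, upper):
    """Membership degree of x in the triangular fuzzy number (lower, forecast, upper)."""
    if x < lower or x > upper:
        return 0.0
    if x == forecast:
        return 1.0
    if x < forecast:
        # Rising edge between the lower bound and the forecast
        return (x - lower) / (forecast - lower)
    # Falling edge between the forecast and the upper bound
    return (upper - x) / (upper - forecast)
```

Using the June 2010 row of Table 3 (forecast 16.47, lower 5.79, upper 27.16), the forecast itself has membership 1, both bounds have membership 0, and a value halfway between the lower bound and the forecast has membership 0.5.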
Then, to obtain the lower and upper limits for α ∈ [0,1] from the membership function of the TFN
above, the proposed ARIMA-FAC technique uses the equations in Table 2:
Table 2. Proposed Model ARIMA-FAC
FAC                        ARIMA-FAC
Ã = [Lo, Up]           →   Ã_i = [LoFAC, UpFAC]
Lo = α(b − a) + a      →   LoFAC = α(x̃_t(n) − X̃_t^lo) + X̃_t^lo
Up = −α(c − b) + c     →   UpFAC = −α(X̃_t^up − x̃_t(n)) + X̃_t^up
The proposed integration of the technique proceeds as follows:
- Part I: Fit the ARIMA model using the available historical data to obtain the forecasting
values. The lower (Lo) and upper (Up) values are obtained using the 95% CI. The results from
Part I become the input data sets for Part II.
- Part II: The forecasting values from Part I, as crisp sets, are fuzzified into TFNs. These
TFN values are then used in the FAC arithmetic operation (Table 2) to find the lower (Lo) and
upper (Up) FAC values.
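Part II can be sketched as a single function that applies the alpha-cut to one forecast and its 95% CI bounds, using Lo = α(b − a) + a and Up = −α(c − b) + c with a as the lower bound, b as the forecast and c as the upper bound. The function name and the range check on α are illustrative assumptions; the input numbers in the usage note are the June 2010 row of Table 3.

```python
def arima_fac_bounds(lower, forecast, upper, alpha):
    """Alpha-cut of the TFN (lower, forecast, upper); returns (LoFAC, UpFAC)."""
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    lo_fac = alpha * (forecast - lower) + lower
    up_fac = -alpha * (upper - forecast) + upper
    return lo_fac, up_fac
```

For the June 2010 forecast (16.47, with CI bounds 5.79 and 27.16) and α=0.3, this gives bounds that round to the 9.00 and 23.95 reported for ARIMA-FAC in the 82-month experiment. With α=0 the CI is returned unchanged, and with α=1 both bounds collapse onto the forecast.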
3. Results and Analysis
The first experiment used 82 months of index crime data, in line with the requirements of
ARIMA models. In Part I, three candidate ARIMA models were obtained from this experiment:
ARIMA (1,0,0), ARIMA (1,1,0), and ARIMA (0,1,1). ARIMA (1,0,0) was found to be the best
model because it has a P-value less than 0.05 and the smallest MSE of the three models, 29.71.
The forecasting process was then carried out using this model. The forecasting result is shown
in Table 3.
Table 3. Forecasting Results with ARIMA (1,0,0) for Neighborhood C (82 Months Data)
Period Forecast Lower Upper Actual
June 2010 16.47 5.79 27.16 17.00
July 2010 15.03 3.99 26.07 14.00
Aug 2010 14.65 3.59 25.72 15.00
Sept 2010 14.56 3.49 25.62 20.00
Oct 2010 14.53 3.46 25.60 16.00
The forecasting values, with the lower and upper values of the confidence interval (CI), are
plotted in Figure 2. In Part II, the forecasting values from Part I are transformed into TFNs.
Each forecasting value, together with its lower and upper values, forms the triplet (a, b, c) of
a TFN, with the lower value as a, the forecast as b and the upper value as c. After that, the
ARIMA-FAC equations in Table 2 are used with α-cut values of 0.3, 0.5 and 0.7 to find the new
lower and upper bounds of the forecasting range. The results of Part II for the 82 months of
data are presented in Table 4.
Table 4 presents the lower and upper values of ARIMA-FAC with α=0.3, α=0.5,
and α=0.7. These lower and upper values are used to find the interval width. The second
experiment used 51 months of index crime data, in line with the requirements of ARIMA
models. In Part I, two candidate ARIMA models were obtained from this experiment:
ARIMA (1,1,0) and ARIMA (0,1,1). ARIMA (1,1,0) was found to be the best model. The model
also has a P-value less than 0.05, and its MSE value is 38.90. The forecasting process was then
carried out using this model. The forecasting result is shown in Table 5. The forecasting
values, with the lower and upper values of the confidence interval (CI), are plotted in Figure 3.
Figure 2 Plot of forecasting values from ARIMA (1,0,0)
Table 4. ARIMA-FAC Results for Neighborhood C Using 82 Months of Data, with α=0.3, α=0.5 and α=0.7
Period       ARIMA-FAC α=0.3    ARIMA-FAC α=0.5    ARIMA-FAC α=0.7
             lower    upper     lower    upper     lower    upper
June 2010     9.00    23.95     11.13    21.82     13.27    19.68
July 2010     7.30    22.76      9.51    20.55     11.72    18.34
Aug 2010      6.91    22.40      9.12    20.19     11.33    17.97
Sept 2010     6.81    22.30      9.02    20.09     11.23    17.88
Oct 2010      6.78    22.28      9.00    20.06     11.21    17.85
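The interval width and the coverage of the actual values for the 82-month experiment can be checked with a short script. The rows below are copied from Table 3; the helper names are hypothetical, and the bounds are recomputed from the rounded Table 3 values, so they may differ from Table 4 in the last digit.

```python
# Rows from Table 3: (period, forecast, lower, upper, actual)
TABLE_3 = [
    ("June 2010", 16.47, 5.79, 27.16, 17.00),
    ("July 2010", 15.03, 3.99, 26.07, 14.00),
    ("Aug 2010", 14.65, 3.59, 25.72, 15.00),
    ("Sept 2010", 14.56, 3.49, 25.62, 20.00),
    ("Oct 2010", 14.53, 3.46, 25.60, 16.00),
]


def fac_interval(lower, forecast, upper, alpha):
    """ARIMA-FAC bounds for one period (alpha-cut of the TFN)."""
    return alpha * (forecast - lower) + lower, -alpha * (upper - forecast) + upper


def width_and_coverage(rows, alpha):
    """For each period: (period, interval width, actual inside interval?)."""
    summary = []
    for period, forecast, lower, upper, actual in rows:
        lo_fac, up_fac = fac_interval(lower, forecast, upper, alpha)
        summary.append((period, up_fac - lo_fac, lo_fac <= actual <= up_fac))
    return summary


def missed_periods(rows, alpha):
    """Periods whose actual value falls outside the ARIMA-FAC interval."""
    return [p for p, _, covered in width_and_coverage(rows, alpha) if not covered]
```

Subtracting the two bound formulas shows that the ARIMA-FAC width is (1 − α) times the CI width, so a larger α always gives a narrower interval. On these rows, `missed_periods` returns an empty list for α=0.3 and α=0.5, while for α=0.7 the September 2010 actual value of 20.00 lies above the upper bound.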
Table 5. Forecasting Results with ARIMA (1,1,0) for Neighborhood C (51 months data)
Period Forecast Lower Upper Actual
June 2010 17.51 5.29 29.74 17.00
July 2010 19.61 5.87 33.35 14.00
Aug 2010 18.49 1.98 35.01 15.00
Sept 2010 18.95 0.70 37.19 20.00
Oct 2010 18.63 -1.47 38.74 16.00
Figure 3 Plot of forecasting values from ARIMA (1,1,0)
In Part II, the forecasting values from Part I are transformed into TFNs. Each forecasting
value, together with its lower and upper values, forms the triplet (a, b, c) of a TFN. After
that, the ARIMA-FAC equations in Table 2 are used with α-cut values of 0.3, 0.5 and 0.7 to find
the new lower and upper bounds of the forecasting range. The results of Part II for the
51 months of data are presented in Table 6.
Table 6. ARIMA-FAC Results for Neighborhood C Using 51 Months of Data, with α=0.3, α=0.5 and α=0.7
Period       ARIMA-FAC α=0.3    ARIMA-FAC α=0.5    ARIMA-FAC α=0.7
             lower    upper     lower    upper     lower    upper
June 2010     8.95    26.07     11.40    23.63     13.85    21.18
July 2010     9.99    29.23     12.74    26.48     15.49    23.73
Aug 2010      6.93    30.05     10.23    26.75     13.54    23.45
Sept 2010     6.17    31.72      9.82    28.07     13.47    24.42
Oct 2010      4.56    32.70      8.58    28.68     12.60    24.66
The third experiment used 32 months of index crime data, in line with the requirements of
ARIMA models. In Part I, two candidate ARIMA models were obtained from this experiment:
ARIMA (1,1,0) and ARIMA (0,1,1). ARIMA (0,1,1) was found to be the best model. The model
has a P-value less than 0.05 and the smallest MSE, 33.072. Part I was therefore carried out
using this model. The forecasting result is shown in Table 7. The forecasting values, with the
lower and upper values of the confidence interval (CI), are plotted in Figure 4.
In Part II, the forecasting values from Part I are transformed into TFNs. Each forecasting
value, together with its lower and upper values, forms the triplet (a, b, c) of a TFN. After
that, the ARIMA-FAC equations in Table 2 are used with α-cut values of 0.3, 0.5 and 0.7 to find
the new lower and upper bounds of the forecasting range. The results of Part II for the
32 months of data are presented in Table 8.
Figure 4 Plot of forecasting values from ARIMA (0,1,1)
Table 7. Forecasting Results with ARIMA (0,1,1) for Neighborhood C (32 Months Data)
Period Forecast Lower Upper Actual
June 2010 14.44 3.17 25.72 17.00
July 2010 14.42 3.13 25.71 14.00
Aug 2010 14.39 3.08 25.70 15.00
Sept 2010 14.37 3.04 25.70 20.00
Oct 2010 14.34 3.00 25.69 16.00
Table 8. ARIMA-FAC Results for Neighborhood C Using 32 Months of Data, with α=0.3, α=0.5 and α=0.7
Period       ARIMA-FAC α=0.3    ARIMA-FAC α=0.5    ARIMA-FAC α=0.7
             lower    upper     lower    upper     lower    upper
June 2010     6.55    22.34      8.81    20.08     11.06    17.83
July 2010     6.52    22.32      8.77    20.07     11.03    17.81
Aug 2010      6.48    22.31      8.74    20.05     11.00    17.79
Sept 2010     6.44    22.30      8.71    20.03     10.97    17.77
Oct 2010      6.40    22.29      8.67    20.02     10.94    17.75
3.1. The Comparison of Interval Forecasting
The results in the previous section show that different α values affect the lower and upper
bound values. The proposed combined technique aims to find forecasting range values closer
to the actual values. Two comparisons are made. The first compares the MSE of the CI range of
the ARIMA model with the MSE of ARIMA-FAC. The second compares the interval width of the
CI of the ARIMA model with that of ARIMA-FAC. A small MSE value is expected, since a
smaller value indicates better forecasting performance. The MSE values of the interval
forecasts from the CI of the ARIMA model and from ARIMA-FAC with α=0.3, α=0.5 and α=0.7,
for neighborhood C with different amounts of index crime data, are compared in Table 9,
Table 10 and Table 11. The MSE values of ARIMA-FAC are much lower than those of the CI of
the ARIMA model.
Table 9. MSE Forecasting Range Results (Neighborhood C, 82 Months of Data)
Bound   ARIMA    ARIMA-FAC α=0.3   ARIMA-FAC α=0.5   ARIMA-FAC α=0.7
lower   157.18    86.66             51.75             26.47
upper    97.51    44.91             21.91              8.57
Table 10. MSE Forecasting Range Results (Neighborhood C, 51 Months of Data)
Bound   ARIMA    ARIMA-FAC α=0.3   ARIMA-FAC α=0.5   ARIMA-FAC α=0.7
lower   210.12    93.59             42.86             13.70
upper   349.87   191.41            112.73             55.62
Table 11. MSE Forecasting Range Results (Neighborhood C, 32 Months of Data)
Bound   ARIMA    ARIMA-FAC α=0.3   ARIMA-FAC α=0.5   ARIMA-FAC α=0.7
lower   181.61   102.76             62.98             33.44
upper    90.81    39.20             17.58              6.20
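The MSE values in these tables can be read as the mean squared difference between a bound and the actual value over the five forecast periods. That reading is an assumption about the calculation, but the sketch below, which uses the bounds and actual values from Table 3 and a hypothetical helper `bound_mse`, reproduces the ARIMA column of Table 9 under it.

```python
def bound_mse(bounds, actuals):
    """Mean squared difference between interval bounds and actual values."""
    return sum((b - a) ** 2 for b, a in zip(bounds, actuals)) / len(actuals)


# Values from Table 3 (82 months of data, June to October 2010)
actual = [17.00, 14.00, 15.00, 20.00, 16.00]
ci_lower = [5.79, 3.99, 3.59, 3.49, 3.46]
ci_upper = [27.16, 26.07, 25.72, 25.62, 25.60]

lower_mse = round(bound_mse(ci_lower, actual), 2)  # 157.18
upper_mse = round(bound_mse(ci_upper, actual), 2)  # 97.51
```

Because the ARIMA-FAC bounds move toward the forecast as α grows, this measure decreases with α, which is why it needs to be read together with whether the actual values still fall inside the interval.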
4. Conclusion
ARIMA and ARIMA-FAC have been compared on the basis of graphs, error measurement values
and interval width. In the error comparison, the best results were obtained from ARIMA-FAC
with α=0.7. However, the ARIMA-FAC graphs show that α=0.3 and α=0.7 are not appropriate.
The lower and upper bounds of ARIMA-FAC with α=0.3 are not much closer to the actual
values. The lower and upper bounds of ARIMA-FAC with α=0.5 are the closest to the actual
values. With α=0.7, the bounds do not come closer to the actual values and in some cases are
even less than the actual value, so that the actual value is not in the interval range. According
to Zhou, et al. [17], the accuracy of forecasting is better if the actual value falls within the
interval range. Therefore, α=0.3 and α=0.7 do not give results in accordance with the purpose
of the proposed combined technique, and ARIMA-FAC with α=0.5 is appropriate.
The error measurement comparisons agree with the graphical results: the error measurements
of ARIMA-FAC are better than those of ARIMA. The interval widths are also better for
ARIMA-FAC, since the use of FAC gives a narrower interval. Therefore, the results from
ARIMA-FAC will be used for analyzing decision options in crime prevention.
References
[1] Sadikin M, Purwanto. The Implementation of E-learning System Governance to Deal with User Need,
Institution Objective, and Regulation Compliance. TELKOMNIKA Telecommunication Computing
Electronics and Control. 2018; 16(3): 1332-1344.
[2] Adriansyah, Andi. Enhanced Neuro-Fuzzy Architecture for Electrical Load Forecasting,
TELKOMNIKA Telecommunication Computing Electronics and Control. 2010; 8(2): 97-106.
[3] Waddell D, Sohal AS. Forecasting: The Key to Managerial Decision Making. Management Decision,
1994; 32: 41–49.
[4] Kim J. Bootstrap-after-bootstrap prediction intervals for autoregressive models. Journal of Business &
Economic Statistics. 2001; 19(1): 117–128.
[5] Hansen BE. Interval forecasts and parameter uncertainty. Journal of Econometrics. 2006; 135:
377–398.
[6] Khosravi A, Nahavandi S, Creighton D. A neural network-GARCH-based method for construction of
Prediction Intervals. Electric Power Systems Research. 2013; 96: 185–193.
[7] Pielke Jr RA. The role of models in prediction for decision. Cary Conference Discussion Draft.
Understanding Ecosystems: The Role of Quantitative Models in Observations, Synthesis, and
Prediction. 2001; 1–126.
[8] Ediger VŞ, Akar S, Uǧurlu B. Forecasting production of fossil fuel sources in Turkey using a
comparative regression and ARIMA model. Energy Policy. 2006; 34(18): 3836–3846.
[9] Kumar U, Jain VK. ARIMA forecasting of ambient air pollutants (O3, NO, NO2 and CO). Stochastic
Environmental Research and Risk Assessment. 2010; 24(5): 751–760.
[10] Ramos P, Santos N, Rebelo R. Performance of state space and ARIMA models for consumer
retail sales forecasting. Robotics and Computer-Integrated Manufacturing. 2015.
[11] Zhang GP. Time series forecasting using a hybrid ARIMA and neural network model.
Neurocomputing. 2003; 50: 159–175.
[12] Khashei M, Bijari M, Raissi Ardali GA. Improvement of Auto-Regressive Integrated Moving Average
models using Fuzzy logic and Artificial Neural Networks (ANNs). Neurocomputing. 2009; 72:
956-967.
[13] Cheng C-H. Evaluating weapon systems using ranking fuzzy numbers. Fuzzy Sets and Systems.
1999; 107(1): 25–35.
[14] Wankhade R, Mahalle S, Gajbhiye S, Bodade VM. Use of the ARIMA Model for Forecasting Pigeon
Pea Production in India. International Review of Business and Finance. 2010; 2(1): 97–102.
[15] Weisang G. Vagaries of the Euro : an Introduction to ARIMA Modeling. Vagaries of the Euro. 2008;
2(1), 45–55.
[16] Wang C-C. A comparison study between fuzzy time series model and ARIMA model for forecasting
Taiwan export. Expert Systems with Applications. 2011; 38(8): 9296–9304.
[17] Zhou M, Yan Z, Ni YX, Li G, Nie Y. Electricity price forecasting with confidence-interval estimation
through an extended ARIMA approach. IEEE Proceedings-Generation, Transmission and
Distribution. 2006; 2:187-195.
[18] Tseng FM, Tzeng GH, Yu HC, Yuan BJC. Fuzzy ARIMA model for forecasting the foreign exchange
market. Fuzzy Sets and Systems. 2001; 118(1): 9–19.
[19] Asadi S, Tavakoli A, Hejazi SR. A new hybrid for improvement of auto-regressive integrated
moving average models applying particle swarm optimization. Expert Systems with Applications.
2012; 39(5): 5332–5337.
[20] Zou Q, Zhou J, Zhou C, Song L, Guo J, Liu Y. The practical research on flood risk analysis based on
IIOSM and fuzzy alpha-cut technique. Applied Mathematical Modelling. 2012; 36(7): 3271–3282.
[21] Feng LH, Luo GY. Analysis on fuzzy risk of landfall typhoon in Zhejiang province of China.
Mathematics and Computers in Simulation. 2009; 79(11): 3258–3266.
[22] Moslemi Naeni L, Salehipour A. Evaluating fuzzy earned value indices and estimates by applying
alpha cuts. Expert Systems with Applications. 2011; 38(7): 8193–8198.
[23] Neto JC do L, da Costa Junior CT, Bitar SDB, Junior WB. Forecasting of energy and diesel
consumption and the cost of energy production in isolated electrical systems in the Amazon using a
fuzzification process in time series models. Energy Policy. 2011; 39(9): 4947–4955.
[24] Noor Maizura Mohamad Noor, Astari Retnowardhani, Yaya Sudarya Triana, & Sami M. Helawani.
The Use of Fuzzy Alpha Cut to Improve the Forecasting Range Results of the ARIMA Model: Assist
Decision Making Process. International Journal of Information Processing and Management (IJIPM).
2013: 4(7): 59-71.
[25] Astari R, Yaya T. Classify interval range of crime forecasting for crime prevention decision making.
Knowledge, Information and Creativity Support Systems (KICSS), 11th International Conference on,
Yogyakarta, Indonesia. 2016.
[26] Witte Robert S, Witte J S. Statistics (8th ed.). John Wiley & Sons, Inc. 2007.
[27] Tong L-I, Chang C-W, Jin S-E, Saminathan R. Quantifying uncertainty of emission estimates in
National Greenhouse Gas Inventories using bootstrap confidence intervals. Atmospheric
Environment. 2012; 56: 80–87.

More Related Content

PDF
On Modeling Murder Crimes in Nigeria
PDF
working with python
PPTX
ModelSelection1_WCSMO_2013_Ali
PDF
Adjusting PageRank parameters and comparing results : REPORT
PDF
Adjusting PageRank parameters and comparing results : REPORT
PPTX
PEMF2_SDM_2012_Ali
PPTX
AIAA-Aviation-2015-Mehmani
PDF
Minimization of Assignment Problems
On Modeling Murder Crimes in Nigeria
working with python
ModelSelection1_WCSMO_2013_Ali
Adjusting PageRank parameters and comparing results : REPORT
Adjusting PageRank parameters and comparing results : REPORT
PEMF2_SDM_2012_Ali
AIAA-Aviation-2015-Mehmani
Minimization of Assignment Problems

What's hot (20)

PDF
Parametric estimation of construction cost using combined bootstrap and regre...
DOC
MC0079 SMU 2013 Falll Session
PDF
V. pacáková, d. brebera
PDF
Building the Professional of 2020: An Approach to Business Change Process Int...
PPT
Forecasting Default Probabilities in Emerging Markets and Dynamical Regula...
PPTX
PEMF-1-MAO2012-Ali
PDF
Six Sigma Methods and Formulas for Successful Quality Management
PPTX
AIAA-SciTech-ModelSelection-2014-Mehmani
PDF
ProbErrorBoundROM_MC2015
PDF
Time Series Forecasting using Neural Nets (GNNNs)
PDF
Software of Time Series Forecasting based on Combinations of Fuzzy and Statis...
PDF
Study_Pricing_Digital_Call_Options
PDF
Comparing between maximum
PDF
2014 on exact solutions for the minmax regret aspanning tree problem
PDF
APPROACHES IN USING EXPECTATIONMAXIMIZATION ALGORITHM FOR MAXIMUM LIKELIHOOD ...
PDF
Adaptive response surface by kriging using pilot points for structural reliab...
PDF
Maximum likelihood estimation-assisted ASVSF through state covariance-based 2...
PDF
Quantitative Risk Assessment - Road Development Perspective
PDF
A REVIEW ON OPTIMIZATION OF LEAST SQUARES SUPPORT VECTOR MACHINE FOR TIME SER...
Parametric estimation of construction cost using combined bootstrap and regre...
MC0079 SMU 2013 Falll Session
V. pacáková, d. brebera
Building the Professional of 2020: An Approach to Business Change Process Int...
Forecasting Default Probabilities in Emerging Markets and Dynamical Regula...
PEMF-1-MAO2012-Ali
Six Sigma Methods and Formulas for Successful Quality Management
AIAA-SciTech-ModelSelection-2014-Mehmani
ProbErrorBoundROM_MC2015
Time Series Forecasting using Neural Nets (GNNNs)
Software of Time Series Forecasting based on Combinations of Fuzzy and Statis...
Study_Pricing_Digital_Call_Options
Comparing between maximum
2014 on exact solutions for the minmax regret aspanning tree problem
APPROACHES IN USING EXPECTATIONMAXIMIZATION ALGORITHM FOR MAXIMUM LIKELIHOOD ...
Adaptive response surface by kriging using pilot points for structural reliab...
Maximum likelihood estimation-assisted ASVSF through state covariance-based 2...
Quantitative Risk Assessment - Road Development Perspective
A REVIEW ON OPTIMIZATION OF LEAST SQUARES SUPPORT VECTOR MACHINE FOR TIME SER...
Ad

Similar to Enhance interval width of crime forecasting with ARIMA model-fuzzy alpha cut (20)

PPT
ARIMA Model agfmabmnbamnbgf afgkhkahkgh asfhdkhkasfhk
PPT
ARIMA Model for analysis of time series data.ppt
PPT
ARIMA Model.ppt
PPT
ARIMA Model.ppt
PPTX
PPTX
PDF
Different Models Used In Time Series - InsideAIML
PDF
PDF
Forecasting%20Economic%20Series%20using%20ARMA
PPTX
Air Passenger Prediction Using ARIMA Model
PDF
ARIMA Models - [Lab 3]
PPTX
PDF
arimamodel-170204090012.pdf
PPTX
Time series analysis
PPTX
Business Analytics Foundation with R tool - Part 5
PDF
Lecture_18 hypothesis testing and probability
PPTX
Arima model
PPTX
Time series analysis 101
PPTX
What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?
PDF
Study of effectiveness of time series modeling (arima) in forecasting stock p...
ARIMA Model agfmabmnbamnbgf afgkhkahkgh asfhdkhkasfhk
ARIMA Model for analysis of time series data.ppt
ARIMA Model.ppt
ARIMA Model.ppt
Different Models Used In Time Series - InsideAIML
Forecasting%20Economic%20Series%20using%20ARMA
Air Passenger Prediction Using ARIMA Model
ARIMA Models - [Lab 3]
arimamodel-170204090012.pdf
Time series analysis
Business Analytics Foundation with R tool - Part 5
Lecture_18 hypothesis testing and probability
Arima model
Time series analysis 101
What is ARIMA Forecasting and How Can it Be Used for Enterprise Analysis?
Study of effectiveness of time series modeling (arima) in forecasting stock p...
Ad
