SlideShare a Scribd company logo
International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013
DOI: 10.5121/ijci.2013.2401 1
Financial Time Series Analysis Based On
Normalized Mutual Information Functions.
David Blokh
C. D. Technologies Ltd., Israel
david_blokh@012.net.il
ABSTRACT
A method of predictability analysis of future values of financial time series is described. The method is
based on normalized mutual information functions. In the analysis, the use of these functions allowed to
refuse any restrictions on the distributions of the parameters and on the correlations between parameters.
A comparative analysis of the predictability of financial time series of Tel Aviv 25 stock exchange has been
carried out.
KEYWORDS
Nonlinear financial time series, Normalized mutual information function
1. INTRODUCTION
Estimation of the possibility of predicting future values of a time series is an important problem
in the analysis of financial time series [1]. This problem is applicable in econophysics [2] and, in
particular, in the forecast of financial markets [2,3]. In the present paper, the following version of
the problem is examined:
A set of time series is specified. Find time series with the "highest" possibility of predicting future
values in this set.
The principal method of the study of the time series prediction problem is the use of correlation
functions that allow evaluating the possibility of linear prediction of future values of a time series.
Thereby, the gaussianity of distribution and the linearity of correlations are assumed. Rather
often, these conditions do not hold for financial time series [1,4]. In the present paper, functions
of normalized mutual information are used for the analysis of the predictability of financial time
series. It allowed to refuse any restrictions on distributions and correlations.
The possibility of using information functions for the analysis of time series was first noted in [5].
Functions of mutual information were used for the analysis in [1,6,7]. However, mutual
information is an unnormalized value, and it is impossible to compare mutual information
functions of different time series. Therefore, in the present paper, normalized mutual information
is a measure of coupling between two time series. On the one hand, this measure allows the
estimation of nonlinear correlations between time series, and on the other, the comparison of
functions of normalized mutual information of different time series. Normalized mutual
information has been applied with significant results in various fields of medicine [8], in
International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013
2
particular in oncology [9,10,11]. The approach was proposed earlier [9], as described in the
monograph [12].
Using the method described in the present paper, a time series with the highest assessment of the
possibility of future values prediction has been chosen from time series with the parameter
"Change" of all companies of Tel Aviv 25 stock exchange. An approach to the clustering of
financial time series similar to that described in the present paper is presented in [13].
Predictability estimation of financial markets also was carried out using statistical methods
[14,15], wavelet analysis [16] and fractals [17].
2. ANALYSIS ALGORITHM
The analysis algorithm consists of four procedures:
- discretization of time series values;
- construction of a normalized mutual information function matrix;
- ranking of columns of the normalized mutual information function matrix;
- application of multiple comparisons method.
2.1 Discretization of Time Series Values
We transform a time series
1( ) ( ( ) ( ) ),...,...,i i i nt t tx x x having continuous values into a
time series
1( ) ( ( ) ( ) ),...,...,i i i nt t ty y y having discrete values. Discretization can be
performed taking into account the properties of the time series [18]. If the time series does not
possess respective properties or we are unaware of them, then we can use formal rules of
discretization [19].
2.2. Construction of normalized mutual information function
Let 1( ) ( ( ) ( ) ),...,...,i i i nt t ty y y represent time series having discrete values and
1( ) ( ( ) ( ) ),...,..., n jji ii t j t ty y y  be a time series ( )i ty with a lag j.
The normalized mutual information of the two time series ( )i ty and ( )i t jy  equals [8,20]
( ) ( ) ( ) ( ) ( ) ( )
( )
( ) ( )
( ; ) ( ) ( ) ( , )
( ) ( )i
i i i i i i
i
i
t t j t t j t t j
j
t j t j
I H H H
H Hy
y y y y y y
c
y
  
 
 
 
where ( )( )i tH y , ( )( )i t jH y  , ( ) ( )( , )i it t jH y y  are entropies of random values ( )i ty ,
( )i t jy  , ( ) ( )i it t jy y  , respectively.
International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013
3
The normalized mutual information ( )i jc is then calculated as a function of the lag j.
Properties of the Normalized Mutual Information Function ( )i jc [8,21]:
1) 0  ( )i jc  1;
2) ( )i jc = 0 if and only if ( )tiy and ( )t jiy  are mutually independent;
3) ( )i jc = 1 if and only if there exists a functional relationship between ( )tiy
and ( )t jiy  .
Consider a set of time series
1 1 1 1 1( ) ( ( ) ( ) ), , ( ) ( ( ) ( ) ),... ,...{ ,..., ... ,..., }m m m nnX t t t t t tx x x x x x 
having continuous values. Performing discretization, we obtain a set of time series
1 1 1 1 1( ) ( ( ) ( ) ) ( ) ( ( ) ( ) ), , ,...{ ,..., ,... ,..., }... m m m nnY t t t t t ty y y y y y 
We compute the normalized mutual information functions ( )i jc for the time series
iy
1 i m  ,1 j k  and obtain m×k matrix of function values ( )[ ]i jc .
2.3. Ranking of Columns of Normalized Mutual Information Function Matrix
Each row of ( )[ ]i jc matrix is a normalized mutual information function of time series, and each
column contains the values of normalized mutual information functions corresponding to the
same lag. For each column of ( )[ ]i jc matrix, we rank its entries and assign the rank 1 to the
smallest entry of the column. We obtain m×k matrix of ranks ( )[ ]i jr , with each column of the
matrix containing ranks from 1 to m.
We estimate the predictability of the i-th time series as compared to other time series by
the sum of all the entries of i-th row of the matrix ( )[ ]i jr .
Such estimation allows us to use multiple comparisons of rank statistics for the comparison of
time series predictability.
International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013
4
2.3. Application of Multiple Comparisons Method
We compare rank sums using the Newman-Keuls test [22]. The application of this test allows us
to obtain an estimation adequate to the problem content. This test is successfully used for the
analysis of biomedical data [22,23,24] and clustering financial time series [13].
3. PREDICTABILITY ESTIMATION OF TEL AVIV 25 STOCK
EXCHANGE COMPANIES
We perform the predictability estimation of Tel-Aviv 25 stock exchange companies. Obviously,
similar estimates can be also performed for other markets. The importance of such estimation is
noted in [2]. The complexity of this problem consists in the fact that the financial time series
contain nonlinearities [4].
Let ( )ji tp constitute the Adjusted Closing Price of i-th company on the
jt day, and ( )ji tx –
the parameter “Change” equal to
1
1
( ) ( )
( )
( )
j j
j
j
i i
i
i
t t
t
t
p p
x p




.
We consider the changes in the parameter “Change” for all Tel-Aviv 25 stock exchange
companies [25] in the period from June 8, 2010 till December 14, 2011 (374 days). Thus, we are
dealing with a set of time series 251( ),..., ( ){ }t tx x , and the value of each series is the
parameter “Change”. Thus, we have 25 time series, and each series contains 374 elements.
We carry out, step by step, the four procedures mentioned in the previous section.
1. The discretization rule is as follows [19]:
Having performed the discretization, we obtain a set of time series 251( ),..., ( ){ }t ty y , whose
elements assume the values of –2, –1, 0, 1, 2. The discretization is such that the quantities of
elements in each category are “approximately equal”.
2. Then we compute the matrix of normalized mutual information functions – Table 1.
International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013
5
3. We rank entries of each column of the normalized mutual information function matrix –
Table 2.
Let us consider Table 2 as the Friedman statistical model [26], and examine the row effect of this
table.
Hypotheses
H0: There is no row effect (“null hypothesis”).
H1: The null hypothesis is invalid. Critical range. The sample is “large”, therefore, the critical
range is the upper 1%-range of 2
24 distribution.
Let us calculate the 2
 -criterion [26]. This gives us 2
 = 56.05.
The critical range is 2
24 > 42.98. Since 56.05 > 42.98, the null hypothesis with respect to Table
2 is rejected. Thus, according to the Friedman test, the row effect has been found. Hence, there is
a difference between the rows under consideration.
4. For multiple comparisons, we use the Newman-Keuls test [22].
1
12.08j j
R R 
 
, where Rj and Rj+1 are elements of the column “Sum of ranks” in the j-th
and (j+1)-th rows of Table 3, respectively. By multiple comparisons, we construct the
predictability estimation shown in Table 3.
The obtained predictability estimation possesses the following properties: for two neighboring
sets of Table 3, the smallest element of one set and the greatest element of another located nearby
are significantly different ( T =0.01); elements belonging to the same set do not differ from
each other ( T =0.01). The differences between Set 1 (STRAUSS GROUP) and all the elements
(companies of the stock exchange) are statistically significant ( T =0.01). Thus, the time series
of the parameter “Change” of STRAUSS GROUP company is the most predictable in comparison
with time series of other Tel Aviv 25 Stock Exchange companies.
International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013
6
Table 1. Matrix of normalized mutual information functions.
Table 2. Ranked entries of each column of the normalized mutual information function matrix.
International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013
7
Table 3. Predictability estimation of Tel Aviv 25 Stock Exchange Companies
4. CONCLUSION
The analysis of financial time series based on the functions of normalized mutual information
allows predictability assessment without the assumptions of gaussianity and linearity. The use of
these functions also allows us to compare the predictabilities of different time series. At present,
there are no such advanced prediction methods for the functions of normalized mutual
information as for the analysis based on correlation functions. The development of prediction
methods for time series analysis based on the functions of normalized mutual information should
become an object of further research.
REFERENCES
[1] M. Small (2005) “Applied Nonlinear Time Series Analysis. Applications in Physics, Physiology and
Finance,” World Scientific Publishing Co. Pte. Ltd., Singapore.
[2] R.N. Mantegna, H.E. Stanley, (2000) “An Introduction to Econophysics. Correlations and Complexity
in Finance,” Cambridge University Press, Cambridge.
[3] R. N. Mantegna, Z. Palágyi, H.E. Stanley, (1999) “Applications of statistical mechanics to finance,”
Physica A: Statistical Mechanics and its Applications, Vol. 274, No. 1-2, pp. 216-221.
[4] Cont, (2002) “Empirical properties of asset returns: stylized facts and statistical issues,” Quantitative
Finance, Vol. 1, No. 2, pp. 223-236.
International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013
8
[5] W. Li,(1999) “Mutual Information Functions Versus Correlation Functions,” Journal of Statistical
Physics, Vol. 60, No. 5-6, pp. 823-837.
[6] G. A. Darbellay, D.Wuertz, (2000) “The entropy as a tool for analysing statistical dependences in
financial time series,” Physica A: Statistical Mechanics and its Applications, Vol. 287, No. 3-4, pp.
429-439.
[7] A. Dionisio, R. Menezes, D. Mendes, (2004) “Mutual information: a measure of dependency for
nonlinear time series,” Physica A: Statistical Mechanics and its Applications, Vol. 344, No. 1-2, pp.
326-329.
[8] J. Zvarova, M. Studeny, (1997) “Information theoretical approach to constitution and reduction of
medical data,” International Journal of Medical Informatics, Vol. 45, No. 1-2, pp. 65-74.
[9] D. Blokh, I.Stambler, E. Afrimzon, et al. (2007) “The information-theory analysis of Michaelis-
Menten constants for detection of breast cancer,” Cancer Detection and Prevention, Vol. 31, No. 6,
pp. 489-498.
[10] D. Blokh, N. Zurgil, I. Stambler, et al. (2008) “An information-theoretical model for breast cancer
detection,” Methods of Information in Medicine, Vol. 47, no. 4, pp. 322-327.
[11] D. Blokh, I. Stambler, E. Afrimzon, et al. (2009) “Comparative analysis of cell parameter groups for
breast cancer detection,” Computer Methods and Programs in Biomedicine, Vol. 94, No. 3, pp. 239-
249.
[12] P. J. Gutierrez Diez, I.H. Russo, J. Russo, (2012) “The Evolution of the Use of Mathematics in
Cancer Research,” Springer, New York.
[13] D. Blokh, (2012) “Clustering financial time series via information-theory analysis and rank
statistics,” Journal of Pattern Recognition Research, Vol. 7, No. 1, pp. 106-115.
http://www.jprr.org/index.php/jprr/article/viewFile/396/166
[14] D. K. Patro, Y. Wu, (2004) “Predictability of short-horizon returns in international equity markets,”
Journal of Empirical Finance, Vol. 11, No. 4, pp. 553-584.
[15] M. Cooper, R. C. Gutierrez, Jr., B. Marcum, (2005) “On the Predictability of Stock Returns in Real
Time,” Journal of Business, Vol. 78, No. 2, pp. 469-500.
http://www.jstor.org/stable/full/10.1086/427635
[16] N.A. Kyaw, C.A. Los, and S. Zong, (2006) “Persistence characteristics of Latin American financial
markets”, Journal of Multinational Financial Management, Vol. 16, No. 3, pp. 269–290.
[17] G. R. Richards, (2004) “A Fractal Forecasting Model for Financial Time Series”, Journal of
Forecasting, Vol. 23, No. 8, pp. 587-602.
[18] G. Nicolis, I. Prigogine, (1990) “Exploring Complexity,” W.H. Freeman, New York.
[19] G.V. Glass, J.C. Stanley, (1970) “Statistical Methods in Education and Psychology,” Prentice- Hall,
New Jersey.
[20] A. Renyi, (1959) “On measures of dependence,” Acta Mathematica Academiae Scientiarum
Hungaricae, Vol. 10, No. 3-4, pp. 441-451.
[21] T.M. Cover, J.A. Thomas, (2006) “Elements of Information Theory,” 2nd ed., Wiley-Interscience,
New York.
[22] S.A. Glantz, (1994) “Primer of Biostatistics,” 4th ed., McGraw-Hill, New York.
[23] S.M. Pupa, S. Giuffre, F.Castiglioni,“Regulation of Breast Cancer Response to Chemotherapy by
Fibulin-1,” (2007) Cancer Research, Vol. 67, No.9, pp. 4271-4277.
[24] S.S. Wu, W. Wang, D.H. Annis, “On Identification of the Number of Best Treatments Using the
Newman-Keuls Test,” (2008) Biometrical Journal, Vol. 50, No. 5, pp. 861-869.
[25] Tel-Aviv,25stockexchange
http://www.tase.co.il/TASEEng/MarketData/Indices/MarketCap/IndexMainDataMarket.htm?Action=
1&IndexID=142) [26] W.J. Conover, (1999) “Practical Nonparametric Statistics,” Wiley-Interscience,
New York.

More Related Content

PDF
recko_paper
PDF
PREDICTIVE EVALUATION OF THE STOCK PORTFOLIO PERFORMANCE USING FUZZY CMEANS A...
PDF
A hybrid bacterial foraging and modified particle swarm optimization for mode...
PDF
An Exponential Observer Design for a Class of Chaotic Systems with Exponentia...
PDF
Model reduction of unstable systems based on balanced truncation algorithm
PDF
LOGNORMAL ORDINARY KRIGING METAMODEL IN SIMULATION OPTIMIZATION
PDF
PDF
F5233444
recko_paper
PREDICTIVE EVALUATION OF THE STOCK PORTFOLIO PERFORMANCE USING FUZZY CMEANS A...
A hybrid bacterial foraging and modified particle swarm optimization for mode...
An Exponential Observer Design for a Class of Chaotic Systems with Exponentia...
Model reduction of unstable systems based on balanced truncation algorithm
LOGNORMAL ORDINARY KRIGING METAMODEL IN SIMULATION OPTIMIZATION
F5233444

What's hot (18)

PDF
Ds33717725
PPTX
Image recogonization
PDF
Multi objective predictive control a solution using metaheuristics
PDF
A Counterexample to the Forward Recursion in Fuzzy Critical Path Analysis Und...
PDF
soft computing stock market price prediction for the nigerian stock exchange
PDF
A new survey on biclustering of
PDF
Effect Estimation Method of Parallel Computing Based on Dynamic Generation of...
PDF
Estimating Population Mean in Sample Surveys
PDF
VARIATIONAL MONTE-CARLO APPROACH FOR ARTICULATED OBJECT TRACKING
PDF
Sequential estimation of_discrete_choice_models__copy_-4
PDF
A derivative free high ordered hybrid equation solver
PDF
Fuzzy c-means
PDF
Anirban part1
PDF
A delay decomposition approach to robust stability analysis of uncertain syst...
PDF
EDGE DETECTION IN SEGMENTED IMAGES THROUGH MEAN SHIFT ITERATIVE GRADIENT USIN...
PDF
Lecture 8 (Stereo imaging) (Digital Image Processing)
PDF
Master_Thesis_Harihara_Subramanyam_Sreenivasan
Ds33717725
Image recogonization
Multi objective predictive control a solution using metaheuristics
A Counterexample to the Forward Recursion in Fuzzy Critical Path Analysis Und...
soft computing stock market price prediction for the nigerian stock exchange
A new survey on biclustering of
Effect Estimation Method of Parallel Computing Based on Dynamic Generation of...
Estimating Population Mean in Sample Surveys
VARIATIONAL MONTE-CARLO APPROACH FOR ARTICULATED OBJECT TRACKING
Sequential estimation of_discrete_choice_models__copy_-4
A derivative free high ordered hybrid equation solver
Fuzzy c-means
Anirban part1
A delay decomposition approach to robust stability analysis of uncertain syst...
EDGE DETECTION IN SEGMENTED IMAGES THROUGH MEAN SHIFT ITERATIVE GRADIENT USIN...
Lecture 8 (Stereo imaging) (Digital Image Processing)
Master_Thesis_Harihara_Subramanyam_Sreenivasan
Ad

Viewers also liked (8)

PPTX
A method to improve survival prediction using mutual information based network
PDF
Measuring instruments directive 2010
 
PPT
Открытый доступ: ремиксы как «кислород» для науки и образования
PPTX
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
PPTX
A brief introduction to mutual information and its application
PPTX
Data Mining: an Introduction
PPT
Social Network Analysis
PPT
Ratio Analysis
A method to improve survival prediction using mutual information based network
Measuring instruments directive 2010
 
Открытый доступ: ремиксы как «кислород» для науки и образования
Data Analysis: Evaluation Metrics for Supervised Learning Models of Machine L...
A brief introduction to mutual information and its application
Data Mining: an Introduction
Social Network Analysis
Ratio Analysis
Ad

Similar to Financial Time Series Analysis Based On Normalized Mutual Information Functions (20)

PDF
Forecasting Gasonline Price in Vietnam Based on Fuzzy Time Series and Automat...
PDF
Genetic Algorithm for solving Dynamic Supply Chain Problem
PDF
IRJET- Two-Class Priority Queueing System with Restricted Number of Priority ...
PDF
Time series analysis, modeling and applications
PDF
COMPARISON OF WAVELET NETWORK AND LOGISTIC REGRESSION IN PREDICTING ENTERPRIS...
PDF
D143136
PDF
Simultaneous State and Actuator Fault Estimation With Fuzzy Descriptor PMID a...
PDF
Stability and stabilization of discrete-time systems with time-delay via Lyap...
PDF
Exponential State Observer Design for a Class of Uncertain Chaotic and Non-Ch...
PDF
Ijciet 10 02_085
PDF
Regeneration of simple and complicated curves using Fourier series
PDF
Controller Design Based On Fuzzy Observers For T-S Fuzzy Bilinear Models
PDF
A Study on Youth Violence and Aggression using DEMATEL with FCM Methods
PDF
1 Aminullah Assagaf_Estimation-of-domain-of-attraction-for-the-fract_2021_Non...
PDF
Stochastic Vol Forecasting
PDF
Application of panel data to the effect of five (5) world development indicat...
PDF
Application of panel data to the effect of five (5) world development indicat...
PDF
Design of State Estimator for a Class of Generalized Chaotic Systems
PDF
Time alignment techniques for experimental sensor data
PDF
04_AJMS_371_22.pdf
Forecasting Gasonline Price in Vietnam Based on Fuzzy Time Series and Automat...
Genetic Algorithm for solving Dynamic Supply Chain Problem
IRJET- Two-Class Priority Queueing System with Restricted Number of Priority ...
Time series analysis, modeling and applications
COMPARISON OF WAVELET NETWORK AND LOGISTIC REGRESSION IN PREDICTING ENTERPRIS...
D143136
Simultaneous State and Actuator Fault Estimation With Fuzzy Descriptor PMID a...
Stability and stabilization of discrete-time systems with time-delay via Lyap...
Exponential State Observer Design for a Class of Uncertain Chaotic and Non-Ch...
Ijciet 10 02_085
Regeneration of simple and complicated curves using Fourier series
Controller Design Based On Fuzzy Observers For T-S Fuzzy Bilinear Models
A Study on Youth Violence and Aggression using DEMATEL with FCM Methods
1 Aminullah Assagaf_Estimation-of-domain-of-attraction-for-the-fract_2021_Non...
Stochastic Vol Forecasting
Application of panel data to the effect of five (5) world development indicat...
Application of panel data to the effect of five (5) world development indicat...
Design of State Estimator for a Class of Generalized Chaotic Systems
Time alignment techniques for experimental sensor data
04_AJMS_371_22.pdf

Recently uploaded (20)

PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
MIND Revenue Release Quarter 2 2025 Press Release
PPTX
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
PDF
Approach and Philosophy of On baking technology
PDF
Spectral efficient network and resource selection model in 5G networks
PDF
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
PPTX
20250228 LYD VKU AI Blended-Learning.pptx
PPTX
Spectroscopy.pptx food analysis technology
PPTX
Big Data Technologies - Introduction.pptx
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Building Integrated photovoltaic BIPV_UPV.pdf
PDF
Review of recent advances in non-invasive hemoglobin estimation
PPTX
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
PDF
Machine learning based COVID-19 study performance prediction
PDF
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
DOCX
The AUB Centre for AI in Media Proposal.docx
PPTX
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
PDF
Encapsulation_ Review paper, used for researhc scholars
PDF
Empathic Computing: Creating Shared Understanding
PDF
Encapsulation theory and applications.pdf
Chapter 3 Spatial Domain Image Processing.pdf
MIND Revenue Release Quarter 2 2025 Press Release
ACSFv1EN-58255 AWS Academy Cloud Security Foundations.pptx
Approach and Philosophy of On baking technology
Spectral efficient network and resource selection model in 5G networks
How UI/UX Design Impacts User Retention in Mobile Apps.pdf
20250228 LYD VKU AI Blended-Learning.pptx
Spectroscopy.pptx food analysis technology
Big Data Technologies - Introduction.pptx
The Rise and Fall of 3GPP – Time for a Sabbatical?
Building Integrated photovoltaic BIPV_UPV.pdf
Review of recent advances in non-invasive hemoglobin estimation
VMware vSphere Foundation How to Sell Presentation-Ver1.4-2-14-2024.pptx
Machine learning based COVID-19 study performance prediction
Blue Purple Modern Animated Computer Science Presentation.pdf.pdf
The AUB Centre for AI in Media Proposal.docx
KOM of Painting work and Equipment Insulation REV00 update 25-dec.pptx
Encapsulation_ Review paper, used for researhc scholars
Empathic Computing: Creating Shared Understanding
Encapsulation theory and applications.pdf

Financial Time Series Analysis Based On Normalized Mutual Information Functions

  • 1. International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013 DOI: 10.5121/ijci.2013.2401 1 Financial Time Series Analysis Based On Normalized Mutual Information Functions. David Blokh C. D. Technologies Ltd., Israel david_blokh@012.net.il ABSTRACT A method of predictability analysis of future values of financial time series is described. The method is based on normalized mutual information functions. In the analysis, the use of these functions allowed to refuse any restrictions on the distributions of the parameters and on the correlations between parameters. A comparative analysis of the predictability of financial time series of Tel Aviv 25 stock exchange has been carried out. KEYWORDS Nonlinear financial time series, Normalized mutual information function 1. INTRODUCTION Estimation of the possibility of predicting future values of a time series is an important problem in the analysis of financial time series [1]. This problem is applicable in econophysics [2] and, in particular, in the forecast of financial markets [2,3]. In the present paper, the following version of the problem is examined: A set of time series is specified. Find time series with the "highest" possibility of predicting future values in this set. The principal method of the study of the time series prediction problem is the use of correlation functions that allow evaluating the possibility of linear prediction of future values of a time series. Thereby, the gaussianity of distribution and the linearity of correlations are assumed. Rather often, these conditions do not hold for financial time series [1,4]. In the present paper, functions of normalized mutual information are used for the analysis of the predictability of financial time series. It allowed to refuse any restrictions on distributions and correlations. The possibility of using information functions for the analysis of time series was first noted in [5]. Functions of mutual information were used for the analysis in [1,6,7]. However, mutual information is an unnormalized value, and it is impossible to compare mutual information functions of different time series. Therefore, in the present paper, normalized mutual information is a measure of coupling between two time series. On the one hand, this measure allows the estimation of nonlinear correlations between time series, and on the other, the comparison of functions of normalized mutual information of different time series. Normalized mutual information has been applied with significant results in various fields of medicine [8], in
  • 2. International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013 2 particular in oncology [9,10,11]. The approach was proposed earlier [9], as described in the monograph [12]. Using the method described in the present paper, a time series with the highest assessment of the possibility of future values prediction has been chosen from time series with the parameter "Change" of all companies of Tel Aviv 25 stock exchange. An approach to the clustering of financial time series similar to that described in the present paper is presented in [13]. Predictability estimation of financial markets also was carried out using statistical methods [14,15], wavelet analysis [16] and fractals [17]. 2. ANALYSIS ALGORITHM The analysis algorithm consists of four procedures: - discretization of time series values; - construction of a normalized mutual information function matrix; - ranking of columns of the normalized mutual information function matrix; - application of multiple comparisons method. 2.1 Discretization of Time Series Values We transform a time series 1( ) ( ( ) ( ) ),...,...,i i i nt t tx x x having continuous values into a time series 1( ) ( ( ) ( ) ),...,...,i i i nt t ty y y having discrete values. Discretization can be performed taking into account the properties of the time series [18]. If the time series does not possess respective properties or we are unaware of them, then we can use formal rules of discretization [19]. 2.2. Construction of normalized mutual information function Let 1( ) ( ( ) ( ) ),...,...,i i i nt t ty y y represent time series having discrete values and 1( ) ( ( ) ( ) ),...,..., n jji ii t j t ty y y  be a time series ( )i ty with a lag j. The normalized mutual information of the two time series ( )i ty and ( )i t jy  equals [8,20] ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ) ( ; ) ( ) ( ) ( , ) ( ) ( )i i i i i i i i i t t j t t j t t j j t j t j I H H H H Hy y y y y y y c y          where ( )( )i tH y , ( )( )i t jH y  , ( ) ( )( , )i it t jH y y  are entropies of random values ( )i ty , ( )i t jy  , ( ) ( )i it t jy y  , respectively.
  • 3. International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013 3 The normalized mutual information ( )i jc is then calculated as a function of the lag j. Properties of the Normalized Mutual Information Function ( )i jc [8,21]: 1) 0  ( )i jc  1; 2) ( )i jc = 0 if and only if ( )tiy and ( )t jiy  are mutually independent; 3) ( )i jc = 1 if and only if there exists a functional relationship between ( )tiy and ( )t jiy  . Consider a set of time series 1 1 1 1 1( ) ( ( ) ( ) ), , ( ) ( ( ) ( ) ),... ,...{ ,..., ... ,..., }m m m nnX t t t t t tx x x x x x  having continuous values. Performing discretization, we obtain a set of time series 1 1 1 1 1( ) ( ( ) ( ) ) ( ) ( ( ) ( ) ), , ,...{ ,..., ,... ,..., }... m m m nnY t t t t t ty y y y y y  We compute the normalized mutual information functions ( )i jc for the time series iy 1 i m  ,1 j k  and obtain m×k matrix of function values ( )[ ]i jc . 2.3. Ranking of Columns of Normalized Mutual Information Function Matrix Each row of ( )[ ]i jc matrix is a normalized mutual information function of time series, and each column contains the values of normalized mutual information functions corresponding to the same lag. For each column of ( )[ ]i jc matrix, we rank its entries and assign the rank 1 to the smallest entry of the column. We obtain m×k matrix of ranks ( )[ ]i jr , with each column of the matrix containing ranks from 1 to m. We estimate the predictability of the i-th time series as compared to other time series by the sum of all the entries of i-th row of the matrix ( )[ ]i jr . Such estimation allows us to use multiple comparisons of rank statistics for the comparison of time series predictability.
  • 4. International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013 4 2.3. Application of Multiple Comparisons Method We compare rank sums using the Newman-Keuls test [22]. The application of this test allows us to obtain an estimation adequate to the problem content. This test is successfully used for the analysis of biomedical data [22,23,24] and clustering financial time series [13]. 3. PREDICTABILITY ESTIMATION OF TEL AVIV 25 STOCK EXCHANGE COMPANIES We perform the predictability estimation of Tel-Aviv 25 stock exchange companies. Obviously, similar estimates can be also performed for other markets. The importance of such estimation is noted in [2]. The complexity of this problem consists in the fact that the financial time series contain nonlinearities [4]. Let ( )ji tp constitute the Adjusted Closing Price of i-th company on the jt day, and ( )ji tx – the parameter “Change” equal to 1 1 ( ) ( ) ( ) ( ) j j j j i i i i t t t t p p x p     . We consider the changes in the parameter “Change” for all Tel-Aviv 25 stock exchange companies [25] in the period from June 8, 2010 till December 14, 2011 (374 days). Thus, we are dealing with a set of time series 251( ),..., ( ){ }t tx x , and the value of each series is the parameter “Change”. Thus, we have 25 time series, and each series contains 374 elements. We carry out, step by step, the four procedures mentioned in the previous section. 1. The discretization rule is as follows [19]: Having performed the discretization, we obtain a set of time series 251( ),..., ( ){ }t ty y , whose elements assume the values of –2, –1, 0, 1, 2. The discretization is such that the quantities of elements in each category are “approximately equal”. 2. Then we compute the matrix of normalized mutual information functions – Table 1.
  • 5. International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013 5 3. We rank entries of each column of the normalized mutual information function matrix – Table 2. Let us consider Table 2 as the Friedman statistical model [26], and examine the row effect of this table. Hypotheses H0: There is no row effect (“null hypothesis”). H1: The null hypothesis is invalid. Critical range. The sample is “large”, therefore, the critical range is the upper 1%-range of 2 24 distribution. Let us calculate the 2  -criterion [26]. This gives us 2  = 56.05. The critical range is 2 24 > 42.98. Since 56.05 > 42.98, the null hypothesis with respect to Table 2 is rejected. Thus, according to the Friedman test, the row effect has been found. Hence, there is a difference between the rows under consideration. 4. For multiple comparisons, we use the Newman-Keuls test [22]. 1 12.08j j R R    , where Rj and Rj+1 are elements of the column “Sum of ranks” in the j-th and (j+1)-th rows of Table 3, respectively. By multiple comparisons, we construct the predictability estimation shown in Table 3. The obtained predictability estimation possesses the following properties: for two neighboring sets of Table 3, the smallest element of one set and the greatest element of another located nearby are significantly different ( T =0.01); elements belonging to the same set do not differ from each other ( T =0.01). The differences between Set 1 (STRAUSS GROUP) and all the elements (companies of the stock exchange) are statistically significant ( T =0.01). Thus, the time series of the parameter “Change” of STRAUSS GROUP company is the most predictable in comparison with time series of other Tel Aviv 25 Stock Exchange companies.
  • 6. International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013 6 Table 1. Matrix of normalized mutual information functions. Table 2. Ranked entries of each column of the normalized mutual information function matrix.
  • 7. International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013 7 Table 3. Predictability estimation of Tel Aviv 25 Stock Exchange Companies 4. CONCLUSION The analysis of financial time series based on the functions of normalized mutual information allows predictability assessment without the assumptions of gaussianity and linearity. The use of these functions also allows us to compare the predictabilities of different time series. At present, there are no such advanced prediction methods for the functions of normalized mutual information as for the analysis based on correlation functions. The development of prediction methods for time series analysis based on the functions of normalized mutual information should become an object of further research. REFERENCES [1] M. Small (2005) “Applied Nonlinear Time Series Analysis. Applications in Physics, Physiology and Finance,” World Scientific Publishing Co. Pte. Ltd., Singapore. [2] R.N. Mantegna, H.E. Stanley, (2000) “An Introduction to Econophysics. Correlations and Complexity in Finance,” Cambridge University Press, Cambridge. [3] R. N. Mantegna, Z. Palágyi, H.E. Stanley, (1999) “Applications of statistical mechanics to finance,” Physica A: Statistical Mechanics and its Applications, Vol. 274, No. 1-2, pp. 216-221. [4] Cont, (2002) “Empirical properties of asset returns: stylized facts and statistical issues,” Quantitative Finance, Vol. 1, No. 2, pp. 223-236.
  • 8. International Journal on Cybernetics & Informatics ( IJCI) Vol.2, No.4, August 2013 8 [5] W. Li,(1999) “Mutual Information Functions Versus Correlation Functions,” Journal of Statistical Physics, Vol. 60, No. 5-6, pp. 823-837. [6] G. A. Darbellay, D.Wuertz, (2000) “The entropy as a tool for analysing statistical dependences in financial time series,” Physica A: Statistical Mechanics and its Applications, Vol. 287, No. 3-4, pp. 429-439. [7] A. Dionisio, R. Menezes, D. Mendes, (2004) “Mutual information: a measure of dependency for nonlinear time series,” Physica A: Statistical Mechanics and its Applications, Vol. 344, No. 1-2, pp. 326-329. [8] J. Zvarova, M. Studeny, (1997) “Information theoretical approach to constitution and reduction of medical data,” International Journal of Medical Informatics, Vol. 45, No. 1-2, pp. 65-74. [9] D. Blokh, I.Stambler, E. Afrimzon, et al. (2007) “The information-theory analysis of Michaelis- Menten constants for detection of breast cancer,” Cancer Detection and Prevention, Vol. 31, No. 6, pp. 489-498. [10] D. Blokh, N. Zurgil, I. Stambler, et al. (2008) “An information-theoretical model for breast cancer detection,” Methods of Information in Medicine, Vol. 47, no. 4, pp. 322-327. [11] D. Blokh, I. Stambler, E. Afrimzon, et al. (2009) “Comparative analysis of cell parameter groups for breast cancer detection,” Computer Methods and Programs in Biomedicine, Vol. 94, No. 3, pp. 239- 249. [12] P. J. Gutierrez Diez, I.H. Russo, J. Russo, (2012) “The Evolution of the Use of Mathematics in Cancer Research,” Springer, New York. [13] D. Blokh, (2012) “Clustering financial time series via information-theory analysis and rank statistics,” Journal of Pattern Recognition Research, Vol. 7, No. 1, pp. 106-115. http://www.jprr.org/index.php/jprr/article/viewFile/396/166 [14] D. K. Patro, Y. Wu, (2004) “Predictability of short-horizon returns in international equity markets,” Journal of Empirical Finance, Vol. 11, No. 4, pp. 553-584. [15] M. Cooper, R. C. Gutierrez, Jr., B. Marcum, (2005) “On the Predictability of Stock Returns in Real Time,” Journal of Business, Vol. 78, No. 2, pp. 469-500. http://www.jstor.org/stable/full/10.1086/427635 [16] N.A. Kyaw, C.A. Los, and S. Zong, (2006) “Persistence characteristics of Latin American financial markets”, Journal of Multinational Financial Management, Vol. 16, No. 3, pp. 269–290. [17] G. R. Richards, (2004) “A Fractal Forecasting Model for Financial Time Series”, Journal of Forecasting, Vol. 23, No. 8, pp. 587-602. [18] G. Nicolis, I. Prigogine, (1990) “Exploring Complexity,” W.H. Freeman, New York. [19] G.V. Glass, J.C. Stanley, (1970) “Statistical Methods in Education and Psychology,” Prentice- Hall, New Jersey. [20] A. Renyi, (1959) “On measures of dependence,” Acta Mathematica Academiae Scientiarum Hungaricae, Vol. 10, No. 3-4, pp. 441-451. [21] T.M. Cover, J.A. Thomas, (2006) “Elements of Information Theory,” 2nd ed., Wiley-Interscience, New York. [22] S.A. Glantz, (1994) “Primer of Biostatistics,” 4th ed., McGraw-Hill, New York. [23] S.M. Pupa, S. Giuffre, F.Castiglioni,“Regulation of Breast Cancer Response to Chemotherapy by Fibulin-1,” (2007) Cancer Research, Vol. 67, No.9, pp. 4271-4277. [24] S.S. Wu, W. Wang, D.H. Annis, “On Identification of the Number of Best Treatments Using the Newman-Keuls Test,” (2008) Biometrical Journal, Vol. 50, No. 5, pp. 861-869. [25] Tel-Aviv,25stockexchange http://www.tase.co.il/TASEEng/MarketData/Indices/MarketCap/IndexMainDataMarket.htm?Action= 1&IndexID=142) [26] W.J. Conover, (1999) “Practical Nonparametric Statistics,” Wiley-Interscience, New York.