Center for Financial Studies at the Goethe University
PhD Mini-course
Frankfurt, 25 January 2013




Financial Networks

VI. Correlation Networks




                                                        Dr. Kimmo Soramäki
                                                        Founder and CEO
                                                        FNA, www.fna.fi
Agenda

V. Inferring Links
•   Prices and returns
•   Controlling for common factors
•   Correlation and dependence
•   Significant correlations
•   Multiple Comparisons


VI. Correlation Networks
•   Distance and Hierarchical Clustering
•   Minimum Spanning Tree & PMFG
•   Other filtering
•   Layout algorithms
Hierarchical structure in financial markets




Minimum Spanning Tree
A spanning tree of a graph is a subgraph that:
1. is a tree and
2. connects all the nodes together

The length of a tree is the sum of the lengths (distances) of its links. A minimum
spanning tree (MST) is a spanning tree with the shortest total length.

The MST reflects the hierarchical structure of the correlation matrix
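
The definition above can be illustrated with a minimal Python sketch of Kruskal's algorithm. This is not the FNA implementation of `minst`; the function name and the four-asset distance table are hypothetical and chosen only for illustration.

```python
def minimum_spanning_tree(nodes, distances):
    """Kruskal's algorithm: return the MST links as (distance, a, b) tuples.

    `distances` maps a node pair (a, b) to the distance between a and b.
    """
    parent = {n: n for n in nodes}

    def find(n):
        # Follow parent pointers to the representative of n's component.
        while parent[n] != n:
            parent[n] = parent[parent[n]]  # path halving
            n = parent[n]
        return n

    tree = []
    # Consider links from shortest to longest.
    for (a, b), d in sorted(distances.items(), key=lambda kv: kv[1]):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:  # the link joins two components, so no cycle
            parent[root_a] = root_b
            tree.append((d, a, b))
    return tree


# Hypothetical distances between four assets (not taken from the DAX data).
nodes = ["A", "B", "C", "D"]
distances = {
    ("A", "B"): 0.4, ("A", "C"): 0.9, ("A", "D"): 1.2,
    ("B", "C"): 0.5, ("B", "D"): 1.1, ("C", "D"): 0.7,
}
mst = minimum_spanning_tree(nodes, distances)
mst_length = sum(d for d, _, _ in mst)  # length of the tree = sum of its links
```

A graph with n nodes always yields n - 1 links in the tree; here the three shortest links that do not close a loop are kept.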
MST and Hierarchical Structure




Source: R.N. Mantegna (1999). Hierarchical structure in financial markets,
Eur. Phys. J. B 11, 193-197


Single Linkage Clustering

• A method for hierarchical clustering
• Clusters based on similarity or distance
• SLINK algorithm




R. Sibson (1973). SLINK: an optimally efficient algorithm for the single-link cluster
method. The Computer Journal (British Computer Society) 16 (1): 30–34.
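
As a rough illustration of single linkage clustering, the sketch below uses the naive agglomerative procedure, not Sibson's SLINK algorithm (SLINK reaches the same hierarchy more efficiently). The function name and the distance table are hypothetical.

```python
def single_linkage(nodes, dist):
    """Naive agglomerative single-linkage clustering.

    `dist(a, b)` returns the distance between two nodes.
    Returns the merges as (distance, cluster_1, cluster_2) in merge order.
    """
    clusters = [frozenset([n]) for n in nodes]
    merges = []
    while len(clusters) > 1:
        best = None
        for i in range(len(clusters)):
            for j in range(i + 1, len(clusters)):
                # Single linkage: cluster distance = closest pair of members.
                d = min(dist(a, b) for a in clusters[i] for b in clusters[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        d, i, j = best
        merges.append((d, clusters[i], clusters[j]))
        merged = clusters[i] | clusters[j]
        clusters = [c for k, c in enumerate(clusters) if k not in (i, j)]
        clusters.append(merged)
    return merges


# Hypothetical symmetric distances between four assets.
table = {
    frozenset("AB"): 0.4, frozenset("AC"): 0.9, frozenset("AD"): 1.2,
    frozenset("BC"): 0.5, frozenset("BD"): 1.1, frozenset("CD"): 0.7,
}
merges = single_linkage("ABCD", lambda a, b: table[frozenset((a, b))])
merge_heights = [d for d, _, _ in merges]
```

For this table the merge heights coincide with the link lengths of the minimum spanning tree over the same distances, which is the sense in which the MST carries the single-linkage hierarchy.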
Example
# build network from correlations
buildbycorrelationd -file daxreturns-2011-recon.csv -missing Alert -preserve false

# calculate distance
corrdistance -p correlation -method gower

# calculate single linkage clustering
slink -p corrdistance

# create heatmap
heatmap -sortv vertex_id -p correlation -symmetric true -cellsizedefault 13 -transition 0 -cellhover correlation -palette darkblue-lightgray-darkred -colordomain (-1)-1 -saveas daxheat-slink-Y
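
The distance step can be sketched in Python. A mapping commonly used in this literature (Mantegna, 1999) is d = sqrt(2(1 - rho)); whether `-method gower` computes exactly this expression is an assumption here, and the return series are made up.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equally long return series."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def correlation_distance(rho):
    """Map a correlation in [-1, 1] to a distance in [0, 2].

    Assumed form: d = sqrt(2 * (1 - rho)). The clamp guards against
    rounding errors that push rho marginally above 1.
    """
    return math.sqrt(max(0.0, 2.0 * (1.0 - rho)))


# Hypothetical daily returns of two assets that move in lockstep.
returns_a = [0.010, -0.020, 0.015, 0.005]
returns_b = [0.020, -0.040, 0.030, 0.010]
distance_ab = correlation_distance(pearson(returns_a, returns_b))
```

Under this mapping perfectly correlated assets are at distance 0, uncorrelated assets at sqrt(2), and perfectly anti-correlated assets at distance 2.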
Figure panels: Unordered, Principal Component Removed | Ordered by Cluster, Principal Component Removed
Radial tree layout
• Calculates coordinates for radial layout as presented in
  Bachmaier, Brandes and Schlieper (2005)

• The layout allows definition of each arc length

• Specific parameters of command radialtreeviz:
     –   Arc length property (-p) : Arc property defining arc length. Optional.
     –   Root vertex (-rootvertex) : Id of root vertex. The root vertex is placed in the middle of the
         screen. Due to the repositioning of the tree, nodes may be placed outside the canvas in other
         than the first network. Optional.
     –   Optimal rotation (-rotation) : Rotates layout to minimize sum of vertex distances between
         subsequent networks. Optional. By default 'false'.
     –   Scaling (-scale) : Scale of visualization: value/pixel.




Christian Bachmaier, Ulrik Brandes, and Barbara Schlieper (2005). Drawing Phylogenetic
Trees. Department of Computer & Information Science, University of Konstanz, Germany
Putting it all together
# build network from correlations
buildbycorrelationd -file daxreturns-2011.csv -missing Alert -savestdev -savereturns -preserve false

# calculate distance
corrdistance -p correlation -method gower

# calculate minimum spanning tree
minst -p corrdistance

# drop arcs not in MST
dropa -e minst=false

# calculate visualization distance as 1 - absolute correlation
calcap -e 1-abs(correlation) -saveas vizdistance

# create radial tree visualization
radialtreeviz -p vizdistance -vlabel vertex_id -vsize stdev -transition 3000 -ahover correlation -saveas daxviz-MST
Asset Trees
Size of node reflects volatility
(standard deviation) of returns

Links between nodes reflect
'backbone' correlations

- short link = high (absolute) correlation
- long link = low (absolute) correlation
Circle tree visualization
• Calculates coordinates for circle tree layout as presented in
  Bachmaier, Brandes and Schlieper (2005)

• As before, but instead of radialtreeviz:

circletreeviz -vlabel vertex_id -vsize stdev -transition 3000 -ahover correlation -saveas daxviz-MST-circle
Planar Maximally Filtered Graph
•   A complex graph with loops and cliques of up to 4 elements. It can be
    drawn on a planar surface without link crossings.

•   The MST is contained in the PMFG

Figure: node size scales with degree

M. Tumminello, T. Aste, T. Di Matteo and R. N. Mantegna (2005). A Tool for Filtering Information in
Complex Systems. PNAS vol. 102 no. 30 pp. 10421–10426
PMFG command
# build network from correlations
buildbycorrelationd -file daxreturns-2011.csv -missing Alert -savestdev -savereturns -preserve false

# calculate distance
corrdistance -p correlation -method gower

# calculate planar maximally filtered graph
pmfg -p corrdistance

# drop arcs not in PMFG
dropa -e pmfg=false

# calculate absolute correlation
calcap -e abs(correlation) -saveas vizdistance

# calculate degree
degree

# create network visualization
frviz -vlabel vertex_id -vsize stdev -atransparency vizdistance -ahover correlation -transition 3000 -arrows false -saveas daxviz-PMFG
Partial Correlation
• Measures the degree of association between two random variables
  after removing the effect of a set of controlling variables

• What is the direct relationship between Adidas and
  Allianz, controlling for BASF, BAYER, ... ?

• We build regression models for Adidas and Allianz on the other
  stocks and look at the correlation of their model residuals
  (i.e. what is left unexplained by the other factors) -> Partial correlation
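
The residual approach can be sketched in Python for the simplest case of a single controlling variable; the slide's setting controls for all other stocks at once, which would need a multiple regression. The function names and return series below are hypothetical.

```python
import math


def pearson(x, y):
    """Pearson correlation of two equally long series."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def residuals(y, z):
    """Residuals of a least-squares regression of y on z (with intercept)."""
    n = len(y)
    mean_y, mean_z = sum(y) / n, sum(z) / n
    beta = (sum((a - mean_z) * (b - mean_y) for a, b in zip(z, y))
            / sum((a - mean_z) ** 2 for a in z))
    alpha = mean_y - beta * mean_z
    return [b - (alpha + beta * a) for a, b in zip(z, y)]


def partial_correlation(x, y, z):
    """Correlation of x and y after regressing both on the control z."""
    return pearson(residuals(x, z), residuals(y, z))


# Hypothetical daily returns: two stocks and one controlling series.
stock_x = [0.02, -0.01, 0.03, 0.00, -0.02, 0.01]
stock_y = [0.01, -0.02, 0.02, 0.01, -0.01, 0.00]
control = [0.01, -0.01, 0.02, 0.00, -0.01, 0.01]
pcorr = partial_correlation(stock_x, stock_y, control)
```

With one control the result agrees with the closed-form expression (r_xy - r_xz r_yz) / sqrt((1 - r_xz^2)(1 - r_yz^2)), which gives a quick consistency check.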




Example
# build network from partial correlations
buildbypartialcorrelationd -file daxreturns-2011.csv -missing Alert -savestdev -preserve false

# show as heatmap
heatmap -sortv vertex_id -p partial_correlation -symmetric true -cellsizedefault 13 -transition 0 -cellhover partial_correlation -palette darkblue-lightgray-darkred -colordomain (-1)-1 -saveas daxheat-partial-Y
Figure panels: Correlations | Partial Correlations
NETS
•   Network Estimation for Time-Series

•   Forthcoming paper by Barigozzi
    and Brownlees

•   Estimates an unknown network
    structure from multivariate data

•   Captures both contemporaneous
    and serial dependence (partial
    correlations and lead/lag effects)
Correlation filtering

Balance between too much and too little
information

One of many methods to create networks
from correlation/distance matrices
   – PMFGs, Partial Correlation Networks, Influence Networks, Granger
     Causality, NETS, etc.

New models based on graph theory, information theory, economics
and statistics are being actively developed

Figure panels: PMFG | Influence Network
Sammon’s Projection
Proposed by John W. Sammon in IEEE Transactions on Computers 18: 401–409
(1969)

A nonlinear projection method to map a
high dimensional space onto a space of
lower dimensionality. Example:

Figure: projection with classes Iris Setosa, Iris Versicolor and Iris Virginica
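
The quantity that Sammon's method minimizes, usually called the stress, can be written as a short Python function. This sketch only evaluates the stress of a given projection; it does not perform the optimization, and the point sets are hypothetical.

```python
import math


def sammon_stress(high, low):
    """Sammon's stress between original points and their projection.

    E = (1 / sum d*_ij) * sum (d*_ij - d_ij)^2 / d*_ij over all pairs i < j,
    where d*_ij are distances in the original space and d_ij in the projection.
    Assumes all original points are distinct (d*_ij > 0).
    """
    total = 0.0
    error = 0.0
    n = len(high)
    for i in range(n):
        for j in range(i + 1, n):
            d_star = math.dist(high[i], high[j])
            d = math.dist(low[i], low[j])
            total += d_star
            error += (d_star - d) ** 2 / d_star
    return error / total


# Three points that already lie in a plane: dropping the third
# coordinate preserves every pairwise distance.
original = [(0.0, 0.0, 0.0), (3.0, 0.0, 0.0), (0.0, 4.0, 0.0)]
faithful = [(0.0, 0.0), (3.0, 0.0), (0.0, 4.0)]
squashed = [(0.0, 0.0), (3.0, 0.0), (0.0, 0.0)]  # third point collapsed

stress_faithful = sammon_stress(original, faithful)
stress_squashed = sammon_stress(original, squashed)
```

Dividing each squared error by the original distance gives more weight to preserving small distances, which is what distinguishes the method from a plain least-squares fit of distances.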
Example
# build network from correlations
buildbycorrelationd -file daxreturns-2011.csv -missing Alert -savestdev -savereturns -preserve false

# calculate distance
corrdistance -p correlation -method gower

# calculate Sammon layout
sammonlayouta -p corrdistance -saveerror true

# sum up error
sumaforv -p error -saveas error

# create Sammon visualization
sammonaviz -p corrdistance -vlabel vertex_id -vsize error -transition 3000 -ahover error -saveas daxviz-Sammon-Y
Node size reflects
error in layout
Tutorials
•   Tutorial 1 – Loading Networks into FNA
•   Tutorial 2 – Managing Data in FNA
•   Tutorial 3 – Network Summary Measures
•   Tutorial 4 – Centrality Measures
•   Tutorial 5 – Connectedness and Components
•   Tutorial 6 – Network Visualization
•   Tutorial 7 – Correlation Networks
•   Tutorial 8 – Payment System Simulations
•   Tutorial 9 – Analyzing Cross-Border Banking Exposures




Blog, Library and Demos at www.fna.fi




Dr. Kimmo Soramäki
kimmo@soramaki.net
Twitter: soramaki

