SlideShare a Scribd company logo
Dissecting Blockchain Price Analytics: What We
Learn from Geometry of Ethereum
Umar Islambekov
(Bowling Green State University)
joint work with
Y. Li, C. Ak¸cora, E. Smirnova, Y. Gel and M. Kantarcıo˘glu
October 6, 2019
SAMSI Blockchain Analytics Workshop
Partially supported by NSF DMS 1736417, ECCS 1824710, and IIS 1633331
1/19
Overview
Motivation
Intro to Topological Data Analysis (TDA)
Standard TDA pipeline
Method of Persistence Homology
Topological summaries
TDA for price anomaly detection in Ethereum network
Problem description and main idea
Methodology
Experimental results
2/19
Motivation. Challenges posed by Ethereum networks
Ethereum transaction graphs are very sparse and dynamic.
Conventional graph analytic tools may not be good indicators
of token activity.
Need to explore the link between token price dynamics and
topological structure underlying the transaction graph.
This structure can be described using TDA methods and
incorporated into statistical models for price dynamics.
3/19
Standard TDA pipeline in applications
4/19
Simplicial complexes
Definition 1 (Abstract simplicial complex)
Let X be a discrete set. An abstract simplicial complex is a
collection C of finite subsets of X such that if σ ∈ C then τ ∈ C for
all τ ⊆ σ. If |σ| = p + 1 then σ is called a p-simplex.
Simplicial complexes:
are higher-dimensional generalizations of graphs,
are both topological spaces and combinatorial objects.
identify topological features such as connected components,
loops and voids.
have simplices as their building blocks.
5/19
Vietoris-Rips complex
Vietoris-Rips (VR) is one of the most widely used simplicial
complexes in applications.
Definition 2 (Vietoris-Rips complex)
Let X be a discrete set in a metric space (M, ρ). A Vietoris-Rips
complex on X at scale ≥ 0 is defined as
VR = {σ | ρ(xi , xj ) ≤ for all xi , xj ∈ σ}
6/19
Homology
7/19
Why nested family of simplicial complexes? The method of
Persistence Homology
Simplicial complexes are constructed from a certain procedure
involving a scale parameter.
A key to obtain useful topological information is to avoid a
fixed value of the scale.
Instead, choose a range of increasing scale parameters.
Construct a simplicial complex for each scale and compute
topological summaries.
Keep track of topological features as they appear and
disappear.
Features persisting across many scales are likely to be true
features.
Features with short lifespans correspond to noise.
8/19
Persistence diagrams vs Betti sequences
A persistence diagram (PD) is a multiset of paired scale
values corresponding to the appearance and disappearance of
topological features.
Drawback: often cannot be directly incorporated into
statistical or machine learning methods for further analysis of
data.
Betti sequences consist of the counts of topological features
at increasing scale values.
Advantage: can easily be used in a variety of settings such as
statistical modeling, machine learning and functional data
analysis.
9/19
A simple example
10/19
TDA for price anomaly detection in Ethereum network
Given: Time evolving transaction network of an Ethereum
token and time series of the token price.
Nodes are user addresses.
Edge weights represent dissimilarity between nodes based on
transaction amount.
Goal: Predict days of future anomalous prices.
Main idea: Identify anomalous price events through
discerning changes in topology/geometry of the evolving
network.
11/19
Notion of data depth
Let X be a space of multivariate or functional data points
(e.g., Rm or C([a, b]))
Definition 3 (Data depth)
With respect to a probability distribution P, a data depth function
D : X → [0, 1] provides a center-outward ordering of points of X.
There are several data depth functions that yield different
such orderings.
Highest depth values =⇒ most central points.
Lowest depth values =⇒ most outlying points.
0 20 40 60 80 100
2.02.53.03.54.04.55.05.5
Modified Band Depth (MBD)
deepest sample
12/19
Methodology
Consider an evolving token transaction network over days
t = 1, 2, . . . , T.
Trim the network keeping only the top K nodes and compute
the associated Betti sequences {βp,t}T
t=1.
Employ a concept of functional data depth to track changes
in Betti sequences.
Calculate the modified band depth (MBD) (Pintado and
Romo, 2005) of each day’s Betti sequence with respect to
those of past w days:
RDw (βp,t) := MBD(βp,t|βp,t, βp,t−1, . . . βp,t−w+1).
13/19
Predictive machine learning models
Model Explanation F : Input
M1 Baseline Model PN, nE, nV , GC
M2 Betti 0 PN, nE, nV , GC, RD7(β0)
M3 Betti 0, 1 PN, nE, nV , GC, RD7(β0), RD7(β1)
M4 Full model PN, nE, nV , GC, RD7(β0), . . . , RD7(β2)
Token price return on day t: Rt = (Pt − Pt−1)/(Pt−1) .
Normalized token open price: PNt = Pt/ max{P1, . . . , PTk
}.
# of edges: nEt; # of nodes: nVt.
Average clustering coefficient: GCt.
If |Rt| ≥ δ, δ > 0, then day t is considered as anomalous.
Fit Random Forest models to data using 2/3 of a token’s
lifetime for training, and the remaining 1/3 for testing.
Examine performance for different prediction horizons h > 0.
14/19
Prediction results - Gain in accuracy
15/19
Gain in recall (sensitivity)
16/19
Gain in precision
17/19
References
Akcora C, Dey A, Gel Y, Kantarcioglu M, 2018. Forecasting bitcoin
price with graph chainlets. In PaKDD, 765–776.
Carlsson G, 2009. Topology and data. Bull. Amer. Math. Soc.
(N.S.) 46(2): 255–308.
Chazal F, Michel B, 2017. An introduction to topological data
analysis: fundamental and practical aspects for data scientists.
arXiv preprint arXiv:1710.04019 .
Dey A, Akcora C, Gel Y, Kantarcioglu M, 2018. On the role of
local blockchain network features in cryptocurrency price
formation. Under Submission : 1–35.
Maria C, Boissonnat J, Glisse M, Yvinec M, 2014. The gudhi
library: Simplicial complexes and persistent homology. In Int.
Congress on Mathematical Software, 167–174.
Pintado S, Romo J, 2005. Depth-based classification for functional
data 72.
18/19
Thank you very much for attending!
19/19

More Related Content

PPTX
2019 GDRR: Blockchain Data Analytics - ChainNet: Learning on Blockchain Graph...
PDF
2019 GDRR: Blockchain Data Analytics - Modeling Cryptocurrency Markets with T...
PDF
Object Detection Beyond Mask R-CNN and RetinaNet III
PDF
Webinar on Graph Neural Networks
PDF
Graph Convolutional Neural Networks
PPTX
Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...
PDF
Gnn overview
PDF
Mask R-CNN
2019 GDRR: Blockchain Data Analytics - ChainNet: Learning on Blockchain Graph...
2019 GDRR: Blockchain Data Analytics - Modeling Cryptocurrency Markets with T...
Object Detection Beyond Mask R-CNN and RetinaNet III
Webinar on Graph Neural Networks
Graph Convolutional Neural Networks
Introduction to Graph Neural Networks: Basics and Applications - Katsuhiko Is...
Gnn overview
Mask R-CNN

What's hot (20)

PPTX
Graph R-CNN for Scene Graph Generation
PPTX
Introduction to Graph neural networks @ Vienna Deep Learning meetup
PDF
Exploring Quantum Supremacy in Access Structures of Secret Sharing by Coding ...
PPTX
Graph Neural Network - Introduction
PDF
010_20160216_Variational Gaussian Process
PDF
CLIM Program: Remote Sensing Workshop, Optimization for Distributed Data Syst...
PDF
DAOR - Bridging the Gap between Community and Node Representations: Graph Emb...
PDF
Generative Models and Adversarial Training (D2L3 Insight@DCU Machine Learning...
PDF
Generative Adversarial Networks GAN - Santiago Pascual - UPC Barcelona 2018
PDF
How Powerful are Graph Networks?
PPTX
Digit recognizer by convolutional neural network
PDF
Methods of Manifold Learning for Dimension Reduction of Large Data Sets
PPTX
Summarizing videos with Attention
PDF
Training and Inference for Deep Gaussian Processes
PDF
Graph Neural Network in practice
PDF
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
PDF
Safety Verification of Deep Neural Networks_.pdf
PDF
Convolutional Neural Networks (D1L3 2017 UPC Deep Learning for Computer Vision)
PDF
Chromatic Sparse Learning
PDF
Bridging knowledge graphs_to_generate_scene_graphs
Graph R-CNN for Scene Graph Generation
Introduction to Graph neural networks @ Vienna Deep Learning meetup
Exploring Quantum Supremacy in Access Structures of Secret Sharing by Coding ...
Graph Neural Network - Introduction
010_20160216_Variational Gaussian Process
CLIM Program: Remote Sensing Workshop, Optimization for Distributed Data Syst...
DAOR - Bridging the Gap between Community and Node Representations: Graph Emb...
Generative Models and Adversarial Training (D2L3 Insight@DCU Machine Learning...
Generative Adversarial Networks GAN - Santiago Pascual - UPC Barcelona 2018
How Powerful are Graph Networks?
Digit recognizer by convolutional neural network
Methods of Manifold Learning for Dimension Reduction of Large Data Sets
Summarizing videos with Attention
Training and Inference for Deep Gaussian Processes
Graph Neural Network in practice
Variational Autoencoders VAE - Santiago Pascual - UPC Barcelona 2018
Safety Verification of Deep Neural Networks_.pdf
Convolutional Neural Networks (D1L3 2017 UPC Deep Learning for Computer Vision)
Chromatic Sparse Learning
Bridging knowledge graphs_to_generate_scene_graphs
Ad

Similar to 2019 GDRR: Blockchain Data Analytics - Dissecting Blockchain Price Analytics: What We Learn from the Geometry of Ethereum - Umar Islambekov, October 6, 2019 (20)

PDF
PhD defense talk slides
PDF
GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Comple...
PDF
T24144148
PPT
Aggregation computation over distributed data streams(the final version)
PPTX
A general multiobjective clustering approach based on multiple distance measures
PDF
Iaetsd a novel scheduling algorithms for mimo based wireless networks
PDF
Performance Analysis of Differential Beamforming in Decentralized Networks
PDF
InternshipReport
PDF
Tessellations and Sweeping Nets: Advancing the Calculus of Geometric Logic
PDF
Iterative Soft Decision Based Complex K-best MIMO Decoder
PDF
Iterative Soft Decision Based Complex K-best MIMO Decoder
PDF
Increasing electrical grid stability classification performance using ensemb...
PDF
Er24902905
PDF
Paper_KST
PDF
Paper id 26201482
PDF
Fixed Point Realization of Iterative LR-Aided Soft MIMO Decoding Algorithm
PDF
Uncertain Volatility Models
PDF
9517cnc05
PPTX
R Language Introduction
PhD defense talk slides
GASGD: Stochastic Gradient Descent for Distributed Asynchronous Matrix Comple...
T24144148
Aggregation computation over distributed data streams(the final version)
A general multiobjective clustering approach based on multiple distance measures
Iaetsd a novel scheduling algorithms for mimo based wireless networks
Performance Analysis of Differential Beamforming in Decentralized Networks
InternshipReport
Tessellations and Sweeping Nets: Advancing the Calculus of Geometric Logic
Iterative Soft Decision Based Complex K-best MIMO Decoder
Iterative Soft Decision Based Complex K-best MIMO Decoder
Increasing electrical grid stability classification performance using ensemb...
Er24902905
Paper_KST
Paper id 26201482
Fixed Point Realization of Iterative LR-Aided Soft MIMO Decoding Algorithm
Uncertain Volatility Models
9517cnc05
R Language Introduction
Ad

More from The Statistical and Applied Mathematical Sciences Institute (20)

PDF
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
PDF
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
PDF
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
PDF
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
PDF
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
PDF
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
PPTX
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
PDF
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
PDF
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
PPTX
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
PDF
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
PDF
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
PDF
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
PDF
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
PDF
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
PDF
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
PPTX
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
PPTX
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
PDF
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
PDF
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...
Causal Inference Opening Workshop - Latent Variable Models, Causal Inference,...
2019 Fall Series: Special Guest Lecture - 0-1 Phase Transitions in High Dimen...
Causal Inference Opening Workshop - Causal Discovery in Neuroimaging Data - F...
Causal Inference Opening Workshop - Smooth Extensions to BART for Heterogeneo...
Causal Inference Opening Workshop - A Bracketing Relationship between Differe...
Causal Inference Opening Workshop - Testing Weak Nulls in Matched Observation...
Causal Inference Opening Workshop - Difference-in-differences: more than meet...
Causal Inference Opening Workshop - New Statistical Learning Methods for Esti...
Causal Inference Opening Workshop - Bipartite Causal Inference with Interfere...
Causal Inference Opening Workshop - Bridging the Gap Between Causal Literatur...
Causal Inference Opening Workshop - Some Applications of Reinforcement Learni...
Causal Inference Opening Workshop - Bracketing Bounds for Differences-in-Diff...
Causal Inference Opening Workshop - Assisting the Impact of State Polcies: Br...
Causal Inference Opening Workshop - Experimenting in Equilibrium - Stefan Wag...
Causal Inference Opening Workshop - Targeted Learning for Causal Inference Ba...
Causal Inference Opening Workshop - Bayesian Nonparametric Models for Treatme...
2019 Fall Series: Special Guest Lecture - Adversarial Risk Analysis of the Ge...
2019 Fall Series: Professional Development, Writing Academic Papers…What Work...
2019 GDRR: Blockchain Data Analytics - Machine Learning in/for Blockchain: Fu...
2019 GDRR: Blockchain Data Analytics - QuTrack: Model Life Cycle Management f...

Recently uploaded (20)

PPTX
TOTAL hIP ARTHROPLASTY Presentation.pptx
PDF
Biophysics 2.pdffffffffffffffffffffffffff
PDF
Sciences of Europe No 170 (2025)
PPT
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
PPTX
Comparative Structure of Integument in Vertebrates.pptx
PPTX
Taita Taveta Laboratory Technician Workshop Presentation.pptx
PPTX
INTRODUCTION TO EVS | Concept of sustainability
PPTX
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
PPTX
7. General Toxicologyfor clinical phrmacy.pptx
PDF
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
PPTX
Introduction to Cardiovascular system_structure and functions-1
PDF
Phytochemical Investigation of Miliusa longipes.pdf
PPTX
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
PPTX
microscope-Lecturecjchchchchcuvuvhc.pptx
PPTX
2. Earth - The Living Planet earth and life
PPTX
ECG_Course_Presentation د.محمد صقران ppt
PPTX
Derivatives of integument scales, beaks, horns,.pptx
PPTX
neck nodes and dissection types and lymph nodes levels
PPTX
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
PPTX
2Systematics of Living Organisms t-.pptx
TOTAL hIP ARTHROPLASTY Presentation.pptx
Biophysics 2.pdffffffffffffffffffffffffff
Sciences of Europe No 170 (2025)
The World of Physical Science, • Labs: Safety Simulation, Measurement Practice
Comparative Structure of Integument in Vertebrates.pptx
Taita Taveta Laboratory Technician Workshop Presentation.pptx
INTRODUCTION TO EVS | Concept of sustainability
ANEMIA WITH LEUKOPENIA MDS 07_25.pptx htggtftgt fredrctvg
7. General Toxicologyfor clinical phrmacy.pptx
VARICELLA VACCINATION: A POTENTIAL STRATEGY FOR PREVENTING MULTIPLE SCLEROSIS
Introduction to Cardiovascular system_structure and functions-1
Phytochemical Investigation of Miliusa longipes.pdf
DRUG THERAPY FOR SHOCK gjjjgfhhhhh.pptx.
microscope-Lecturecjchchchchcuvuvhc.pptx
2. Earth - The Living Planet earth and life
ECG_Course_Presentation د.محمد صقران ppt
Derivatives of integument scales, beaks, horns,.pptx
neck nodes and dissection types and lymph nodes levels
G5Q1W8 PPT SCIENCE.pptx 2025-2026 GRADE 5
2Systematics of Living Organisms t-.pptx

2019 GDRR: Blockchain Data Analytics - Dissecting Blockchain Price Analytics: What We Learn from the Geometry of Ethereum - Umar Islambekov, October 6, 2019

  • 1. Dissecting Blockchain Price Analytics: What We Learn from Geometry of Ethereum Umar Islambekov (Bowling Green State University) joint work with Y. Li, C. Ak¸cora, E. Smirnova, Y. Gel and M. Kantarcıo˘glu October 6, 2019 SAMSI Blockchain Analytics Workshop Partially supported by NSF DMS 1736417, ECCS 1824710, and IIS 1633331 1/19
  • 2. Overview Motivation Intro to Topological Data Analysis (TDA) Standard TDA pipeline Method of Persistence Homology Topological summaries TDA for price anomaly detection in Ethereum network Problem description and main idea Methodology Experimental results 2/19
  • 3. Motivation. Challenges posed by Ethereum networks Ethereum transaction graphs are very sparse and dynamic. Conventional graph analytic tools may not be good indicators of token activity. Need to explore the link between token price dynamics and topological structure underlying the transaction graph. This structure can be described using TDA methods and incorporated into statistical models for price dynamics. 3/19
  • 4. Standard TDA pipeline in applications 4/19
  • 5. Simplicial complexes Definition 1 (Abstract simplicial complex) Let X be a discrete set. An abstract simplicial complex is a collection C of finite subsets of X such that if σ ∈ C then τ ∈ C for all τ ⊆ σ. If |σ| = p + 1 then σ is called a p-simplex. Simplicial complexes: are higher-dimensional generalizations of graphs, are both topological spaces and combinatorial objects. identify topological features such as connected components, loops and voids. have simplices as their building blocks. 5/19
  • 6. Vietoris-Rips complex Vietoris-Rips (VR) is one of the most widely used simplicial complexes in applications. Definition 2 (Vietoris-Rips complex) Let X be a discrete set in a metric space (M, ρ). A Vietoris-Rips complex on X at scale ≥ 0 is defined as VR = {σ | ρ(xi , xj ) ≤ for all xi , xj ∈ σ} 6/19
  • 8. Why nested family of simplicial complexes? The method of Persistence Homology Simplicial complexes are constructed from a certain procedure involving a scale parameter. A key to obtain useful topological information is to avoid a fixed value of the scale. Instead, choose a range of increasing scale parameters. Construct a simplicial complex for each scale and compute topological summaries. Keep track of topological features as they appear and disappear. Features persisting across many scales are likely to be true features. Features with short lifespans correspond to noise. 8/19
  • 9. Persistence diagrams vs Betti sequences A persistence diagram (PD) is a multiset of paired scale values corresponding to the appearance and disappearance of topological features. Drawback: often cannot be directly incorporated into statistical or machine learning methods for further analysis of data. Betti sequences consist of the counts of topological features at increasing scale values. Advantage: can easily be used in a variety of settings such as statistical modeling, machine learning and functional data analysis. 9/19
  • 11. TDA for price anomaly detection in Ethereum network Given: Time evolving transaction network of an Ethereum token and time series of the token price. Nodes are user addresses. Edge weights represent dissimilarity between nodes based on transaction amount. Goal: Predict days of future anomalous prices. Main idea: Identify anomalous price events through discerning changes in topology/geometry of the evolving network. 11/19
  • 12. Notion of data depth Let X be a space of multivariate or functional data points (e.g., Rm or C([a, b])) Definition 3 (Data depth) With respect to a probability distribution P, a data depth function D : X → [0, 1] provides a center-outward ordering of points of X. There are several data depth functions that yield different such orderings. Highest depth values =⇒ most central points. Lowest depth values =⇒ most outlying points. 0 20 40 60 80 100 2.02.53.03.54.04.55.05.5 Modified Band Depth (MBD) deepest sample 12/19
  • 13. Methodology Consider an evolving token transaction network over days t = 1, 2, . . . , T. Trim the network keeping only the top K nodes and compute the associated Betti sequences {βp,t}T t=1. Employ a concept of functional data depth to track changes in Betti sequences. Calculate the modified band depth (MBD) (Pintado and Romo, 2005) of each day’s Betti sequence with respect to those of past w days: RDw (βp,t) := MBD(βp,t|βp,t, βp,t−1, . . . βp,t−w+1). 13/19
  • 14. Predictive machine learning models Model Explanation F : Input M1 Baseline Model PN, nE, nV , GC M2 Betti 0 PN, nE, nV , GC, RD7(β0) M3 Betti 0, 1 PN, nE, nV , GC, RD7(β0), RD7(β1) M4 Full model PN, nE, nV , GC, RD7(β0), . . . , RD7(β2) Token price return on day t: Rt = (Pt − Pt−1)/(Pt−1) . Normalized token open price: PNt = Pt/ max{P1, . . . , PTk }. # of edges: nEt; # of nodes: nVt. Average clustering coefficient: GCt. If |Rt| ≥ δ, δ > 0, then day t is considered as anomalous. Fit Random Forest models to data using 2/3 of a token’s lifetime for training, and the remaining 1/3 for testing. Examine performance for different prediction horizons h > 0. 14/19
  • 15. Prediction results - Gain in accuracy 15/19
  • 16. Gain in recall (sensitivity) 16/19
  • 18. References Akcora C, Dey A, Gel Y, Kantarcioglu M, 2018. Forecasting bitcoin price with graph chainlets. In PaKDD, 765–776. Carlsson G, 2009. Topology and data. Bull. Amer. Math. Soc. (N.S.) 46(2): 255–308. Chazal F, Michel B, 2017. An introduction to topological data analysis: fundamental and practical aspects for data scientists. arXiv preprint arXiv:1710.04019 . Dey A, Akcora C, Gel Y, Kantarcioglu M, 2018. On the role of local blockchain network features in cryptocurrency price formation. Under Submission : 1–35. Maria C, Boissonnat J, Glisse M, Yvinec M, 2014. The gudhi library: Simplicial complexes and persistent homology. In Int. Congress on Mathematical Software, 167–174. Pintado S, Romo J, 2005. Depth-based classification for functional data 72. 18/19
  • 19. Thank you very much for attending! 19/19