Additional Descriptive Statistics methods for Data Analysis / Maxim Neronov (Nexters)

Statistics in

Data Analysis

Additional methods in data analysis or
something useful you never knew exists
Maxim Neronov – Nexters Data Analyst
1

Work Experience
2 years as Game-Designer

4 years as Data Analyst
2
Statistics in Data Analysis
About Speaker
B.S. in Applied Mathematics

M.A. in Computer Science
Education
About Speaker

3
Statistics in Data Analysis
Main Themes
What are we actually talking about?
Stochastic dominance – can there be an order?

Empirical Distribution Functions and their estimations

An example of consequent tests with function of
random variables

Stochastic Dominance
Is there any way to order randomness?

5
Total Order is an instrument to

compare things
Random Variables & Stochastic processes
One random variable is stochastically dominant over another if probability of an event
ξ > x is larger or equal than v > x for all x
What are we actually talking about?
What is an order?

6
Simple Example
Which means that both mean and median
of
fi
rst distribution are less than those from
second distribution
Let’s see how it works with two Normal
Distributions with same variance, but
different expected values

7
Statistical Significance
1. Data is random → there may be variance in what we look at
2. We need a stable algorithm to prove that there is signi
fi
cance in dominance
3. Somehow we all familiar with this statistical test
«Let x and y be two random variables having continuous С.D.F. f and g
respectively. The variable x will be called stochastically smaller than y if
f(a) > g(a) for every a. We wish to test the hypothesis f = g against the
alternative that x is stochastically smaller than y»
H. B. Mann and D. R. Whitney – 1947

8
Not so Simple Example
Mean Median
Exponential 1.9795 1.3863
Normal 1.7816 1.78
Mean is larger
Median is larger

Test H1 p-value Result
Mann-Whitney Less 10-6 Less
Student t-test Less 0.999 Greater
KS-test
Less 10-6
Both
Greater 10-6
9
Comparing Exponential sample to Normal

We got:

– Sample mean is greater or equal

– Mann Whitney test called exponential
sample stochastically less

– Kolmogorov-Smirnov test says it both
greater and less
1. We can detect difference in mean

2. More elements are «less»

3. Both data samples are dominant on
the different part of axis

10
Although you can describe the null hypothesis in terms of dominance or mean
difference, it most often will be dif
fi
cult to say how it affect your product. In general,
Mann-Whitney detect dominance in terms of «most elements» being larger/smaller
Do not compare things that differ in distribution shape

11
How to use
* Python has a continuity correction for discrete values
When the metrics are continuous and «Stable»*

If the experiment has little effect on variance

Always look at the ECDF curve before conducting any kind of
analysis

Mann-Whitney U-test tells you which sample has more
element on the right/left side

2 sample KS-test tells you if there is difference at all

13
CDF & ECDF Estimation
It’s nice to have ECDF

We know how to work with pointwise estimation (mean, variance)

Can we do something more?

14
Kolmogorov Statistic
The maximum difference between CDF and
ECDF is de
fi
ned as Kolmogorov-Smirnov statistic

15
Kolmogorov Statistic
The maximum difference between CDF and
ECDF is de
fi
ned as Kolmogorov-Smirnov statistic

16
DKW – Inequality
KS test also may be used for testing hypothesis like
F(x) ≤ G(x). But more importantly it opens new ideas to
estimate the borders of ECDF
DKW Inequality allows such borders
When to use

Sometimes we change not only the mean/median/variance
but the nature of some events

17
Case with in-game mechanics
Let’s see an example where we somehow changed the arena

Any user can take up to 5 battles per day
How do we check the difference and know where it happened?
Mean Median
Sample 1 2.262 2
Sample 2 2.331 2

18
Let’s see an example where we somehow changed the arena

Any user can take up to 5 battles per day
How do we check the difference and know where it happened?
Mean Median
Sample 1 2.262 2
Sample 2 2.331 2

19
Let’s use a DKW-inequality to build bandwidth Con
fi
dence Intervals
We can see difference on the
fi
rst step as well as on the second one due to
the fact of ECDF growth

20
General Advice
Can be used to any kind of variables

Doesn’t give any specific answers about mean/mediaEasy to
calculate

Easy to visualise/explain

Would not replace statistical testing

Works good with mechanics and economy metrics

Consequent Testing
What to do with a functions of R.V.?

22
Problem
Test 1 Test 2 Test 3 Test N
A / B A / B A / B A / B
We want to conduct a series of consequent tests
For every test group B is much worse from the start and we don’t need to
test. But we are making small differences and trying to converge the
group B to group A
The questions we want to seek answer for

1. Does the metrics closes in gap between iterations

2. How much do we need to improve

3. How many more iterations we need to conduct

23
Problem
Test 1
Test 2
A – Tutorial: 55.8%

B – Tutorial: 47.4%

In both tests the difference in tutorial is signi
fi
cant

But is 0.892 signi
fi
cantly larger than 0.849 and we are going in the right direction?
Ratio B/A
Test 1 0.849
Test 2 0.892

24
Problem
Why is this even hard?
How do we
fi
nd the unknown
distribution F which is the function of
Random Variables?
What we have researched

1. Fieller’s Theorem

2. Bootstrapping

3. Delta Method

4. Analytical research of Ratio
Distributions

25
Solution
Farrington-Manning Test
How to apply

1. Let the B conversion a p1
2. Let the A conversion a p2
3. De
fi
ne r as the ratio from previous
test

4. Conduct one-sided test with greater
alternative hypothesis
Pros

- No additional assumptions

- Directly solves the problem

- Described by an article

- Has an answer for sample size
Cons

- s & r are constants

- No existing implementation in
Python

- The article itself was hard to
fi
nd

26
Back to the case
Test 1
Test 2



27
General Advice
Bootstrap is a good instrument, but sometimes you can
solve the problem directly

Look for the science articles or popular library packages

In case of binomial ratio – use Farrington-Manning Test

29
Source
- Wikipedia

- М.Б. Лагутин: «Наглядная Математическая Статистика» (глава 14)

- H.B. Mann, D.R. Whitney: «On a Test of Whether one of Two Random
Variables is Stochastically Larger than the Other» (1947, DOI: 10.1214/aoms/
1177730491)

- Fieller’s Theorem (wikipedia)

- Ratio Distribution (wikipedia)

- Delta Method (wikipedia)

- Con
fi
dence Intervals for a Ratio of Binomial Proportions Based on Direct and
Inverse Sampling Schemes (2016, DOI: 10.1134/S1995080216040132)

- Farrington-Manning: «Test Statistics and Sample Size Formulae for
Comparative Binomial Trials» (1990, DOI:10.1002/sim.1242)

Additional Descriptive Statistics methods for Data Analysis / Maxim Neronov (Nexters)

More Related Content

Similar to Additional Descriptive Statistics methods for Data Analysis / Maxim Neronov (Nexters) (20)

More from DevGAMM Conference (20)

Recently uploaded (20)

Additional Descriptive Statistics methods for Data Analysis / Maxim Neronov (Nexters)