SlideShare a Scribd company logo
Statistical Tests
Data Analysis
Statistics - a powerful tool for analyzing data
1. Descriptive Statistics - provide an overview
of the attributes of a data set. These include
measurements of central tendency (frequency
histograms, mean, median, & mode) and
dispersion (range, variance & standard
deviation)
2. Inferential Statistics - provide measures of how
well your data support your hypothesis and if
your data are generalizable beyond what was
tested (significance tests)
Inferential Statistics
2 4 10 4 6 8 7 10 4 3 7 9 6 7 5 2 5 8 2 10
7 2 3 5 2 9 3 9 6 1 4 2 6 4 9 3 4 1 8 7
9 1 8 1 10 10 6 4 2 7 1 1 9 10 4 4 6 6 2 5
9 10 2 6 8 10 1 6 10 10 4 4 4 9 2 1 4 5 9 6
6 2 7 8 8 6 6 10 6 6 7 5 9 2 6 4 8 6 6 10
5 7 1 9 1 10 8 8 5 10 1 4 8 3 6 7 1 5 2 4
4 10 5 8 5 1 1 4 3 6 7 3 1 5 4 3 6 2 7 8
3 3 6 6 2 8 6 5 9 8 4 6 3 8 3 3 10 8 10 5
7 5 1 4 3 2 1 10 2 10 6 10 7 9 8 8 4 9 9 10
3 7 6 2 1 1 10 3 5 7 4 1 2 9 10 10 6 1 3 2
1 3 9 9 4 2 2 2 1 8 3 1 5 9 9 8 3 2 5 4
4 2 3 10 8 2 3 4 1 3 3 2 10 10 5 7 3 3 10 1
5 7 5 1 2 5 8 7 3 8 9 2 10 8 1 1 5 3 3 7
6 7 9 8 8 4 9 8 4 3 10 8 10 4 10 2 3 5 6 3
1 9 8 1 10 2 3 1 6 3 8 9 6 2 4 4 2 7 8 4
4 4 4 10 8 5 9 3 10 5 3 6 9 3 7 4 2 3 10 2
5 1 6 8 5 6 8 1 8 5 7 6 4 1 2 7 2 9 5 3
8 2 3 2 9 9 1 1 5 7 8 5 6 3 8 5 4 10 6 9
5 1 10 10 5 1 4 3 2 3 6 9 10 2 6 3 1 2 8 6
1 8 7 8 5 3 7 2 4 1 8 9 10 10 5 1 3 6 5 8
3 3 8 8 2 7 1 6 9 8 2 10 3 7 9 2 1 9 7 7
3 1 9 6 8 2 6 4 6 3 7 10 9 6 1 10 7 5 3 10
1 6 5 4 3 2 4 4 1 5 5 10 6 2 1 1 1 5 6 3
8 10 8 10 9 7 7 7 8 4 8 1 3 5 8 1 8 4 4 6
4 7 2 4 9 1 8 5 3 3 5 10 1 4 6 3 3 8 2 2
The Population: =5.314
Population size = 500
2 4 10 4 6 8 7 10 4 3 7 9 6 7 5 2 5 8 2 10
7 2 3 5 2 9 3 9 6 1 4 2 6 4 9 3 4 1 8 7
9 1 8 1 10 10 6 4 2 7 1 1 9 10 4 4 6 6 2 5
9 10 2 6 8 10 1 6 10 10 4 4 4 9 2 1 4 5 9 6
6 2 7 8 8 6 6 10 6 6 7 5 9 2 6 4 8 6 6 10
5 7 1 9 1 10 8 8 5 10 1 4 8 3 6 7 1 5 2 4
4 10 5 8 5 1 1 4 3 6 7 3 1 5 4 3 6 2 7 8
3 3 6 6 2 8 6 5 9 8 4 6 3 8 3 3 10 8 10 5
7 5 1 4 3 2 1 10 2 10 6 10 7 9 8 8 4 9 9 10
3 7 6 2 1 1 10 3 5 7 4 1 2 9 10 10 6 1 3 2
1 3 9 9 4 2 2 2 1 8 3 1 5 9 9 8 3 2 5 4
4 2 3 10 8 2 3 4 1 3 3 2 10 10 5 7 3 3 10 1
5 7 5 1 2 5 8 7 3 8 9 2 10 8 1 1 5 3 3 7
6 7 9 8 8 4 9 8 4 3 10 8 10 4 10 2 3 5 6 3
1 9 8 1 10 2 3 1 6 3 8 9 6 2 4 4 2 7 8 4
4 4 4 10 8 5 9 3 10 5 3 6 9 3 7 4 2 3 10 2
5 1 6 8 5 6 8 1 8 5 7 6 4 1 2 7 2 9 5 3
8 2 3 2 9 9 1 1 5 7 8 5 6 3 8 5 4 10 6 9
5 1 10 10 5 1 4 3 2 3 6 9 10 2 6 3 1 2 8 6
1 8 7 8 5 3 7 2 4 1 8 9 10 10 5 1 3 6 5 8
3 3 8 8 2 7 1 6 9 8 2 10 3 7 9 2 1 9 7 7
3 1 9 6 8 2 6 4 6 3 7 10 9 6 1 10 7 5 3 10
1 6 5 4 3 2 4 4 1 5 5 10 6 2 1 1 1 5 6 3
8 10 8 10 9 7 7 7 8 4 8 1 3 5 8 1 8 4 4 6
4 7 2 4 9 1 8 5 3 3 5 10 1 4 6 3 3 8 2 2
The Sample: 7, 6, 4, 9, 8, 3, 2, 6, 1
mean = 5.111
The Population: =5.314
2 4 10 4 6 8 7 10 4 3 7 9 6 7 5 2 5 8 2 10
7 2 3 5 2 9 3 9 6 1 4 2 6 4 9 3 4 1 8 7
9 1 8 1 10 10 6 4 2 7 1 1 9 10 4 4 6 6 2 5
9 10 2 6 8 10 1 6 10 10 4 4 4 9 2 1 4 5 9 6
6 2 7 8 8 6 6 10 6 6 7 5 9 2 6 4 8 6 6 10
5 7 1 9 1 10 8 8 5 10 1 4 8 3 6 7 1 5 2 4
4 10 5 8 5 1 1 4 3 6 7 3 1 5 4 3 6 2 7 8
3 3 6 6 2 8 6 5 9 8 4 6 3 8 3 3 10 8 10 5
7 5 1 4 3 2 1 10 2 10 6 10 7 9 8 8 4 9 9 10
3 7 6 2 1 1 10 3 5 7 4 1 2 9 10 10 6 1 3 2
1 3 9 9 4 2 2 2 1 8 3 1 5 9 9 8 3 2 5 4
4 2 3 10 8 2 3 4 1 3 3 2 10 10 5 7 3 3 10 1
5 7 5 1 2 5 8 7 3 8 9 2 10 8 1 1 5 3 3 7
6 7 9 8 8 4 9 8 4 3 10 8 10 4 10 2 3 5 6 3
1 9 8 1 10 2 3 1 6 3 8 9 6 2 4 4 2 7 8 4
4 4 4 10 8 5 9 3 10 5 3 6 9 3 7 4 2 3 10 2
5 1 6 8 5 6 8 1 8 5 7 6 4 1 2 7 2 9 5 3
8 2 3 2 9 9 1 1 5 7 8 5 6 3 8 5 4 10 6 9
5 1 10 10 5 1 4 3 2 3 6 9 10 2 6 3 1 2 8 6
1 8 7 8 5 3 7 2 4 1 8 9 10 10 5 1 3 6 5 8
3 3 8 8 2 7 1 6 9 8 2 10 3 7 9 2 1 9 7 7
3 1 9 6 8 2 6 4 6 3 7 10 9 6 1 10 7 5 3 10
1 6 5 4 3 2 4 4 1 5 5 10 6 2 1 1 1 5 6 3
8 10 8 10 9 7 7 7 8 4 8 1 3 5 8 1 8 4 4 6
4 7 2 4 9 1 8 5 3 3 5 10 1 4 6 3 3 8 2 2
The Sample: 1, 5, 8, 7, 4, 1, 6, 6
mean = 4.75
The Population: =5.314
Parametric or Non-parametric?
•Parametric tests are restricted to data that:
1) show a normal distribution
2) * are independent of one another
3) * are on the same continuous scale of measurement
•Non-parametric tests are used on data that:
1) show an other-than normal distribution
2) are dependent or conditional on one another
3) in general, do not have a continuous scale of
measurement
e.g., the length and weight of something –> parametric
vs.
did the bacteria grow or not grow –> non-parametric
The First Question
After examining your data, ask: does what you're testing
seem to be a question of relatedness or a question of
difference?
If relatedness (between your control and your experimental
samples or between you dependent and independent variable),
you will be using tests for correlation (positive or negative)
or regression.
If difference (your control differs from your experimental),
you will be testing for independence between distributions,
means or variances. Different tests will be employed if
your data show parametric or non-parametric properties.
See Flow Chart on page 50 of HBI.
Inferential stat tests samples discuss 4
Tests for Differences
• Between Means
- t-Test - P
- ANOVA - P
- Friedman Test
- Kruskal-Wallis Test
- Sign Test
- Rank Sum Test
• Between Distributions
- Chi-square for goodness of fit
- Chi-square for independence
• Between Variances
- F-Test – P
P – parametric tests
Differences Between Means
Asks whether samples come from populations with
different means
Null Hypothesis Alternative Hypothesis
A
Y
B CA
Y
B C
There are different tests if you have 2 vs more than 2 samples
Differences Between Means – Parametric
Data
t-Tests compare the means of two parametric samples
E.g. Is there a difference in the mean height of men and
women?
HBI: t-Test
Excel: t-Test (paired and unpaired) – in Tools – Data
Analysis
A researcher compared the height of plants grown in high
and low light levels. Her results are shown below. Use a
T-test to determine whether there is a statistically
significant difference in the heights of the two groups
Low Light High Light
49 45
31 40
43 59
31 58
40 55
44 50
49 46
48 53
33 43
Differences Between Means – Parametric
Data
ANOVA (Analysis of Variance) compares the means of
two or more parametric samples.
E.g. Is there a difference in the mean height of plants
grown under red, green and blue light?
HBI: ANOVA
Excel: ANOVA – check type under Tools – Data Analysis
weight of pigs fed different foods
food 1 food 2 food 3 food 4
60.8 68.7 102.6 87.9
57.0 67.7 102.1 84.2
65.0 74.0 100.2 83.1
58.6 66.3 96.5 85.7
61.7 69.8 90.3
A researcher fed pigs on four different foods. At the end
of a month feeding, he weighed the pigs. Use an ANOVA
test to determine if the different foods resulted in
differences in growth of the pigs.
Aplysia punctata – the sea hare
Aplysia parts
Differences Between Means – Non-
Parametric Data
The Sign Test compares the means of two “paired”, non-
parametric samples
E.g. Is there a difference in the gill withdrawal response of
Aplysia in night versus day? Each subject has been tested
once at night and once during the day –> paired data.
HBI: Sign Test
Excel: N/A
Subject
Night
Response
Day
Response
1 2 5
2 1 3
3 2 2
Inferential stat tests samples discuss 4
The Friedman Test is like the Sign test, (compares the
means of “paired”, non-parametric samples) for more than
two samples.
E.g. Is there a difference in the gill withdrawal response of
Aplysia between morning, afternoon and evening? Each
subject has been tested once during each time period –>
paired data
HBI: Friedman Test
Excel: N/A
Subject
Morning
Response
Afternoon
Response
Evening.
Response
1 4 3 2
2 5 2 1
3 3 4 3
Differences Between Means – Non-
Parametric Data
Inferential stat tests samples discuss 4
The Rank Sum test compares the means of two non-
parametric samples
E.g. Is there a difference in the gill withdrawal response of
Aplysia in night versus day? Each subject has been tested
once, either during the night or during the day –> unpaired
data.
HBI: Rank Sum
Excel: N/A
Subject
Night
Response
Day
Response
1 5
2 1
3 2
4 3
5 4
6 1
7 5
Differences Between Means – Non-
Parametric Data
Inferential stat tests samples discuss 4
The Kruskal-Wallis Test compares the means of more
than two non-parametric, non-paired samples
E.g. Is there a difference in the gill withdrawal response of
Aplysia in night versus day? Each subject has been tested
once, either during the morning, afternoon or evening –>
unpaired data.
HBI: Kruskal-Wallis Test
Excel: N/A
Differences Between Means – Non-
Parametric Data
Subject
Morning
Response
Afternoon
Response
Evening.
Response
1 4
2 5
3 4
4 3
5 2
6 3
Inferential stat tests samples discuss 4
Chi square tests compare observed frequency
distributions, either to theoretical expectations or to other
observed frequency distributions.
Differences Between Distributions
Differences Between Distributions
E.g. The F2 generation of a cross between a round pea
and a wrinkled pea produced 72 round individuals and 20
wrinkled individuals. Does this differ from the expected 3:1
round:wrinkled ratio of a simple dominant trait?
HBI: Chi-Square One Sample Test (goodness of fit)
Excel: Chitest – under Function Key – Statistical
Smooth
Frequency
Wrinkled
E
E
E.g. 67 out of 100 seeds placed in plain water germinated
while 36 out of 100 seeds placed in “acid rain” water
germinated. Is there a difference in the germination rate?
HBI: Chi-Square Two or More Sample Test (independence)
Excel: Chitest – under Function key - Statistical
Plain Acid Plain
Proportion
Germination Acid
Proportion
Germination
Null Hypothesis
Alternative Hypothesis
Differences Between Distributions
Correlations look for relationships between two variables
which may not be functionally related. The variables may
be ordinal, interval, or ratio scale data. Remember,
correlation does not prove causation; thus there may not
be a cause and effect relationship between the variables.
E.g. Do species of birds with longer wings also have
longer necks?
HBI: Spearman’s Rank Correlation (NP)
Excel: Correlation (P)
Correlation
Question – is there a relationship between students aptitude
for mathemathics and for biology?
Student Math score Math Rank Biol. score Biology rank
1 57 3 83 7
2 45 1 37 1
3 72 7 41 2
4 78 8 84 8
5 53 2 56 3
6 63 5 85 9
7 86 9 77 6
8 98 10 87 10
9 59 4 70 5
10 71 6 59 4
Inferential stat tests samples discuss 4
Regressions look for functional relationships between two
continuous variables. A regression assumes that a
change in X causes a change in Y.
E.g. Does an increase in light intensity cause an increase
in plant growth?
HBI: Regression Analysis (P)
Excel: Regression (P)
Regression
Correlation & Regression
Looks for relationships between two continuous variables
Null Hypothesis Alternative Hypothesis
X
Y
X
Y
Is there a relationship between wing length and
tail length in songbirds?
wing length cm tail length cm
10.4 7.4
10.8 7.6
11.1 7.9
10.2 7.2
10.3 7.4
10.2 7.1
10.7 7.4
10.5 7.2
10.8 7.8
11.2 7.7
10.6 7.8
11.4 8.3
Is there a relationship between age and systolic
blood pressure?
Age (yr) systolic blood pressure
mm hg
30 108
30 110
30 106
40 125
40 120
40 118
40 119
50 132
50 137
50 134
60 148
60 151
60 146
60 147
60 144
70 162
70 156
70 164
70 158
70 159

More Related Content

PDF
Irreversible: The Public Relations Big Data Revolution
PDF
Election 2016: Mobile Activity in the U.S. on Election Day
PDF
Your App Metrics Are Telling You How To Make Them Better: The Digits Data Report
PPT
Parametric and non parametric test
PPTX
4.4 M. Ángeles Rol
PPTX
Good Enough Analytics
PPT
Gone Shopping: detailed retail mapping
PDF
Tableau for statistical graphic and data visualization
Irreversible: The Public Relations Big Data Revolution
Election 2016: Mobile Activity in the U.S. on Election Day
Your App Metrics Are Telling You How To Make Them Better: The Digits Data Report
Parametric and non parametric test
4.4 M. Ángeles Rol
Good Enough Analytics
Gone Shopping: detailed retail mapping
Tableau for statistical graphic and data visualization

Similar to Inferential stat tests samples discuss 4 (20)

PDF
Don't Forget This!
PDF
The Interpersonal communication code
PDF
Visualizing Your Startup Pitch Deck
PPT
Main msdi milad jangalvaee
PDF
Distribucion geografica de las areas de demanda de servicios
PDF
DSD-INT 2016 Urban water modelling - Meijer
PDF
Genetic Algorithm (GA) Optimization - Step-by-Step Example
PPTX
More Reliable Delivery with Monte Carlo & Story Mapping
PPTX
ch02.pptxCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCV
PPTX
nature of probability and statistics.pptx
PDF
Eficiency and Low Cost: Pro Tips for you to save 50% of your money with Googl...
PPTX
Properties of Normal Distribution
PPT
Week 12 (StatisticalProcessControl)-StatisticalProcessControl-StatisticalProc...
PPTX
Quarter 3 Week 1_Advanced Statistics.pptx
DOCX
Training needs analysis template tool
PPTX
sience 2.0 : an illustration of good research practices in a real study
DOCX
Examplesbasic-input.txtk 2 3 K 1 1 2 3 3 3k 2 3 K 1 1 2 4.docx
DOCX
wealth age region37 50 M24 88 U14 64 A13 63 U13 66 .docx
PDF
Sensitivity Analysis
PPTX
Agnė DZIDOLIKAITĖ. Evolutionary Approach in Optimization
Don't Forget This!
The Interpersonal communication code
Visualizing Your Startup Pitch Deck
Main msdi milad jangalvaee
Distribucion geografica de las areas de demanda de servicios
DSD-INT 2016 Urban water modelling - Meijer
Genetic Algorithm (GA) Optimization - Step-by-Step Example
More Reliable Delivery with Monte Carlo & Story Mapping
ch02.pptxCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCV
nature of probability and statistics.pptx
Eficiency and Low Cost: Pro Tips for you to save 50% of your money with Googl...
Properties of Normal Distribution
Week 12 (StatisticalProcessControl)-StatisticalProcessControl-StatisticalProc...
Quarter 3 Week 1_Advanced Statistics.pptx
Training needs analysis template tool
sience 2.0 : an illustration of good research practices in a real study
Examplesbasic-input.txtk 2 3 K 1 1 2 3 3 3k 2 3 K 1 1 2 4.docx
wealth age region37 50 M24 88 U14 64 A13 63 U13 66 .docx
Sensitivity Analysis
Agnė DZIDOLIKAITĖ. Evolutionary Approach in Optimization
Ad

More from Makati Science High School (20)

PPTX
Writing a Scientific Paper
DOCX
Research Paper Rubrics 2020
PPTX
Statistical test discuss 5
PDF
Measures of variation discuss 2.1
PDF
Measures of dispersion discuss 2.2
PPTX
Materials and methods discuss
PPT
Ds vs Is discuss 3.1
PDF
Descriptive inferential-discuss 1
PDF
Central tendency m,m,m 1.2
PPT
Central tendency discuss 2
PDF
Types of graphs and charts and their uses with examples and pics
PDF
Levels of measurement discuss
PDF
Gantt chart discuss 3
PDF
Gantt chart discuss 2
PDF
Gantt chart discuss 1
PPTX
Research Designs -9 experimental Designs
PPTX
Research designs Pt 1
PPTX
Identifying variables
PPTX
Kinds and classifications of research
PPTX
Research Ethical Issues
Writing a Scientific Paper
Research Paper Rubrics 2020
Statistical test discuss 5
Measures of variation discuss 2.1
Measures of dispersion discuss 2.2
Materials and methods discuss
Ds vs Is discuss 3.1
Descriptive inferential-discuss 1
Central tendency m,m,m 1.2
Central tendency discuss 2
Types of graphs and charts and their uses with examples and pics
Levels of measurement discuss
Gantt chart discuss 3
Gantt chart discuss 2
Gantt chart discuss 1
Research Designs -9 experimental Designs
Research designs Pt 1
Identifying variables
Kinds and classifications of research
Research Ethical Issues
Ad

Recently uploaded (20)

PDF
RMMM.pdf make it easy to upload and study
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
01-Introduction-to-Information-Management.pdf
PDF
Pre independence Education in Inndia.pdf
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
PPTX
Cell Types and Its function , kingdom of life
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
VCE English Exam - Section C Student Revision Booklet
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
Pharmacology of Heart Failure /Pharmacotherapy of CHF
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
PPH.pptx obstetrics and gynecology in nursing
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
RMMM.pdf make it easy to upload and study
102 student loan defaulters named and shamed – Is someone you know on the list?
01-Introduction-to-Information-Management.pdf
Pre independence Education in Inndia.pdf
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Microbial diseases, their pathogenesis and prophylaxis
Anesthesia in Laparoscopic Surgery in India
Origin of periodic table-Mendeleev’s Periodic-Modern Periodic table
Cell Types and Its function , kingdom of life
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
VCE English Exam - Section C Student Revision Booklet
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
O5-L3 Freight Transport Ops (International) V1.pdf
Supply Chain Operations Speaking Notes -ICLT Program
Pharmacology of Heart Failure /Pharmacotherapy of CHF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPH.pptx obstetrics and gynecology in nursing
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES

Inferential stat tests samples discuss 4

  • 2. Data Analysis Statistics - a powerful tool for analyzing data 1. Descriptive Statistics - provide an overview of the attributes of a data set. These include measurements of central tendency (frequency histograms, mean, median, & mode) and dispersion (range, variance & standard deviation) 2. Inferential Statistics - provide measures of how well your data support your hypothesis and if your data are generalizable beyond what was tested (significance tests)
  • 4. 2 4 10 4 6 8 7 10 4 3 7 9 6 7 5 2 5 8 2 10 7 2 3 5 2 9 3 9 6 1 4 2 6 4 9 3 4 1 8 7 9 1 8 1 10 10 6 4 2 7 1 1 9 10 4 4 6 6 2 5 9 10 2 6 8 10 1 6 10 10 4 4 4 9 2 1 4 5 9 6 6 2 7 8 8 6 6 10 6 6 7 5 9 2 6 4 8 6 6 10 5 7 1 9 1 10 8 8 5 10 1 4 8 3 6 7 1 5 2 4 4 10 5 8 5 1 1 4 3 6 7 3 1 5 4 3 6 2 7 8 3 3 6 6 2 8 6 5 9 8 4 6 3 8 3 3 10 8 10 5 7 5 1 4 3 2 1 10 2 10 6 10 7 9 8 8 4 9 9 10 3 7 6 2 1 1 10 3 5 7 4 1 2 9 10 10 6 1 3 2 1 3 9 9 4 2 2 2 1 8 3 1 5 9 9 8 3 2 5 4 4 2 3 10 8 2 3 4 1 3 3 2 10 10 5 7 3 3 10 1 5 7 5 1 2 5 8 7 3 8 9 2 10 8 1 1 5 3 3 7 6 7 9 8 8 4 9 8 4 3 10 8 10 4 10 2 3 5 6 3 1 9 8 1 10 2 3 1 6 3 8 9 6 2 4 4 2 7 8 4 4 4 4 10 8 5 9 3 10 5 3 6 9 3 7 4 2 3 10 2 5 1 6 8 5 6 8 1 8 5 7 6 4 1 2 7 2 9 5 3 8 2 3 2 9 9 1 1 5 7 8 5 6 3 8 5 4 10 6 9 5 1 10 10 5 1 4 3 2 3 6 9 10 2 6 3 1 2 8 6 1 8 7 8 5 3 7 2 4 1 8 9 10 10 5 1 3 6 5 8 3 3 8 8 2 7 1 6 9 8 2 10 3 7 9 2 1 9 7 7 3 1 9 6 8 2 6 4 6 3 7 10 9 6 1 10 7 5 3 10 1 6 5 4 3 2 4 4 1 5 5 10 6 2 1 1 1 5 6 3 8 10 8 10 9 7 7 7 8 4 8 1 3 5 8 1 8 4 4 6 4 7 2 4 9 1 8 5 3 3 5 10 1 4 6 3 3 8 2 2 The Population: =5.314 Population size = 500
  • 5. 2 4 10 4 6 8 7 10 4 3 7 9 6 7 5 2 5 8 2 10 7 2 3 5 2 9 3 9 6 1 4 2 6 4 9 3 4 1 8 7 9 1 8 1 10 10 6 4 2 7 1 1 9 10 4 4 6 6 2 5 9 10 2 6 8 10 1 6 10 10 4 4 4 9 2 1 4 5 9 6 6 2 7 8 8 6 6 10 6 6 7 5 9 2 6 4 8 6 6 10 5 7 1 9 1 10 8 8 5 10 1 4 8 3 6 7 1 5 2 4 4 10 5 8 5 1 1 4 3 6 7 3 1 5 4 3 6 2 7 8 3 3 6 6 2 8 6 5 9 8 4 6 3 8 3 3 10 8 10 5 7 5 1 4 3 2 1 10 2 10 6 10 7 9 8 8 4 9 9 10 3 7 6 2 1 1 10 3 5 7 4 1 2 9 10 10 6 1 3 2 1 3 9 9 4 2 2 2 1 8 3 1 5 9 9 8 3 2 5 4 4 2 3 10 8 2 3 4 1 3 3 2 10 10 5 7 3 3 10 1 5 7 5 1 2 5 8 7 3 8 9 2 10 8 1 1 5 3 3 7 6 7 9 8 8 4 9 8 4 3 10 8 10 4 10 2 3 5 6 3 1 9 8 1 10 2 3 1 6 3 8 9 6 2 4 4 2 7 8 4 4 4 4 10 8 5 9 3 10 5 3 6 9 3 7 4 2 3 10 2 5 1 6 8 5 6 8 1 8 5 7 6 4 1 2 7 2 9 5 3 8 2 3 2 9 9 1 1 5 7 8 5 6 3 8 5 4 10 6 9 5 1 10 10 5 1 4 3 2 3 6 9 10 2 6 3 1 2 8 6 1 8 7 8 5 3 7 2 4 1 8 9 10 10 5 1 3 6 5 8 3 3 8 8 2 7 1 6 9 8 2 10 3 7 9 2 1 9 7 7 3 1 9 6 8 2 6 4 6 3 7 10 9 6 1 10 7 5 3 10 1 6 5 4 3 2 4 4 1 5 5 10 6 2 1 1 1 5 6 3 8 10 8 10 9 7 7 7 8 4 8 1 3 5 8 1 8 4 4 6 4 7 2 4 9 1 8 5 3 3 5 10 1 4 6 3 3 8 2 2 The Sample: 7, 6, 4, 9, 8, 3, 2, 6, 1 mean = 5.111 The Population: =5.314
  • 6. 2 4 10 4 6 8 7 10 4 3 7 9 6 7 5 2 5 8 2 10 7 2 3 5 2 9 3 9 6 1 4 2 6 4 9 3 4 1 8 7 9 1 8 1 10 10 6 4 2 7 1 1 9 10 4 4 6 6 2 5 9 10 2 6 8 10 1 6 10 10 4 4 4 9 2 1 4 5 9 6 6 2 7 8 8 6 6 10 6 6 7 5 9 2 6 4 8 6 6 10 5 7 1 9 1 10 8 8 5 10 1 4 8 3 6 7 1 5 2 4 4 10 5 8 5 1 1 4 3 6 7 3 1 5 4 3 6 2 7 8 3 3 6 6 2 8 6 5 9 8 4 6 3 8 3 3 10 8 10 5 7 5 1 4 3 2 1 10 2 10 6 10 7 9 8 8 4 9 9 10 3 7 6 2 1 1 10 3 5 7 4 1 2 9 10 10 6 1 3 2 1 3 9 9 4 2 2 2 1 8 3 1 5 9 9 8 3 2 5 4 4 2 3 10 8 2 3 4 1 3 3 2 10 10 5 7 3 3 10 1 5 7 5 1 2 5 8 7 3 8 9 2 10 8 1 1 5 3 3 7 6 7 9 8 8 4 9 8 4 3 10 8 10 4 10 2 3 5 6 3 1 9 8 1 10 2 3 1 6 3 8 9 6 2 4 4 2 7 8 4 4 4 4 10 8 5 9 3 10 5 3 6 9 3 7 4 2 3 10 2 5 1 6 8 5 6 8 1 8 5 7 6 4 1 2 7 2 9 5 3 8 2 3 2 9 9 1 1 5 7 8 5 6 3 8 5 4 10 6 9 5 1 10 10 5 1 4 3 2 3 6 9 10 2 6 3 1 2 8 6 1 8 7 8 5 3 7 2 4 1 8 9 10 10 5 1 3 6 5 8 3 3 8 8 2 7 1 6 9 8 2 10 3 7 9 2 1 9 7 7 3 1 9 6 8 2 6 4 6 3 7 10 9 6 1 10 7 5 3 10 1 6 5 4 3 2 4 4 1 5 5 10 6 2 1 1 1 5 6 3 8 10 8 10 9 7 7 7 8 4 8 1 3 5 8 1 8 4 4 6 4 7 2 4 9 1 8 5 3 3 5 10 1 4 6 3 3 8 2 2 The Sample: 1, 5, 8, 7, 4, 1, 6, 6 mean = 4.75 The Population: =5.314
  • 7. Parametric or Non-parametric? •Parametric tests are restricted to data that: 1) show a normal distribution 2) * are independent of one another 3) * are on the same continuous scale of measurement •Non-parametric tests are used on data that: 1) show an other-than normal distribution 2) are dependent or conditional on one another 3) in general, do not have a continuous scale of measurement e.g., the length and weight of something –> parametric vs. did the bacteria grow or not grow –> non-parametric
  • 8. The First Question After examining your data, ask: does what you're testing seem to be a question of relatedness or a question of difference? If relatedness (between your control and your experimental samples or between you dependent and independent variable), you will be using tests for correlation (positive or negative) or regression. If difference (your control differs from your experimental), you will be testing for independence between distributions, means or variances. Different tests will be employed if your data show parametric or non-parametric properties. See Flow Chart on page 50 of HBI.
  • 10. Tests for Differences • Between Means - t-Test - P - ANOVA - P - Friedman Test - Kruskal-Wallis Test - Sign Test - Rank Sum Test • Between Distributions - Chi-square for goodness of fit - Chi-square for independence • Between Variances - F-Test – P P – parametric tests
  • 11. Differences Between Means Asks whether samples come from populations with different means Null Hypothesis Alternative Hypothesis A Y B CA Y B C There are different tests if you have 2 vs more than 2 samples
  • 12. Differences Between Means – Parametric Data t-Tests compare the means of two parametric samples E.g. Is there a difference in the mean height of men and women? HBI: t-Test Excel: t-Test (paired and unpaired) – in Tools – Data Analysis
  • 13. A researcher compared the height of plants grown in high and low light levels. Her results are shown below. Use a T-test to determine whether there is a statistically significant difference in the heights of the two groups Low Light High Light 49 45 31 40 43 59 31 58 40 55 44 50 49 46 48 53 33 43
  • 14. Differences Between Means – Parametric Data ANOVA (Analysis of Variance) compares the means of two or more parametric samples. E.g. Is there a difference in the mean height of plants grown under red, green and blue light? HBI: ANOVA Excel: ANOVA – check type under Tools – Data Analysis
  • 15. weight of pigs fed different foods food 1 food 2 food 3 food 4 60.8 68.7 102.6 87.9 57.0 67.7 102.1 84.2 65.0 74.0 100.2 83.1 58.6 66.3 96.5 85.7 61.7 69.8 90.3 A researcher fed pigs on four different foods. At the end of a month feeding, he weighed the pigs. Use an ANOVA test to determine if the different foods resulted in differences in growth of the pigs.
  • 16. Aplysia punctata – the sea hare
  • 18. Differences Between Means – Non- Parametric Data The Sign Test compares the means of two “paired”, non- parametric samples E.g. Is there a difference in the gill withdrawal response of Aplysia in night versus day? Each subject has been tested once at night and once during the day –> paired data. HBI: Sign Test Excel: N/A Subject Night Response Day Response 1 2 5 2 1 3 3 2 2
  • 20. The Friedman Test is like the Sign test, (compares the means of “paired”, non-parametric samples) for more than two samples. E.g. Is there a difference in the gill withdrawal response of Aplysia between morning, afternoon and evening? Each subject has been tested once during each time period –> paired data HBI: Friedman Test Excel: N/A Subject Morning Response Afternoon Response Evening. Response 1 4 3 2 2 5 2 1 3 3 4 3 Differences Between Means – Non- Parametric Data
  • 22. The Rank Sum test compares the means of two non- parametric samples E.g. Is there a difference in the gill withdrawal response of Aplysia in night versus day? Each subject has been tested once, either during the night or during the day –> unpaired data. HBI: Rank Sum Excel: N/A Subject Night Response Day Response 1 5 2 1 3 2 4 3 5 4 6 1 7 5 Differences Between Means – Non- Parametric Data
  • 24. The Kruskal-Wallis Test compares the means of more than two non-parametric, non-paired samples E.g. Is there a difference in the gill withdrawal response of Aplysia in night versus day? Each subject has been tested once, either during the morning, afternoon or evening –> unpaired data. HBI: Kruskal-Wallis Test Excel: N/A Differences Between Means – Non- Parametric Data Subject Morning Response Afternoon Response Evening. Response 1 4 2 5 3 4 4 3 5 2 6 3
  • 26. Chi square tests compare observed frequency distributions, either to theoretical expectations or to other observed frequency distributions. Differences Between Distributions
  • 27. Differences Between Distributions E.g. The F2 generation of a cross between a round pea and a wrinkled pea produced 72 round individuals and 20 wrinkled individuals. Does this differ from the expected 3:1 round:wrinkled ratio of a simple dominant trait? HBI: Chi-Square One Sample Test (goodness of fit) Excel: Chitest – under Function Key – Statistical Smooth Frequency Wrinkled E E
  • 28. E.g. 67 out of 100 seeds placed in plain water germinated while 36 out of 100 seeds placed in “acid rain” water germinated. Is there a difference in the germination rate? HBI: Chi-Square Two or More Sample Test (independence) Excel: Chitest – under Function key - Statistical Plain Acid Plain Proportion Germination Acid Proportion Germination Null Hypothesis Alternative Hypothesis Differences Between Distributions
  • 29. Correlations look for relationships between two variables which may not be functionally related. The variables may be ordinal, interval, or ratio scale data. Remember, correlation does not prove causation; thus there may not be a cause and effect relationship between the variables. E.g. Do species of birds with longer wings also have longer necks? HBI: Spearman’s Rank Correlation (NP) Excel: Correlation (P) Correlation
  • 30. Question – is there a relationship between students aptitude for mathemathics and for biology? Student Math score Math Rank Biol. score Biology rank 1 57 3 83 7 2 45 1 37 1 3 72 7 41 2 4 78 8 84 8 5 53 2 56 3 6 63 5 85 9 7 86 9 77 6 8 98 10 87 10 9 59 4 70 5 10 71 6 59 4
  • 32. Regressions look for functional relationships between two continuous variables. A regression assumes that a change in X causes a change in Y. E.g. Does an increase in light intensity cause an increase in plant growth? HBI: Regression Analysis (P) Excel: Regression (P) Regression
  • 33. Correlation & Regression Looks for relationships between two continuous variables Null Hypothesis Alternative Hypothesis X Y X Y
  • 34. Is there a relationship between wing length and tail length in songbirds? wing length cm tail length cm 10.4 7.4 10.8 7.6 11.1 7.9 10.2 7.2 10.3 7.4 10.2 7.1 10.7 7.4 10.5 7.2 10.8 7.8 11.2 7.7 10.6 7.8 11.4 8.3
  • 35. Is there a relationship between age and systolic blood pressure? Age (yr) systolic blood pressure mm hg 30 108 30 110 30 106 40 125 40 120 40 118 40 119 50 132 50 137 50 134 60 148 60 151 60 146 60 147 60 144 70 162 70 156 70 164 70 158 70 159