Statistical Analysis
Why do we need stats?
To understand the results of an experiment
To make effective conclusions
To be informed consumers

What does this graph show us?
This graph appears to show that people over 80 are the safest group of drivers, and that drivers under 20 are safer than those between 20 and 24. Right?

The problem with that assumption is that the number of accidents does not account for how much driving each group does. Consider this other graph. Mile for mile, people over 80 have the most accidents, followed by those under 20. This graph suggests that a person's driving improves up until age 44. After that, safety per mile driven declines, and over 74 there is a huge jump in accidents per mile driven.

Neither graph proves that age is what causes the incidence of accidents.

STANDARD DEVIATION
There is almost always variation in biological data
This variation can be shown using a frequency distribution graph
The mean value is in the middle of the distribution
Mean - the average of the values (the sum of the values divided by the number of values)

Normal Distribution
Standard Deviation - the computed measure of how much the values vary around the mean (above and below)
68% of the data is within 1 SD of the mean
95% of the data is within 2 SD of the mean
99.7% (about 99%) of the data is within 3 SD of the mean

Starter Questions
Which Sx represents a set of data that is very similar to the mean?
A. 4.5   B. 23.6   C. 0.6   D. 19.6
What percentage of data falls within +/- 1 Sx of the mean?
If the mean of a set of data is 55 and Sx = 6, what range of values falls within +/- 1 Sx, +/- 2 Sx and +/- 3 Sx?
Draw a normal distribution graph. Include the mean and the percentage of data that fall within +/- 1 Sx, +/- 2 Sx and +/- 3 Sx.
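The definitions of the mean and Sx, and the 68% / 95% / 99.7% ranges, can be sketched in a few lines of Python. This is a minimal illustration, not part of the original slides: the function name `empirical_rule_ranges` and the five-value `data` list are made up for the example, and the percentages only hold for data that is approximately normally distributed.

```python
import statistics

def empirical_rule_ranges(mean, sd):
    """Return {k: (low, high, percent)} for mean +/- k*SD, k = 1, 2, 3.

    The percentages are the usual normal-distribution approximations.
    """
    percents = {1: 68, 2: 95, 3: 99.7}
    return {k: (mean - k * sd, mean + k * sd, pct) for k, pct in percents.items()}

# Hypothetical measurements, used only to show how the mean and Sx are computed.
data = [4.0, 5.0, 6.0, 7.0, 8.0]
mean = statistics.mean(data)   # sum of the values divided by the number of values
sx = statistics.stdev(data)    # sample standard deviation (divides by n - 1)

# Ranges for the starter question: mean = 55, Sx = 6.
ranges = empirical_rule_ranges(55, 6)
for k, (low, high, pct) in ranges.items():
    print(f"+/- {k} Sx: {low} to {high} (about {pct}% of the data)")
```

`statistics.stdev` is the sample standard deviation, which is what a calculator usually labels Sx; `statistics.pstdev` would give the population version instead.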
A low standard deviation indicates that the data points tend to be very close to the mean, whereas a high standard deviation indicates that the data are spread out over a large range of values.

A set of length measurements is taken with a mean of 2.5 cm and a standard deviation of 0.5 cm. Which of the following is true?
1. 68% of all data lie between 2.5 cm and 3.5 cm
2. 68% of all data lie between 1.5 cm and 3.5 cm
3. 95% of all data lie between 1.5 cm and 3.5 cm
4. 95% of all data lie between 2.0 cm and 3.0 cm

Answer: 95% of all data lie between 1.5 cm and 3.5 cm
1 SD = 0.5 cm
68% of data are within +/- 1 SD, so 68% are between 2.0 cm and 3.0 cm
95% of data are within +/- 2 SD, so 95% are between 1.5 cm and 3.5 cm

Error Bars
2 Types of Error Bars
Range of Data
Standard Deviation
Starter
In a population of men, the systolic blood pressure shows a normal distribution. The mean of the population is 125 mm Hg and the standard deviation is 10 mm Hg. If the population is 1000, how many of the men have a blood pressure between 115 and 135 mm Hg?

680 men have a blood pressure between 115 and 135 mm Hg. If the mean is 125 and the standard deviation is 10, then +1 Sx is 135 and -1 Sx is 115, and we know that 68% of the data (in this case the men) are within +/- 1 Sx of the mean.
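As a quick check on the 68% and 95% rules of thumb used in the answers above, the exact shares can be computed from the normal distribution itself. The sketch below uses `statistics.NormalDist` from the Python standard library (Python 3.8 or later); the variable names are chosen for this example and it assumes the data really are normally distributed.

```python
from statistics import NormalDist

# Blood pressure starter: mean 125 mm Hg, standard deviation 10 mm Hg.
blood_pressure = NormalDist(mu=125, sigma=10)

# Share of the population between -1 Sx (115) and +1 Sx (135).
share_within_1sd = blood_pressure.cdf(135) - blood_pressure.cdf(115)
expected_men = 1000 * share_within_1sd

# Length example: mean 2.5 cm, SD 0.5 cm, so +/- 2 SD is 1.5 cm to 3.5 cm.
length = NormalDist(mu=2.5, sigma=0.5)
share_within_2sd = length.cdf(3.5) - length.cdf(1.5)

print(f"within 1 SD: {share_within_1sd:.4f}, about {expected_men:.0f} men")
print(f"within 2 SD: {share_within_2sd:.4f}")
```

The exact share within 1 SD is slightly above 68%, so the computed head count comes out a little higher than the 680 obtained with the rounded rule.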
Using Excel
Create your data
Find the mean of your data
Calculate the standard deviation (Sx) of your data
Graph your mean
Insert Graph (Scatter)
Then go to Layout
In Layout choose the Error Bars tab
Choose More Error Bars Options
Select Custom
For standard deviation error bars, select your Sx for both the positive and negative values
For max/min error bars, select your max and your min. Take the difference of each from your mean, and input that as your value
Now label your graph!
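The values typed into the custom error bar dialog in the steps above can also be worked out outside Excel. The following is a minimal Python sketch under stated assumptions: the function name `error_bar_values` and the sample `data` list are hypothetical, and it only computes the numbers, it does not draw the graph.

```python
import statistics

def error_bar_values(data):
    """Compute the mean plus the positive/negative values for both error bar types."""
    mean = statistics.mean(data)
    sx = statistics.stdev(data)  # sample standard deviation (Sx)
    return {
        "mean": mean,
        # Standard deviation error bars: Sx is used for both directions.
        "sd_positive": sx,
        "sd_negative": sx,
        # Max/min (range) error bars: distance from the mean to each extreme.
        "range_positive": max(data) - mean,
        "range_negative": mean - min(data),
    }

# Hypothetical measurements for illustration only.
data = [10, 12, 14, 15, 19]
bars = error_bar_values(data)
for name, value in bars.items():
    print(f"{name}: {value:.2f}")
```

Note that range error bars are usually asymmetric (the maximum and minimum are rarely the same distance from the mean), while standard deviation error bars are symmetric.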
Means
A = 10
B = 20
Is there a significant difference between the means?


  • 26. Means: A = 10, B = 20. Is there a significant difference between the means? Would knowing the standard deviations help? What if both had “large” standard deviations?
  • 27. Means: A = 10, B = 20. Is there a significant difference between the means? Would knowing the standard deviations help? What if both had “small” standard deviations?
  • 28. Means: A = 10, B = 20. Is there a significant difference between the means? Would knowing the sample size help? What if one sample was large and the other small? What if both were large or both small?
  • 29. The t-test takes the means, the standard deviations and the sample sizes of both samples into account and gives you a t-value, which you can use with a t-test table to determine whether there is a statistically significant difference between the means. DO NOT learn the formula. The t-value will be given to you.
  • 30. The 0.05 column of the t-test table gives our critical value.
  • 32. If the calculated value of t > critical value, then p < 0.05, which means there is a significant difference.
  • 33. If the calculated value of t < critical value, then p > 0.05, which means there is NO significant difference. Calculate the degrees of freedom: N = sample size, degrees of freedom = N1 + N2 - 2.
  • 34. The null hypothesis (H0) states that there is no significant difference between the two groups. We never want to assume there is a difference. The null hypothesis typically corresponds to a general or default position. For example, the null hypothesis might be that there is no relationship between two measured phenomena, or that a potential treatment has no effect.
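The slides say the t-value will be given, so the sketch below is only meant to show why the standard deviations and sample sizes matter when two means are 10 and 20. It assumes the equal-variance (pooled) form of the two-sample t-test, which matches the degrees of freedom N1 + N2 - 2 quoted above; the slides do not say which form is intended. The function name `pooled_t_test` and all four data sets are hypothetical. The critical value 2.306 is the standard two-tailed 0.05 table entry for 8 degrees of freedom.

```python
import math
import statistics

def pooled_t_test(sample_a, sample_b):
    """Return (t, degrees_of_freedom) for an equal-variance two-sample t-test."""
    n1, n2 = len(sample_a), len(sample_b)
    mean_a, mean_b = statistics.mean(sample_a), statistics.mean(sample_b)
    var_a, var_b = statistics.variance(sample_a), statistics.variance(sample_b)
    df = n1 + n2 - 2
    # Pool the two sample variances, weighting each by its own degrees of freedom.
    pooled_var = ((n1 - 1) * var_a + (n2 - 1) * var_b) / df
    standard_error = math.sqrt(pooled_var * (1 / n1 + 1 / n2))
    return (mean_a - mean_b) / standard_error, df

CRITICAL_T_DF8 = 2.306  # two-tailed, p = 0.05, 8 degrees of freedom (from a t-table)

# Both pairs have means of 10 and 20; only the spread differs (made-up data).
small_sd_a, small_sd_b = [8, 9, 10, 11, 12], [18, 19, 20, 21, 22]
large_sd_a, large_sd_b = [0, 5, 10, 15, 20], [10, 15, 20, 25, 30]

t_small, df_small = pooled_t_test(small_sd_a, small_sd_b)
t_large, df_large = pooled_t_test(large_sd_a, large_sd_b)

# Reject the null hypothesis only when |t| exceeds the critical value.
significant_small = abs(t_small) > CRITICAL_T_DF8
significant_large = abs(t_large) > CRITICAL_T_DF8
print(f"small SD: t = {t_small:.2f}, df = {df_small}, significant = {significant_small}")
print(f"large SD: t = {t_large:.2f}, df = {df_large}, significant = {significant_large}")
```

With the same two means, the tightly clustered samples give a large t-value and the null hypothesis is rejected, while the widely spread samples give a t-value below the critical value and the null hypothesis is kept.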