SlideShare a Scribd company logo
Data Science - Normal Distribution
Dr.M.Pyingkodi
Associate Professor
Dept. of MCA
Kongu Engineering College
Erode, Tamil Nadu,India
• A Normal Distribution
• A Normal Distribution is a bell-shaped curve that is symmetric about the
mean, where most of the data points are clustered around the center and
fewer are found as you move away from the mean.
• Characteristics
• 1. Symmetry
• The distribution is symmetric about the mean.
• This means that the left side of the distribution is a mirror image of the
right side.
• 2. Bell-Shaped Curve
• The graph of a normal distribution is shaped like a bell, with a single
peak at the mean.
• It rises gradually on both sides and tails off symmetrically.
• 3. Mean, Median, and Mode Equality
• In a normal distribution, the mean, median, and mode are all equal and
located at the center of the distribution.
• This central point is also known as the peak of the curve.
Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 2
• A Normal Distribution
• 4. 68-95-99.7 Rule
• Approximately 68% of the data falls within one standard deviation of the
mean, about 95% falls within two standard deviations, and about 99.7%
falls within three standard deviations.
• This is often referred to as the empirical rule.
• 5. Asymptotic Nature
• The tails of the normal distribution approach, but never actually touch,
the horizontal axis.
• This means that there is a theoretical possibility of observing values far
from the mean, though these occurrences become increasingly rare.
Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 3
z-score
• A z-score is a statistical measure that indicates how many standard
deviations a data point is from the mean of a dataset.
• It is a way to standardize scores from different distributions, allowing for
comparison.
What a Z-Score Indicates ?
• 1.Position Relative to the Mean
• A z-score tells you how far away a specific value (data point) is from the
mean.
 A positive z-score indicates that the data point is above the
mean.
 A negative z-score indicates that the data point is below the
mean.
 A z-score of 0 means the data point is exactly at the mean.
Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 4
• 2. Standard Deviations
• The value of the z-score represents the number of standard deviations
the data point is from the mean.
• For example, a z-score of 2 means the data point is two standard
deviations above the mean, while a z-score of -1.5 means
• it is one and a half standard deviations below the mean.
• Example
• Consider a dataset of test scores for a class, with the following
characteristics:
•
• Mean (μmuμ) - 70
• Standard Deviation (σsigmaσ) -10
• Suppose a student scores 85 on the test.
• To calculate the z-score for this student's score, we use the formula
• Where
• X = the data point (student's score)
• m= the mean
• σ = the standard deviation
Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 5
• Substituting in the values
• In this example, the student's z-score is 1.5.
• This means that the student scored 1.5 standard deviations above the
mean score of the class.
• This indicates a relatively high performance compared to the average
student in the class.
• Using z-scores allows us to understand the position of a data point
within the context of the overall dataset, making it easier to compare
scores across different distributions.
• Why Z-Score ?
• Comparison Across Different Datasets
• Z-scores allow us to compare values from different datasets, even if they
have different units or scales.
•
Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 6
• For instance, if you're comparing the performance of two students on
different tests, the z-scores normalize their scores, making it easier to
compare their relative performance.
• Probability and Percentiles
• In a normal distribution, z-scores are tied to probabilities.
• For example, a z-score of +1 corresponds to about the 84th percentile of
the data, meaning that 84% of the data points are below this score.
• Identifying Outliers
• Z-scores can highlight extreme values. If a z-score is very high or low
(e.g., beyond ±3), the value may be considered an outlier in the dataset.
• Imagine the average height of adult men in a population is 175 cm with a
standard deviation of 10 cm.
• This means he is 1.5 standard deviations taller than the average
• Z-scores make it easier to understand how unusual or typical a value is
relative to the overall data distribution.
Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 7

More Related Content

PPTX
Chapters 2 4
PDF
Lect. 04 Normal distribution biostatistics
PPTX
The standard normal curve & its application in biomedical sciences
PPT
Lesson 8 zscore
PPTX
Z scores lecture chapter 2 and 4
PPTX
probability and random single normal distribution presentation 017 (1).pptx
PPTX
probability and random single normal distribution presentation 017 (1).pptx
Chapters 2 4
Lect. 04 Normal distribution biostatistics
The standard normal curve & its application in biomedical sciences
Lesson 8 zscore
Z scores lecture chapter 2 and 4
probability and random single normal distribution presentation 017 (1).pptx
probability and random single normal distribution presentation 017 (1).pptx

Similar to Data Science Normal Distribution Z-Score (20)

PPTX
Chapters 2 & 4
PPTX
Chapters 2 & 4
PPTX
PSUnit_II_Lesson_2_Understanding_the_z-scores.pptx
PPTX
Introduction to z-Scores
PPTX
PSUnit_II_Lesson_2_Understanding_the_z-scores.pptx
PPT
Derived Scores
PPTX
Location scores
PPT
Day 4 normal curve and standard scores
PPTX
Z scores
PDF
chapter 5 Z-Scores (Essentials of Statistics for the Behavioral Sciences)
DOC
Z score
PPTX
What are z-scores.pptx
PPT
Standard Scores
PPTX
Introduction-to-Normal-Distribution.pptx
PDF
Analysis-Interpretation-and-Use-of-Test-Data.pdf
PPTX
m2_2_variation_z_scores.pptx
PPTX
normal curve distribution biostatics course.pptx
PDF
Z-score and probability in statistics.pdf
PPTX
Understanding-the-Z-scores. power point presentation
PDF
REPORT MATH.pdf
Chapters 2 & 4
Chapters 2 & 4
PSUnit_II_Lesson_2_Understanding_the_z-scores.pptx
Introduction to z-Scores
PSUnit_II_Lesson_2_Understanding_the_z-scores.pptx
Derived Scores
Location scores
Day 4 normal curve and standard scores
Z scores
chapter 5 Z-Scores (Essentials of Statistics for the Behavioral Sciences)
Z score
What are z-scores.pptx
Standard Scores
Introduction-to-Normal-Distribution.pptx
Analysis-Interpretation-and-Use-of-Test-Data.pdf
m2_2_variation_z_scores.pptx
normal curve distribution biostatics course.pptx
Z-score and probability in statistics.pdf
Understanding-the-Z-scores. power point presentation
REPORT MATH.pdf
Ad

More from Pyingkodi Maran (20)

PDF
Defining Identity as a Service (IDaaS) in Cloud Computing
PDF
Data Science Introduction and Process in Data Science
PDF
Database Manipulation with MYSQL Commands
PDF
Jquery Tutorials for designing Dynamic Web Site
PDF
Working with AWS Relational Database Instances
DOC
Health Monitoring System using IoT.doc
PPT
IoT Industry Adaptation of AI.ppt
PPT
IoT_Testing.ppt
PDF
Azure Devops
PDF
Creation of Web Portal using DURPAL
PDF
AWS Relational Database Instance
PDF
AWS S3 Buckets
PDF
Creation of AWS Instance in Cloud Platform
PDF
Amazon Web Service.pdf
PDF
Cloud Security
PDF
Cloud Computing Introduction
PDF
Supervised Machine Learning Algorithm
PDF
Unsupervised Learning in Machine Learning
PDF
Feature Engineering in Machine Learning
PDF
Normalization in DBMS
Defining Identity as a Service (IDaaS) in Cloud Computing
Data Science Introduction and Process in Data Science
Database Manipulation with MYSQL Commands
Jquery Tutorials for designing Dynamic Web Site
Working with AWS Relational Database Instances
Health Monitoring System using IoT.doc
IoT Industry Adaptation of AI.ppt
IoT_Testing.ppt
Azure Devops
Creation of Web Portal using DURPAL
AWS Relational Database Instance
AWS S3 Buckets
Creation of AWS Instance in Cloud Platform
Amazon Web Service.pdf
Cloud Security
Cloud Computing Introduction
Supervised Machine Learning Algorithm
Unsupervised Learning in Machine Learning
Feature Engineering in Machine Learning
Normalization in DBMS
Ad

Recently uploaded (20)

PPTX
Pilar Kemerdekaan dan Identi Bangsa.pptx
PDF
How to run a consulting project- client discovery
PPTX
STERILIZATION AND DISINFECTION-1.ppthhhbx
PPTX
Database Infoormation System (DBIS).pptx
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PDF
Optimise Shopper Experiences with a Strong Data Estate.pdf
PPTX
importance of Data-Visualization-in-Data-Science. for mba studnts
PPTX
CYBER SECURITY the Next Warefare Tactics
PDF
Microsoft Core Cloud Services powerpoint
PPTX
modul_python (1).pptx for professional and student
PPTX
Introduction to Inferential Statistics.pptx
PPTX
SAP 2 completion done . PRESENTATION.pptx
PPTX
New ISO 27001_2022 standard and the changes
PPTX
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
PPTX
Managing Community Partner Relationships
PDF
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
PDF
Introduction to the R Programming Language
PDF
annual-report-2024-2025 original latest.
PDF
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
PPTX
Acceptance and paychological effects of mandatory extra coach I classes.pptx
Pilar Kemerdekaan dan Identi Bangsa.pptx
How to run a consulting project- client discovery
STERILIZATION AND DISINFECTION-1.ppthhhbx
Database Infoormation System (DBIS).pptx
Qualitative Qantitative and Mixed Methods.pptx
Optimise Shopper Experiences with a Strong Data Estate.pdf
importance of Data-Visualization-in-Data-Science. for mba studnts
CYBER SECURITY the Next Warefare Tactics
Microsoft Core Cloud Services powerpoint
modul_python (1).pptx for professional and student
Introduction to Inferential Statistics.pptx
SAP 2 completion done . PRESENTATION.pptx
New ISO 27001_2022 standard and the changes
Microsoft-Fabric-Unifying-Analytics-for-the-Modern-Enterprise Solution.pptx
Managing Community Partner Relationships
168300704-gasification-ppt.pdfhghhhsjsjhsuxush
Introduction to the R Programming Language
annual-report-2024-2025 original latest.
Data Engineering Interview Questions & Answers Batch Processing (Spark, Hadoo...
Acceptance and paychological effects of mandatory extra coach I classes.pptx

Data Science Normal Distribution Z-Score

  • 1. Data Science - Normal Distribution Dr.M.Pyingkodi Associate Professor Dept. of MCA Kongu Engineering College Erode, Tamil Nadu,India
  • 2. • A Normal Distribution • A Normal Distribution is a bell-shaped curve that is symmetric about the mean, where most of the data points are clustered around the center and fewer are found as you move away from the mean. • Characteristics • 1. Symmetry • The distribution is symmetric about the mean. • This means that the left side of the distribution is a mirror image of the right side. • 2. Bell-Shaped Curve • The graph of a normal distribution is shaped like a bell, with a single peak at the mean. • It rises gradually on both sides and tails off symmetrically. • 3. Mean, Median, and Mode Equality • In a normal distribution, the mean, median, and mode are all equal and located at the center of the distribution. • This central point is also known as the peak of the curve. Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 2
  • 3. • A Normal Distribution • 4. 68-95-99.7 Rule • Approximately 68% of the data falls within one standard deviation of the mean, about 95% falls within two standard deviations, and about 99.7% falls within three standard deviations. • This is often referred to as the empirical rule. • 5. Asymptotic Nature • The tails of the normal distribution approach, but never actually touch, the horizontal axis. • This means that there is a theoretical possibility of observing values far from the mean, though these occurrences become increasingly rare. Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 3
  • 4. z-score • A z-score is a statistical measure that indicates how many standard deviations a data point is from the mean of a dataset. • It is a way to standardize scores from different distributions, allowing for comparison. What a Z-Score Indicates ? • 1.Position Relative to the Mean • A z-score tells you how far away a specific value (data point) is from the mean.  A positive z-score indicates that the data point is above the mean.  A negative z-score indicates that the data point is below the mean.  A z-score of 0 means the data point is exactly at the mean. Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 4
  • 5. • 2. Standard Deviations • The value of the z-score represents the number of standard deviations the data point is from the mean. • For example, a z-score of 2 means the data point is two standard deviations above the mean, while a z-score of -1.5 means • it is one and a half standard deviations below the mean. • Example • Consider a dataset of test scores for a class, with the following characteristics: • • Mean (μmuμ) - 70 • Standard Deviation (σsigmaσ) -10 • Suppose a student scores 85 on the test. • To calculate the z-score for this student's score, we use the formula • Where • X = the data point (student's score) • m= the mean • σ = the standard deviation Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 5
  • 6. • Substituting in the values • In this example, the student's z-score is 1.5. • This means that the student scored 1.5 standard deviations above the mean score of the class. • This indicates a relatively high performance compared to the average student in the class. • Using z-scores allows us to understand the position of a data point within the context of the overall dataset, making it easier to compare scores across different distributions. • Why Z-Score ? • Comparison Across Different Datasets • Z-scores allow us to compare values from different datasets, even if they have different units or scales. • Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 6
  • 7. • For instance, if you're comparing the performance of two students on different tests, the z-scores normalize their scores, making it easier to compare their relative performance. • Probability and Percentiles • In a normal distribution, z-scores are tied to probabilities. • For example, a z-score of +1 corresponds to about the 84th percentile of the data, meaning that 84% of the data points are below this score. • Identifying Outliers • Z-scores can highlight extreme values. If a z-score is very high or low (e.g., beyond ±3), the value may be considered an outlier in the dataset. • Imagine the average height of adult men in a population is 175 cm with a standard deviation of 10 cm. • This means he is 1.5 standard deviations taller than the average • Z-scores make it easier to understand how unusual or typical a value is relative to the overall data distribution. Dr.M.Pyingkodi, Associate Professor, Dept of MCA , Kongu Engineering College, Tamil Nadu, India 7