SlideShare a Scribd company logo
Data summarisation & visualisation
Frequency distribution
Summarising data in a presentable format that is in the form of class intervals and frequencies
56 weeks of number of people visiting a store
(Ungrouped data)
56 weeks of number of people visiting a store
(Grouped data)
Class width = Range / Number of classes
Range = Max – Min
Range = 60 – 11 = 49
Number of classes we want = 5
Class width = (49/5) = 9.8
Round 9.8 = 10
* Rule of thumb is to create between 5
and 15 classes
Class interval, Class midpoint, Relative frequencies, Cumulative frequencies
for number of people visiting a store
Relative frequency = Individual class frequency / Total
frequency
Relative frequency = 7 / 56 = 0.13
Cumulative frequency is a running total of frequencies
through the classes
Univariate data visualisation
Univariate data visualisation
Numerical data Categorical data
Histogram Bar graph
Quantitative
data graphs
Qualitative
data graphs
Ogive Pareto chart
Frequency polygon Pie chart
Stem and Leaf plot
Quantitative data graphs are plotted along a
numerical scale
Qualitative data graphs are plotted using non-
numerical categories
Univariate numerical data visualisation (Histogram)
1. Series of continous rectangles represent the frequency of data in given class intervals.
2. X axis : With class mid points and Y axis: With the frequencies.
3. Quick glance at a histogram helps revealing which class intervals produce highest frequency.
* If the class intervals are unequal then the width of the rectangle or area of the rectangles can be used for relative comparison.
Univariate numerical data visualisation (Frequency polygon)
1. Is like histogram, however instead of using rectangles like a histogram each class frequency is plotted as a dot at the class midpoint
and the dots are connected by a series of line segments
2. X axis : With class mid points and Y axis: With the frequencies.
Univariate numerical data visualisation (Ogive)
1. Ogive is a cumulative frequency polygon
2. X axis :Always class end points and Y axis: With the cumulative frequencies.
* Generally used by decission makers to see the running totals
Univariate numerical data visualisation (Stem and Leaf Plot)
1. Constructed by separating the digits for each number of data into two groups a stem and a leaf.
2. Stem: Consists higher valued digits & Leaves: Contain lower values
56 weeks of number of people visiting a store
(Ungrouped data)
Stem & Leaf plot
Univariate categorical data visualisation (Bar chart)
Univariate categorical data visualisation (Pie chart)
13%
7%
25%
32%
24%
Total sales contribution "Product wise"
Product 1
Product 2
Product 3
Product 4
Product 5
Univariate categorical data visualisation (Pareto chart)
Product 4 Product 3 Product 5 Product 1 Product 2
0
50
100
150
200
250
300
350
400
450
0%
10%
20%
30%
40%
50%
60%
70%
80%
90%
100%
32%
57%
81%
93%
100%
Sales (Pareto chart)
Product
Totalsales
Cumulativeproportion
Sort the data in the descending order and use cumulative proportion to plot pareto chart.
* Generally pareto chart are used in defect analysis that is types of defects that occur with a product and service.
* Most common types of defects ranked in order of occurence from left to right and accordingly control persons analyse pareto chart and
make the possible improvement from time to time.
Bivariate data visualisation
Bivariate data visualisation
Cross tabulation Scatter plot
A two dimensional table used to display the
frequency counts for two variables
simultaneously.
Two dimensional graph plot of pairs of points
from two numerical variables.
Bivariate data visualisation (Cross tabulation)
Employee survey data
Cross tabulation
* Cross tabulation is often called as contigency table
Bivariate data visualisation (Scatter plot)
63 64 65 66 67 68 69 70 71 72 73
0.00
10.00
20.00
30.00
40.00
50.00
60.00
70.00
80.00
Height Versus Weight (Scatter plot)
Height (Inches)
Weight(Kg's)
Scatter plot is often used to understand possible relationship between to variables.
* Here we are trying to understand the relationship between Height and Weight.

More Related Content

PDF
2. sampling techniques
PDF
Simple Random Sampling
PPT
Sampling presentation
PPTX
Sampling, measurement, and stats(2013)
PPT
050 sampling theory
PPT
Introduction to basic concept in sampling and sampling techniques
PPTX
MEASUREMENT AND SAMPLING TECHNIQUES
PPTX
Sampling Technique - Anish
2. sampling techniques
Simple Random Sampling
Sampling presentation
Sampling, measurement, and stats(2013)
050 sampling theory
Introduction to basic concept in sampling and sampling techniques
MEASUREMENT AND SAMPLING TECHNIQUES
Sampling Technique - Anish

What's hot (20)

PPTX
Systematic sampling in probability sampling
PPTX
sampling simple random sampling
PPTX
PPT
Sampling methods
PDF
Lecture7.1 data sampling
PDF
Sampling and sampling distribution tttt
PPTX
Chapter 2: Collection of Data
PDF
Research Method for Business chapter 10
PPT
PROBABILITY SAMPLING TECHNIQUES
PPTX
Errors in research
PPT
Sampling and Inference_Political_Science
PPTX
Simple random sampling
PDF
Sampling and Sampling Distribution
PPTX
Systematic ranom sampling for slide share
PDF
Research Method EMBA chapter 10
PPTX
Sampling
PPTX
Sampling (statistics and probability)
PPT
Statistics lesson 1
PPT
Lesson01_Static.11
PPTX
Sampling and sampling distributions
Systematic sampling in probability sampling
sampling simple random sampling
Sampling methods
Lecture7.1 data sampling
Sampling and sampling distribution tttt
Chapter 2: Collection of Data
Research Method for Business chapter 10
PROBABILITY SAMPLING TECHNIQUES
Errors in research
Sampling and Inference_Political_Science
Simple random sampling
Sampling and Sampling Distribution
Systematic ranom sampling for slide share
Research Method EMBA chapter 10
Sampling
Sampling (statistics and probability)
Statistics lesson 1
Lesson01_Static.11
Sampling and sampling distributions
Ad

Similar to 3. data visualisations (20)

PPTX
Charts and graphs
PPTX
Data presentation.pptx
PPT
Lesson02_Static.11
PPT
Lesson02_new
PDF
Chapter 1 - Displaying Descriptive Statistics.pdf
PDF
generic skills for college satictis and businesss
PPTX
Data Presentation biostatistics, school of public health
PPT
Basic Stat Notes
PDF
Data presentation
PPTX
Stats LECTURE 2.pptx
PPTX
Qt graphical representation of data
PPTX
Qt graphical representation of data
PPT
Source of DATA
PDF
2. Descriptive Statistics.pdf
PPT
Ch21 22 data analysis and interpretation
PDF
vishal stats.pdf education maths statis
PPTX
Biostatistics Graphical for grouped data
PPT
FREQUENCY DISTRIBUTION ( distribusi frekuensi) - STATISTICS
PPTX
Mastering Graphical Representations in Data Analysis
PDF
Business Statistics - Diagrammatic and Graphic representationPPT.pdf
Charts and graphs
Data presentation.pptx
Lesson02_Static.11
Lesson02_new
Chapter 1 - Displaying Descriptive Statistics.pdf
generic skills for college satictis and businesss
Data Presentation biostatistics, school of public health
Basic Stat Notes
Data presentation
Stats LECTURE 2.pptx
Qt graphical representation of data
Qt graphical representation of data
Source of DATA
2. Descriptive Statistics.pdf
Ch21 22 data analysis and interpretation
vishal stats.pdf education maths statis
Biostatistics Graphical for grouped data
FREQUENCY DISTRIBUTION ( distribusi frekuensi) - STATISTICS
Mastering Graphical Representations in Data Analysis
Business Statistics - Diagrammatic and Graphic representationPPT.pdf
Ad

More from Debasish Padhy (6)

PDF
1. introduction to statistics
PDF
4. descriptive statistics (central tendency)
PPTX
CISCO presentation
PPTX
Cisco Case study
PPT
Steam powered robots
PPTX
Garnier fructis
1. introduction to statistics
4. descriptive statistics (central tendency)
CISCO presentation
Cisco Case study
Steam powered robots
Garnier fructis

Recently uploaded (20)

PDF
Galatica Smart Energy Infrastructure Startup Pitch Deck
PDF
annual-report-2024-2025 original latest.
PDF
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
PDF
Foundation of Data Science unit number two notes
PPTX
Qualitative Qantitative and Mixed Methods.pptx
PPTX
Computer network topology notes for revision
PDF
.pdf is not working space design for the following data for the following dat...
PPTX
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
PDF
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
PPTX
oil_refinery_comprehensive_20250804084928 (1).pptx
PPTX
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
PPTX
Introduction-to-Cloud-ComputingFinal.pptx
PPTX
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
PPTX
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
PPTX
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
PPTX
IB Computer Science - Internal Assessment.pptx
PPTX
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
PDF
Fluorescence-microscope_Botany_detailed content
PPTX
Supervised vs unsupervised machine learning algorithms
PPTX
MODULE 8 - DISASTER risk PREPAREDNESS.pptx
Galatica Smart Energy Infrastructure Startup Pitch Deck
annual-report-2024-2025 original latest.
22.Patil - Early prediction of Alzheimer’s disease using convolutional neural...
Foundation of Data Science unit number two notes
Qualitative Qantitative and Mixed Methods.pptx
Computer network topology notes for revision
.pdf is not working space design for the following data for the following dat...
AI Strategy room jwfjksfksfjsjsjsjsjfsjfsj
BF and FI - Blockchain, fintech and Financial Innovation Lesson 2.pdf
oil_refinery_comprehensive_20250804084928 (1).pptx
Introduction to Firewall Analytics - Interfirewall and Transfirewall.pptx
Introduction-to-Cloud-ComputingFinal.pptx
ALIMENTARY AND BILIARY CONDITIONS 3-1.pptx
01_intro xxxxxxxxxxfffffffffffaaaaaaaaaaafg
DISORDERS OF THE LIVER, GALLBLADDER AND PANCREASE (1).pptx
IB Computer Science - Internal Assessment.pptx
Introduction to Basics of Ethical Hacking and Penetration Testing -Unit No. 1...
Fluorescence-microscope_Botany_detailed content
Supervised vs unsupervised machine learning algorithms
MODULE 8 - DISASTER risk PREPAREDNESS.pptx

3. data visualisations

  • 1. Data summarisation & visualisation
  • 2. Frequency distribution Summarising data in a presentable format that is in the form of class intervals and frequencies 56 weeks of number of people visiting a store (Ungrouped data) 56 weeks of number of people visiting a store (Grouped data) Class width = Range / Number of classes Range = Max – Min Range = 60 – 11 = 49 Number of classes we want = 5 Class width = (49/5) = 9.8 Round 9.8 = 10 * Rule of thumb is to create between 5 and 15 classes Class interval, Class midpoint, Relative frequencies, Cumulative frequencies for number of people visiting a store Relative frequency = Individual class frequency / Total frequency Relative frequency = 7 / 56 = 0.13 Cumulative frequency is a running total of frequencies through the classes
  • 3. Univariate data visualisation Univariate data visualisation Numerical data Categorical data Histogram Bar graph Quantitative data graphs Qualitative data graphs Ogive Pareto chart Frequency polygon Pie chart Stem and Leaf plot Quantitative data graphs are plotted along a numerical scale Qualitative data graphs are plotted using non- numerical categories
  • 4. Univariate numerical data visualisation (Histogram) 1. Series of continous rectangles represent the frequency of data in given class intervals. 2. X axis : With class mid points and Y axis: With the frequencies. 3. Quick glance at a histogram helps revealing which class intervals produce highest frequency. * If the class intervals are unequal then the width of the rectangle or area of the rectangles can be used for relative comparison.
  • 5. Univariate numerical data visualisation (Frequency polygon) 1. Is like histogram, however instead of using rectangles like a histogram each class frequency is plotted as a dot at the class midpoint and the dots are connected by a series of line segments 2. X axis : With class mid points and Y axis: With the frequencies.
  • 6. Univariate numerical data visualisation (Ogive) 1. Ogive is a cumulative frequency polygon 2. X axis :Always class end points and Y axis: With the cumulative frequencies. * Generally used by decission makers to see the running totals
  • 7. Univariate numerical data visualisation (Stem and Leaf Plot) 1. Constructed by separating the digits for each number of data into two groups a stem and a leaf. 2. Stem: Consists higher valued digits & Leaves: Contain lower values 56 weeks of number of people visiting a store (Ungrouped data) Stem & Leaf plot
  • 8. Univariate categorical data visualisation (Bar chart)
  • 9. Univariate categorical data visualisation (Pie chart) 13% 7% 25% 32% 24% Total sales contribution "Product wise" Product 1 Product 2 Product 3 Product 4 Product 5
  • 10. Univariate categorical data visualisation (Pareto chart) Product 4 Product 3 Product 5 Product 1 Product 2 0 50 100 150 200 250 300 350 400 450 0% 10% 20% 30% 40% 50% 60% 70% 80% 90% 100% 32% 57% 81% 93% 100% Sales (Pareto chart) Product Totalsales Cumulativeproportion Sort the data in the descending order and use cumulative proportion to plot pareto chart. * Generally pareto chart are used in defect analysis that is types of defects that occur with a product and service. * Most common types of defects ranked in order of occurence from left to right and accordingly control persons analyse pareto chart and make the possible improvement from time to time.
  • 11. Bivariate data visualisation Bivariate data visualisation Cross tabulation Scatter plot A two dimensional table used to display the frequency counts for two variables simultaneously. Two dimensional graph plot of pairs of points from two numerical variables.
  • 12. Bivariate data visualisation (Cross tabulation) Employee survey data Cross tabulation * Cross tabulation is often called as contigency table
  • 13. Bivariate data visualisation (Scatter plot) 63 64 65 66 67 68 69 70 71 72 73 0.00 10.00 20.00 30.00 40.00 50.00 60.00 70.00 80.00 Height Versus Weight (Scatter plot) Height (Inches) Weight(Kg's) Scatter plot is often used to understand possible relationship between to variables. * Here we are trying to understand the relationship between Height and Weight.