SlideShare a Scribd company logo
4
Most read
6
Most read
7
Most read
Four Types of Data
Data types
Quantitative Qualitative
Discrete Continuous
Understanding data type is an important concept in statistics, when you
are designing an experiment, you want to know what type of data you are
dealing with, that will decide what type of statistical analysis,
visualizations and prediction algorithms could be used. Also, you can use
a particular statistical measurement only for specific data types.
Qualitative & Quantitative are two basic types of data. As the name
suggests, qualitative deals with the quality (Characteristics) & some
statisticians calls it categorical. Quantitative deals with numbers. Below
is the difference between Data Types.
Qualitative Quantitative
Non numerical, Categories, Attributes Numerical
Examples: Colour, Ethnicity, Gender, political leanings,
religion, Education etc.
Examples: time, count, weight, length, wages, Income
Even though some data are number mathematical
operations are meaningless. Zipcode and phone
numbers
Mathematical operations are meaningful.
Table1: Difference between types of data
• Quantitative are of two types viz. Discrete and Continuous. They are
explained below:
• Discrete: Discrete data are countable or Finite. Finite means there are certain
number of values you can pick from and countable means you can count
them. This type of data can’t be measured but it can be counted. These are
natural numbers and are count of something.
• Let me give you an example of countable, if you are asked to count number of cars on
road during certain time of the day, it can take numbers like 100,125 or 1000 but never
100.5 or 125.72 etc.
• Likewise, example of finite is, the number which comes on rolling a dice. There are only 6
possible choices like 1,2,3…6 but never more than 6 or 4.5 etc. likewise if you flip a coin it
has only heads or tails, so, there are certain number of values you can pick from.
• Continuous: Continuous data are uncountable or infinite or to put it
differently, there are infinite number of possible values and is not countable.
Usually is a measurement of something and cannot be counted. Continuous
can take absolutely any value.
• For example, a person’s height or weight. Height can be 5.23, 5.24, 5.76 etc. similarly
weight can be any value like 75.82 Kgs or 62.35 kgs etc. Height, weight, length, speeds,
temperatures etc. are examples of continuous data.
Levels of measurement/data
QuantitativeQualitative
Interval RatiosNominal Ordinal
In the figure above as we move from left to right, we see an increase in
order of information i.e. Ordinal will have the characteristics of Nominal
and something more, similarly, interval will have the characteristics of
Ordinal and something more and Ratios will have characteristics of Interval
and something more… Let’s see what each one of these data levels means.
• Nominal:
• Nominal means a name only, Nominal scales are used for labelling variables, without
any quantitative value, it is a categorical data which has no order. Red car, Blue car,
yellow car, Black Car, White car etc. are just categories, they don’t have any order to
them. All you can do is list them out. For example, Flavour, Religion, Ethnicity, Gender
etc. these are nominal data without any order. For statistical analysis we can assign
numbers to these categories for example: Red=1, White=2 and Black=3, these
numerical values assigned does not have any mathematical significance. A sub-type
of nominal scale with only two categories (e.g. male/female, hot/cold, good/bad) is
called “dichotomous.”
• Ordinal:
• it is a categorical data which has an implied order. Like size of clothing as Small,
Medium and Large or Likert scale questions having a scale from 1 to 5 for example
Agree=1, neutral=2 disagree=3 etc. The numbers do not have any mathematical
significance, but they are the labels. Subtraction and division don’t make any
mathematical significance. For example, Ranking (ranking in school exams i.e 1st,2nd
and 3rd etc., ranking in the Army i.e Captain, Major, Colonel etc.), survey done on
Likert scale, educational background(under graduate, Postgraduate, Doctoral etc.).in
all the above examples there is an order associated with it.
• Interval: Interval values are Categorical and ordered data in addition to that
they have scale to them. Interval values data don’t have a true zero. True
zero means absence of the variable. For example, zero degrees
temperature does not mean there is no temperature, it just means, it is too
cold. Here Addition and subtraction are significant, but division and
multiplication are not. With interval data, we can add and subtract, but we
cannot multiply, divide or calculate ratios. Good examples are Time,
temperature
• Ratio: Ratio values are Categorical, ordered data having scale to them and
they have a natural or true zero. They can be discrete or continuous. Good
examples are height, weight, length, Duration etc. Since they have natural
zero it allows for a wide range of both descriptive and inferential statistics
to be applied. For example, zero dollars means there is no money. If length
of two rods is 2 fts and 4 fts…then we can say that one rod is twice the
length of the other. Sales figures, Sales of zero means that you sold nothing
and so sales didn’t exist.
Summary
• In this post, you discovered the different data types that are used by
Data Scientists. You learned the difference between discrete &
continuous data and learned about levels of data viz. nominal,
ordinal, interval and ratio measurement scales.
• Nominal are used to “name,” or label a data.
• Ordinal provides information about the order of values,
• Interval give us the order of values plus the ability to quantify the difference
between each one.
• Finally, Ratio give us the order, interval values, plus the ability to calculate
ratios since a “true zero” can be defined.

More Related Content

PPTX
1.2 types of data
PPT
Epidemiology notes
PPTX
DATA Types
PPTX
Sources of data collection
PPTX
Introduction of Biostatistics
PPTX
Basics stat ppt-types of data
PPTX
1.2 types of data
Epidemiology notes
DATA Types
Sources of data collection
Introduction of Biostatistics
Basics stat ppt-types of data

What's hot (20)

PPT
Software packages for statistical analysis - SPSS
PPTX
Data and its Types
PPTX
Introduction to Statistics (Part -I)
PPTX
PPT
Research Methodology (MBA II SEM) - Introduction to SPSS
PPT
Bivariate analysis
PPTX
Correlation analysis
PDF
Statistics: Chapter One
PPTX
(Manual spss)
PPTX
Type of data
PPTX
Testing of hypotheses
PPT
Introduction to spss
PPTX
Introduction to statistics
PPTX
Basics of Educational Statistics (Descriptive statistics)
PPTX
Basic Statistics
PPT
Spss lecture notes
PPTX
Parametric vs Nonparametric Tests: When to use which
PPT
Univariate Analysis
PPTX
Statistical inference
PPSX
Types of Statistics
Software packages for statistical analysis - SPSS
Data and its Types
Introduction to Statistics (Part -I)
Research Methodology (MBA II SEM) - Introduction to SPSS
Bivariate analysis
Correlation analysis
Statistics: Chapter One
(Manual spss)
Type of data
Testing of hypotheses
Introduction to spss
Introduction to statistics
Basics of Educational Statistics (Descriptive statistics)
Basic Statistics
Spss lecture notes
Parametric vs Nonparametric Tests: When to use which
Univariate Analysis
Statistical inference
Types of Statistics
Ad

Similar to Four data types Data Scientist should know (20)

PPTX
Introduction of Data and Type of data in Statstics
DOCX
Categorical DataCategorical data represents characteristics..docx
PPTX
Data analysis for business decisions
PPTX
Types of Data, Key Concept
PPTX
Data And Variable In Scientific Research
PPTX
TYPES OF DATA.pptx- Bsc nursing 4th year- research and statistics
PPTX
TYPES OF DATA- Research and statistics - BSc Nursing 4th year
PPTX
Data analytics – I: classification of data and measurement scale
PPTX
Measurementand scaling-10
PPTX
Presentation1.pptx
PPTX
Basic statistics
PDF
INTRODUCTION TO STATISTICS QUANTITATIVE TECHNIQUES.pdf
PDF
2. Numerical Descriptive Measures[1].pdf
PPTX
Introduction to Statistics in Nursing.
PPTX
Introduction to Health statistics and biostats
PPTX
Statistics and Business Research Methods
PPTX
Data presentation. Faculty will demonstrate use of MS excel in preparing var...
PPTX
UNIT 3 Measurement and scaling.pptx university
PPTX
Scale presentations in research methodology
Introduction of Data and Type of data in Statstics
Categorical DataCategorical data represents characteristics..docx
Data analysis for business decisions
Types of Data, Key Concept
Data And Variable In Scientific Research
TYPES OF DATA.pptx- Bsc nursing 4th year- research and statistics
TYPES OF DATA- Research and statistics - BSc Nursing 4th year
Data analytics – I: classification of data and measurement scale
Measurementand scaling-10
Presentation1.pptx
Basic statistics
INTRODUCTION TO STATISTICS QUANTITATIVE TECHNIQUES.pdf
2. Numerical Descriptive Measures[1].pdf
Introduction to Statistics in Nursing.
Introduction to Health statistics and biostats
Statistics and Business Research Methods
Data presentation. Faculty will demonstrate use of MS excel in preparing var...
UNIT 3 Measurement and scaling.pptx university
Scale presentations in research methodology
Ad

Recently uploaded (20)

PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
Renaissance Architecture: A Journey from Faith to Humanism
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Lesson notes of climatology university.
PPTX
Cell Types and Its function , kingdom of life
PPTX
Institutional Correction lecture only . . .
PDF
Computing-Curriculum for Schools in Ghana
PDF
Classroom Observation Tools for Teachers
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PDF
RMMM.pdf make it easy to upload and study
PPTX
Pharma ospi slides which help in ospi learning
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Renaissance Architecture: A Journey from Faith to Humanism
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Abdominal Access Techniques with Prof. Dr. R K Mishra
Supply Chain Operations Speaking Notes -ICLT Program
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
human mycosis Human fungal infections are called human mycosis..pptx
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Final Presentation General Medicine 03-08-2024.pptx
Lesson notes of climatology university.
Cell Types and Its function , kingdom of life
Institutional Correction lecture only . . .
Computing-Curriculum for Schools in Ghana
Classroom Observation Tools for Teachers
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PPH.pptx obstetrics and gynecology in nursing
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
RMMM.pdf make it easy to upload and study
Pharma ospi slides which help in ospi learning

Four data types Data Scientist should know

  • 2. Data types Quantitative Qualitative Discrete Continuous Understanding data type is an important concept in statistics, when you are designing an experiment, you want to know what type of data you are dealing with, that will decide what type of statistical analysis, visualizations and prediction algorithms could be used. Also, you can use a particular statistical measurement only for specific data types.
  • 3. Qualitative & Quantitative are two basic types of data. As the name suggests, qualitative deals with the quality (Characteristics) & some statisticians calls it categorical. Quantitative deals with numbers. Below is the difference between Data Types. Qualitative Quantitative Non numerical, Categories, Attributes Numerical Examples: Colour, Ethnicity, Gender, political leanings, religion, Education etc. Examples: time, count, weight, length, wages, Income Even though some data are number mathematical operations are meaningless. Zipcode and phone numbers Mathematical operations are meaningful. Table1: Difference between types of data
  • 4. • Quantitative are of two types viz. Discrete and Continuous. They are explained below: • Discrete: Discrete data are countable or Finite. Finite means there are certain number of values you can pick from and countable means you can count them. This type of data can’t be measured but it can be counted. These are natural numbers and are count of something. • Let me give you an example of countable, if you are asked to count number of cars on road during certain time of the day, it can take numbers like 100,125 or 1000 but never 100.5 or 125.72 etc. • Likewise, example of finite is, the number which comes on rolling a dice. There are only 6 possible choices like 1,2,3…6 but never more than 6 or 4.5 etc. likewise if you flip a coin it has only heads or tails, so, there are certain number of values you can pick from. • Continuous: Continuous data are uncountable or infinite or to put it differently, there are infinite number of possible values and is not countable. Usually is a measurement of something and cannot be counted. Continuous can take absolutely any value. • For example, a person’s height or weight. Height can be 5.23, 5.24, 5.76 etc. similarly weight can be any value like 75.82 Kgs or 62.35 kgs etc. Height, weight, length, speeds, temperatures etc. are examples of continuous data.
  • 5. Levels of measurement/data QuantitativeQualitative Interval RatiosNominal Ordinal In the figure above as we move from left to right, we see an increase in order of information i.e. Ordinal will have the characteristics of Nominal and something more, similarly, interval will have the characteristics of Ordinal and something more and Ratios will have characteristics of Interval and something more… Let’s see what each one of these data levels means.
  • 6. • Nominal: • Nominal means a name only, Nominal scales are used for labelling variables, without any quantitative value, it is a categorical data which has no order. Red car, Blue car, yellow car, Black Car, White car etc. are just categories, they don’t have any order to them. All you can do is list them out. For example, Flavour, Religion, Ethnicity, Gender etc. these are nominal data without any order. For statistical analysis we can assign numbers to these categories for example: Red=1, White=2 and Black=3, these numerical values assigned does not have any mathematical significance. A sub-type of nominal scale with only two categories (e.g. male/female, hot/cold, good/bad) is called “dichotomous.” • Ordinal: • it is a categorical data which has an implied order. Like size of clothing as Small, Medium and Large or Likert scale questions having a scale from 1 to 5 for example Agree=1, neutral=2 disagree=3 etc. The numbers do not have any mathematical significance, but they are the labels. Subtraction and division don’t make any mathematical significance. For example, Ranking (ranking in school exams i.e 1st,2nd and 3rd etc., ranking in the Army i.e Captain, Major, Colonel etc.), survey done on Likert scale, educational background(under graduate, Postgraduate, Doctoral etc.).in all the above examples there is an order associated with it.
  • 7. • Interval: Interval values are Categorical and ordered data in addition to that they have scale to them. Interval values data don’t have a true zero. True zero means absence of the variable. For example, zero degrees temperature does not mean there is no temperature, it just means, it is too cold. Here Addition and subtraction are significant, but division and multiplication are not. With interval data, we can add and subtract, but we cannot multiply, divide or calculate ratios. Good examples are Time, temperature • Ratio: Ratio values are Categorical, ordered data having scale to them and they have a natural or true zero. They can be discrete or continuous. Good examples are height, weight, length, Duration etc. Since they have natural zero it allows for a wide range of both descriptive and inferential statistics to be applied. For example, zero dollars means there is no money. If length of two rods is 2 fts and 4 fts…then we can say that one rod is twice the length of the other. Sales figures, Sales of zero means that you sold nothing and so sales didn’t exist.
  • 8. Summary • In this post, you discovered the different data types that are used by Data Scientists. You learned the difference between discrete & continuous data and learned about levels of data viz. nominal, ordinal, interval and ratio measurement scales. • Nominal are used to “name,” or label a data. • Ordinal provides information about the order of values, • Interval give us the order of values plus the ability to quantify the difference between each one. • Finally, Ratio give us the order, interval values, plus the ability to calculate ratios since a “true zero” can be defined.