SlideShare a Scribd company logo
2
Most read
5
Most read
9
Most read
“Strengthening the Research Capacity through
Statistical Interventions”
Dr. Jitendra Kumar Chaudhary
Assistant Professor
Department of Animal Genetics & Breeding,
CVSc. & A.H., Central Agricultural University (CAU),
Selesih, Aizawl, Mizoram-796014
Email-vetjitu@gmail.com
Data Transformation:
If a measurement variable does not fit a normal
distribution or has greatly different standard
deviations in different groups, you should go for data
transformation.
Normality Test:
1)Skewness & Kurtosis
2) Shapiro Wilk test
3) Histogram
Procedure to test normality in SPSS
Go to analyze, descriptive explore, take measurement
in dependent and group in factor box tick on plots, tick on
histogram and normality plots with test, continue and ok.
Results
Divide the skewness and kurtosis by std. error and see if the
values lies between -1.96 to +1.96 then the data are normally
distributed (a minor difference is also normally distributed).
See that the Shapiro wilk test is significant or not, if it is non-
significant means data is normal otherwise data is not normal.
The following are the three transformations,
which are being used most commonly, in biological
research
a) Logarithmic Transformation
b) Square root Transformation
c) Arc sine or Angular Transformation
Logarithmic transformation for whole number counts
with wide range
 This transformation is suitable for the data where the
variance is proportional to the square of the mean or the
coefficient of the variation (S.D./Mean) is constant or
where effects are multiplicative.
 These conditions are generally found in the data that are
whole numbers and cover a wide range of values.
 For example- number of insect per plot, number of egg
mass per plant or per unit area etc.
Natural logarithm of a positive number=LN (Give cell
number for which transformation to be done),
Natural logarithm is based on the constant e
(2.718281828845904)
Logarithm of a positive number at base 10=Log10 (Give
cell number for which transformation to be done), Or Log
(number, 10)
Square root transformation
 The transformation is appropriate for the data sets where
the variance is proportional to the mean.
 Here the data consists of small whole numbers, for
example data obtained in counting rare events, such as
number of infested plants in a plot, the number of insects
caught in traps, number of weeds per plot.
Square root transformation
 These data sets generally follow the Poisson distribution
and square root transformation approximates Poisson to
normal distribution.
 Square root transformation=SQRT (Give cell number for
which transformation to be done)
Arcsine Transformation for Proportions or Percentage
 The transformation is appropriate for the data on
proportion i.e. data obtained from a count and the data
expressed as decimal fraction and percentage.
 The distribution of percentage is binomial and this
transformation makes the distribution normal.
Arcsine Transformation=ASIN (Cell Identification e.g.
(0.05)*(180)*(7/22) result will be in degrees.

More Related Content

PPTX
Sampling Distributions and Estimators
PDF
Inferential Statistics
PPT
Descriptive Statistics and Data Visualization
PDF
PPTX
Four data types Data Scientist should know
PPT
Introduction to statistics
PPTX
Transformation of variables
PPTX
Presentation chi-square test & Anova
Sampling Distributions and Estimators
Inferential Statistics
Descriptive Statistics and Data Visualization
Four data types Data Scientist should know
Introduction to statistics
Transformation of variables
Presentation chi-square test & Anova

What's hot (20)

PPTX
Basic Concepts of Split-Plot Design,Analysis Of Covariance(ANCOVA)& Response ...
PPT
RBD design.ppt
PPTX
Basic terminology of experimental design in Agriculture
PPTX
Experimental design.pptx
PPT
Split Plot Design(ppt).ppt
PPTX
Latin square design
PPTX
Experimental design in Plant Breeding
PPTX
Statistics and agricultural
PDF
Split-plot Designs
PPTX
Completely randomized design
PPTX
Ducan’s multiple range test - - Dr. Manu Melwin Joy - School of Management St...
PPT
ANOVA & EXPERIMENTAL DESIGNS
PPTX
D-Square statistic
PDF
Unit 1 lecture-1 soil fertility and soil productivity
PPTX
Crop Modeling - Types of crop growth models in agriculture
PPT
Principles of experimental design
PPTX
Remote sensing in agriculture
PPTX
Global agriculture research system
PDF
Randomized complete block_design_rcbd_
PPTX
comparison of CRD, RBD and LSD
Basic Concepts of Split-Plot Design,Analysis Of Covariance(ANCOVA)& Response ...
RBD design.ppt
Basic terminology of experimental design in Agriculture
Experimental design.pptx
Split Plot Design(ppt).ppt
Latin square design
Experimental design in Plant Breeding
Statistics and agricultural
Split-plot Designs
Completely randomized design
Ducan’s multiple range test - - Dr. Manu Melwin Joy - School of Management St...
ANOVA & EXPERIMENTAL DESIGNS
D-Square statistic
Unit 1 lecture-1 soil fertility and soil productivity
Crop Modeling - Types of crop growth models in agriculture
Principles of experimental design
Remote sensing in agriculture
Global agriculture research system
Randomized complete block_design_rcbd_
comparison of CRD, RBD and LSD
Ad

Similar to Data Transformation.ppt (20)

PPTX
Transformation technique and when it is used
PDF
Transformasi Data Penelitian
PPTX
Data Normality (1).pptx
PPTX
PPTX
Confidently Conduct and Interpret Tests for Normality
PDF
Lecture 2 practical_guidelines_assignment
PPTX
Transformers: Data in Disguise
PPTX
Computing transformations
PDF
Normality tests
PDF
Transformation To Normality, References
PDF
article.pdf
PPTX
Normal distribtion curve
PDF
Why are data transformations a bad choice in statistics
PPTX
Lec 5 - Normality Testing.pptx
PPTX
Normality evaluation in a data
PPTX
Normality test on SPSS
PDF
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
PPT
Statistics Primer
PDF
Data Science - Part III - EDA & Model Selection
Transformation technique and when it is used
Transformasi Data Penelitian
Data Normality (1).pptx
Confidently Conduct and Interpret Tests for Normality
Lecture 2 practical_guidelines_assignment
Transformers: Data in Disguise
Computing transformations
Normality tests
Transformation To Normality, References
article.pdf
Normal distribtion curve
Why are data transformations a bad choice in statistics
Lec 5 - Normality Testing.pptx
Normality evaluation in a data
Normality test on SPSS
INFLUENCE OF DATA GEOMETRY IN RANDOM SUBSET FEATURE SELECTION
Statistics Primer
Data Science - Part III - EDA & Model Selection
Ad

Recently uploaded (20)

PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPTX
PPH.pptx obstetrics and gynecology in nursing
PDF
Anesthesia in Laparoscopic Surgery in India
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
PPTX
Lesson notes of climatology university.
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PPTX
Cell Structure & Organelles in detailed.
PDF
Sports Quiz easy sports quiz sports quiz
PDF
Classroom Observation Tools for Teachers
PPTX
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
GDM (1) (1).pptx small presentation for students
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
O7-L3 Supply Chain Operations - ICLT Program
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PPH.pptx obstetrics and gynecology in nursing
Anesthesia in Laparoscopic Surgery in India
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
Microbial diseases, their pathogenesis and prophylaxis
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
Lesson notes of climatology university.
102 student loan defaulters named and shamed – Is someone you know on the list?
STATICS OF THE RIGID BODIES Hibbelers.pdf
human mycosis Human fungal infections are called human mycosis..pptx
Cell Structure & Organelles in detailed.
Sports Quiz easy sports quiz sports quiz
Classroom Observation Tools for Teachers
Introduction_to_Human_Anatomy_and_Physiology_for_B.Pharm.pptx
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
FourierSeries-QuestionsWithAnswers(Part-A).pdf
GDM (1) (1).pptx small presentation for students
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...

Data Transformation.ppt

  • 1. “Strengthening the Research Capacity through Statistical Interventions” Dr. Jitendra Kumar Chaudhary Assistant Professor Department of Animal Genetics & Breeding, CVSc. & A.H., Central Agricultural University (CAU), Selesih, Aizawl, Mizoram-796014 Email-vetjitu@gmail.com
  • 2. Data Transformation: If a measurement variable does not fit a normal distribution or has greatly different standard deviations in different groups, you should go for data transformation. Normality Test: 1)Skewness & Kurtosis 2) Shapiro Wilk test 3) Histogram
  • 3. Procedure to test normality in SPSS Go to analyze, descriptive explore, take measurement in dependent and group in factor box tick on plots, tick on histogram and normality plots with test, continue and ok. Results Divide the skewness and kurtosis by std. error and see if the values lies between -1.96 to +1.96 then the data are normally distributed (a minor difference is also normally distributed). See that the Shapiro wilk test is significant or not, if it is non- significant means data is normal otherwise data is not normal.
  • 4. The following are the three transformations, which are being used most commonly, in biological research a) Logarithmic Transformation b) Square root Transformation c) Arc sine or Angular Transformation
  • 5. Logarithmic transformation for whole number counts with wide range  This transformation is suitable for the data where the variance is proportional to the square of the mean or the coefficient of the variation (S.D./Mean) is constant or where effects are multiplicative.  These conditions are generally found in the data that are whole numbers and cover a wide range of values.  For example- number of insect per plot, number of egg mass per plant or per unit area etc.
  • 6. Natural logarithm of a positive number=LN (Give cell number for which transformation to be done), Natural logarithm is based on the constant e (2.718281828845904) Logarithm of a positive number at base 10=Log10 (Give cell number for which transformation to be done), Or Log (number, 10)
  • 7. Square root transformation  The transformation is appropriate for the data sets where the variance is proportional to the mean.  Here the data consists of small whole numbers, for example data obtained in counting rare events, such as number of infested plants in a plot, the number of insects caught in traps, number of weeds per plot.
  • 8. Square root transformation  These data sets generally follow the Poisson distribution and square root transformation approximates Poisson to normal distribution.  Square root transformation=SQRT (Give cell number for which transformation to be done)
  • 9. Arcsine Transformation for Proportions or Percentage  The transformation is appropriate for the data on proportion i.e. data obtained from a count and the data expressed as decimal fraction and percentage.  The distribution of percentage is binomial and this transformation makes the distribution normal. Arcsine Transformation=ASIN (Cell Identification e.g. (0.05)*(180)*(7/22) result will be in degrees.