SlideShare a Scribd company logo
Bio-statistics in Bio-medical
Research
Dr. Shinjan Patra
D.M. ( Endocrinology) Resident
All India Institute of Medical Sciences
Jodhpur
Introduction
• Collecting, analyzing, and interpreting data are
essential components of biomedical research
and require biostatistics.
• Doing various statistical tests has been made
easy by sophisticated computer software.
First learning with Bio-statistics
Evidence Based!!
Clinical
Expertise
Research
Evidence
Patient
Preferences
EBM
Are our patients
always able to judge
what is best for
them?
Our clinical experience
alone may not improve
our patient’s lives
A lot of research is
aimed at improving
the lives of patients!
4
Basic need for Bio-statistics
• To choose the right statistical test for the
computer to perform based on the nature of
data derived from one’s own research
• To understand if an analysis was performed
appropriately during review and interpretation
of others’ research
Drawing
Conclusions
Designing &
Implementing
TRUTH IN THE
UNIVERSE
Research
Question
TRUTH IN THE
STUDY
Study
Plan
FINDINGS IN
THE STUDY
Actual
Study
Infer Infer
Design Implement
Process of Research Project
6
Methods of Analysis
I. Descriptive Methods :
Tables
Diagrams
Charts
II. Inferential Methods :
 Estimation
 Point Estimation
Mean / proportion
etc.
 Interval Estimation
i.e. Confidence interval of
point estimate
 Hypotheses Testing
 Comparison between the
treatments
 Association
7
Overview
• Variables
• Distribution of Data
• Statistical Tests
• Hypothesis testing and Error
• Confounding/Bias
• Measures of Association
• Diagnostic Tests
• Regression analysis
Variables
• Any characteristic that can be observed,
measured, or categorized
• Divided into Two-
Categorical
Continuous
Categorical
• Not suitable for quantification; classified into
categories
Nominal- Named categories, with no implied
value- like- blood groups. Existential, no
inherent order or superiority. Nominal data
with only two groups are referred to as
dichotomous or binary (male or female)
Ordinal: Named with an order/ superiority like
grades of GO
Continuous
• can have an infinite number of possible values
Interval: Equal interval between values but no
meaningful zero point ( body temp)
Ratio: Equal intervals with a meaningful zero
point and all mathematical operations are
functional ( amount of secretions from a
diabetic wound)
Bio-Statistics in Bio-Medical research
13
Intervening Variable
• An intervening variable is on the causal pathway to the
outcome
Choice of Statistical test
• Parametric tests can be used with interval and
ratio data but not with nominal or ordinal
data
• Nonparametric tests can be used with any
type of variable, including nominal or ordinal
data
Distribution of the Data
Central Tendency
• Estimates the “center” of the Distribution
 Mean- representative of all data points and is
the most efficient estimator of the middle of a
normal (Gaussian) distribution;
 However, inappropriate as a measure of
central tendency if data are skewed.
 Influenced by outlying values
 Commonly used for interval and ratio data.
Median- not influenced by outlying values
 more appropriate for skewed data
 commonly used for ordinal data.
Mode- particularly useful while describing
data distributed in a bimodal pattern when
mean and median are not appropriate.
commonly used with nominal data.
Normal Distribution
• Gaussian. Symmetric bell-shaped frequency
distribution, in which mean, median, and
mode all have the same value
• Appropriate statistical test would be a
parametric test such as a t-test or an analysis
of variance (ANOVA).
• Data are usually represented as mean (SD).
Bio-Statistics in Bio-Medical research
Skewed Distribution
• Positive/Right and Negative/Left
• Appropriate statistical test would be a
nonparametric test, such as Wilcoxon test or
Mann-Whitney test.
• Data are usually represented as median,
interquartile range (IQR)
Bio-Statistics in Bio-Medical research
Measures of Dispersion
• Range- difference between the highest and
the lowest values. Range can change
drastically when the study is repeated
• INTERQUARTILE RANGE (IQR) is the range
between the 25th and 75th percentiles or the
difference between the medians of the lower
half and upper half of the data and comprises
the middle 50% of the data
• VARIANCE is a measure of dispersion or average
deviation from the mean
• STANDARD DEVIATION (SD) is the square root of
variance and is the most common measure of
dispersion used for normally distributed data
• STANDARD ERROR OF MEAN (SEM) is calculated
by dividing the SD by the square root of n.
Statistical tests
• Parametric- assume the underlying population
to be normally distributed and are based on
means and SDs
• Non-parametric- No assumption of population
distribution
t Tests
• Student’s t test is a simple, commonly used
parametric test to compare two groups of
continuous variables
• Paired : Each patient/subject serves as his/her
own control before and after an intervention
• Unpaired: Two groups of patients/subjects are
compared with each other
Analysis of Variance ( ANOVA)
• One-way ANOVA is an extension of the two-
sample t test to three or more samples and
deals with statistical test on more than two
groups
• Other methods, such as planned or post hoc
comparisons, are conducted to examine
specific comparisons among individual means
Wilcoxon Rank-Sum test
• Mostly for Skewed distribution and Ordinal
data.
• Others- x2 test (Chi-squared test)- common
test used to compare categorical data. Data
first entered into a 2*2 contingency table
• If the numbers are small (expected value is
<5), an alternative test called Fisher’s exact
test.
Bio-Statistics in Bio-Medical research
Simplified form
• Nominal variable- Chi Square / Fischers
• Ordinal Variable- Wilcoxon/ Mann-Whitney U
• Interval/Ratio- Normal distribution- t test/ANOVA
• Interval/Ratio- Skewed distribution- Wilcoxon/
Mann-Whitney U
Hypothesis testing
• Null hypothesis refers to restating the
research hypothesis to one that proposes no
difference between groups being compared
• An alternative hypothesis proposes an
association- One-sided/two-sided
Errors
• TYPE I ERROR (false-positive, also known as a
rejection error) is rejection of a null
hypothesis that is actually true in the
population
• TYPE II ERROR (false-negative, also known as
an acceptance error) is failure to reject a null
hypothesis that is actually false
Bio-Statistics in Bio-Medical research
Interpretation of p Value
• p VALUE is the probability of the null
hypothesis being true by chance alone.
• It is also the probability of committing a type I
error.
• p value of 0.05 or less is commonly used to
denote significance
• This value informs the investigator that there
is at least a 95% chance that the two samples
represent different populations
• lower p value (<0.01) indicates a lower likelihood
(1%) that the null hypothesis may be true due to
chance alone.
• lower p value does not infer a higher strength of
association or clinical importance of an
association
• Factors that tend to decrease p value and
increase significance are increased sample size,
increased difference in control and experimental
means and less variance
Bio-Statistics in Bio-Medical research
Confidence interval
• Range of values that you expect to include the
actual mean of the true population
• Typically 95% confidence intervals are used in
research
• Values at either extreme of this range are
called confidence limits
Sample Size and C.I
• The larger the sample size, the narrower the
C.I .
• p value and C.I together provide the best
information about the role of chance
• Sample size is an important determinant of
the power of the study to detect significant
differences
Confounding
• Distortion in a measure of effect that may arise
because we fail to control for other variables that
are previously known risk factors for the health
outcome being studied
• Can lead to the observation of apparent
differences between the study groups when they
do not truly exist, or conversely, the observation
of no difference when they do exist.
38
Confounding variable
• Independent risk factor (cause) of outcome
• Unevenly distributed among exposed and
unexposed
• Not on the causal pathway between exposure
and outcome
39
THE DIFFERENCE BETWEEN BIAS
AND CONFOUNDING
 Bias creates an association that is not true,
 Confounding describes an association that is true,
but potentially misleading.
40
Example
• 2 groups of bottle-fed babies and breast-milk
fed babies compared for Gastro-enteritis ( GE)
• Bias- Finding less GE in bottle-fed babies due
to less follow-up to the doctors
• Confounding- Better-hygiene and less
crowding can prevent GE in any of the groups;
but that doesn’t correspond to true protective
effects of breast-milk
Least square mean changes
• When there is study imbalance such that
some blocks of experimental units are under-
represented relative to other block
• Least square means (estimated population
marginal means) provide an opportunity to
obtain an unbiased estimate of averages in
the face of this kind of study imbalance
Bio-Statistics in Bio-Medical research
Measures of Association
• Absolute risk = a/(a + b) or c/ ( c+ d)
• Relative risk/ Risk Ratio = (a/[a + b]) / (c/[c +
d])
• Hazard ratio is a measure of relative risk over
time in circumstances in which we are
interested not only in the total number of
events, but in their timing as well.
Diagnostic tests
Bio-Statistics in Bio-Medical research
Bio-Statistics in Bio-Medical research
Key points
• Sensitivity and specificity are prevalence
independent.
• Increased prevalence of disease will increase PPV
• Reduced disease prevalence increase NPV
• Increasing the threshold for a positive test
reduces false-positives and increases specificity
• Reducing the threshold or cutoff value for a
positive test reduces false-negatives and
increases sensitivity
Bio-Statistics in Bio-Medical research
Gold standard
• Unambiguous method of determining
whether or not a patient has a particular
disease or outcome
Regression Analysis
• Simple Regression- Many relationships
between variables can be fit to a straight line.
Like duration of treatment
• Logistics Regression- In situations in which the
response of interest is dichotomous (binary)
rather than continuous. Like outcome of death
or alive
Summary
• Bio-statistics is integral part of understanding
any bio-medical research and Evidence-based
medicines.
• The concept of various variable and
understanding the various statistical tests are
equally important
• Sample size and the confidence limit concepts
hold the key
Thank you
CAS-3 & CAS-4
• CAS-3- PPV- 61%
• NPV- 60%
• Sensitivity- 64%
• Specificity- 57%
• CAS-4- PPV- 80%
• NPV- 64%
• Sensitivity- 55%
• Specificity- 86%
• M. Ph. Mourits et al.

More Related Content

PDF
Sample size calculation in medical research
PPTX
Statistical Power
PPTX
Meta analysis
PPTX
Evaluating diagnostic tests.pptx
PPTX
Sudhakar singh meta analysis
PPTX
Medical Statistics.pptx
PPTX
Sample size calculation - a brief overview
PPTX
Statistical significance vs Clinical significance
Sample size calculation in medical research
Statistical Power
Meta analysis
Evaluating diagnostic tests.pptx
Sudhakar singh meta analysis
Medical Statistics.pptx
Sample size calculation - a brief overview
Statistical significance vs Clinical significance

What's hot (20)

PDF
Survival analysis & Kaplan Meire
PDF
Bias and confounding
PPTX
Sample size calculation
PPTX
Nested case control,
PPTX
Analyzing the randomised control trial (rct)
PPTX
Odds ratio
PDF
Bias and Confounding
PPTX
Techniques in clinical epidemiology
PPT
Bradford Hill Criteria.ppt
PPTX
Odds ratio and confidence interval
PPTX
Case control study
PDF
Cross sectional study
PPTX
NNT: Number Needed to Treat
PPTX
Study design in research
PPTX
Bias in epidemiology uploaded
PPTX
PPTX
Sample size calculation
PDF
Measuring Disease Frequency
PPT
Randomization
Survival analysis & Kaplan Meire
Bias and confounding
Sample size calculation
Nested case control,
Analyzing the randomised control trial (rct)
Odds ratio
Bias and Confounding
Techniques in clinical epidemiology
Bradford Hill Criteria.ppt
Odds ratio and confidence interval
Case control study
Cross sectional study
NNT: Number Needed to Treat
Study design in research
Bias in epidemiology uploaded
Sample size calculation
Measuring Disease Frequency
Randomization
Ad

Similar to Bio-Statistics in Bio-Medical research (20)

PPTX
TEST OF SIGNIFICANCE.pptx
PPTX
Biostatistics.pptx
PPT
Statistics basics for oncologist kiran
PPTX
scope and need of biostatics
PPTX
PPTX
Application of statistical tests in Biomedical Research .pptx
PPT
Overview-of-Biostatistics-Jody-Krieman-5-6-09 (1).ppt
PPT
Overview-of-Biostatistics-Jody-Kriemanpt
PPTX
BIOSTATISTICS.pptx
PPTX
Overview of different statistical tests used in epidemiological
PPT
PARAMETRIC TEST in Public health dentistry.ppt
PPT
Biostatistics
PPTX
PRESENTATION ON TESTS OF SIGNIFICANCE.pptx
PPTX
PARAMETRIC TESTS.pptx
PPT
statistics introduction
PPT
Biostatistics
PPTX
Introduction to Basic Biostatistics (Biostats)
PPT
Introductory Statistics
PPTX
Tests of significance Periodontology
TEST OF SIGNIFICANCE.pptx
Biostatistics.pptx
Statistics basics for oncologist kiran
scope and need of biostatics
Application of statistical tests in Biomedical Research .pptx
Overview-of-Biostatistics-Jody-Krieman-5-6-09 (1).ppt
Overview-of-Biostatistics-Jody-Kriemanpt
BIOSTATISTICS.pptx
Overview of different statistical tests used in epidemiological
PARAMETRIC TEST in Public health dentistry.ppt
Biostatistics
PRESENTATION ON TESTS OF SIGNIFICANCE.pptx
PARAMETRIC TESTS.pptx
statistics introduction
Biostatistics
Introduction to Basic Biostatistics (Biostats)
Introductory Statistics
Tests of significance Periodontology
Ad

More from Shinjan Patra (20)

PPTX
Medical nutrition therapy in diabetes
PPTX
ISBMR osteoporosis guidelines
PPTX
Delayed Puberty
PPTX
COVID vaccination and prevention strategies
PPTX
Diagnostics and classifications of Corona Viruses
PPTX
Management of Diabetic Autonomic Neuropathy
PPTX
Approach to Rickets and relevant metabolic bone disorders
PPTX
Adreno Cortical Carcinoma in a Nut Shell
PPTX
PPT Monoclonal antibodies in Endocrinology
PPTX
Ppt Insulin delivery systems
PPTX
Ppt COVID vaccination
PPTX
Ppt diagnostics & classification of corona viruses
PPTX
Ppt Calcium and Phosphate metabolism
PPTX
Ppt Approach to Rickets and it's Management
PPTX
Ppt ACC follow up
PPTX
Ppt ISBMR Osteoporosis guidelines
PPTX
Pulsatility in Endocrinology
PPTX
Case Review of Adult-onset Congenital Adrenal Hyperplasia due to 21-OH defici...
PPTX
Wnt-Signalling in Endocrinology
PPTX
Approach to microcytic hypochromic anemia
Medical nutrition therapy in diabetes
ISBMR osteoporosis guidelines
Delayed Puberty
COVID vaccination and prevention strategies
Diagnostics and classifications of Corona Viruses
Management of Diabetic Autonomic Neuropathy
Approach to Rickets and relevant metabolic bone disorders
Adreno Cortical Carcinoma in a Nut Shell
PPT Monoclonal antibodies in Endocrinology
Ppt Insulin delivery systems
Ppt COVID vaccination
Ppt diagnostics & classification of corona viruses
Ppt Calcium and Phosphate metabolism
Ppt Approach to Rickets and it's Management
Ppt ACC follow up
Ppt ISBMR Osteoporosis guidelines
Pulsatility in Endocrinology
Case Review of Adult-onset Congenital Adrenal Hyperplasia due to 21-OH defici...
Wnt-Signalling in Endocrinology
Approach to microcytic hypochromic anemia

Recently uploaded (20)

PPTX
surgery guide for USMLE step 2-part 1.pptx
PPTX
ca esophagus molecula biology detailaed molecular biology of tumors of esophagus
PDF
Intl J Gynecology Obste - 2021 - Melamed - FIGO International Federation o...
PPTX
Uterus anatomy embryology, and clinical aspects
PDF
Handout_ NURS 220 Topic 10-Abnormal Pregnancy.pdf
PPTX
post stroke aphasia rehabilitation physician
PPTX
neonatal infection(7392992y282939y5.pptx
PPT
ASRH Presentation for students and teachers 2770633.ppt
PPTX
CME 2 Acute Chest Pain preentation for education
PPT
MENTAL HEALTH - NOTES.ppt for nursing students
PPTX
Fundamentals of human energy transfer .pptx
PDF
Oral Aspect of Metabolic Disease_20250717_192438_0000.pdf
PDF
Medical Evidence in the Criminal Justice Delivery System in.pdf
PPT
OPIOID ANALGESICS AND THEIR IMPLICATIONS
PPTX
History and examination of abdomen, & pelvis .pptx
PPTX
15.MENINGITIS AND ENCEPHALITIS-elias.pptx
PPT
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
PPT
Breast Cancer management for medicsl student.ppt
PPTX
anaemia in PGJKKKKKKKKKKKKKKKKHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH...
PPT
genitourinary-cancers_1.ppt Nursing care of clients with GU cancer
surgery guide for USMLE step 2-part 1.pptx
ca esophagus molecula biology detailaed molecular biology of tumors of esophagus
Intl J Gynecology Obste - 2021 - Melamed - FIGO International Federation o...
Uterus anatomy embryology, and clinical aspects
Handout_ NURS 220 Topic 10-Abnormal Pregnancy.pdf
post stroke aphasia rehabilitation physician
neonatal infection(7392992y282939y5.pptx
ASRH Presentation for students and teachers 2770633.ppt
CME 2 Acute Chest Pain preentation for education
MENTAL HEALTH - NOTES.ppt for nursing students
Fundamentals of human energy transfer .pptx
Oral Aspect of Metabolic Disease_20250717_192438_0000.pdf
Medical Evidence in the Criminal Justice Delivery System in.pdf
OPIOID ANALGESICS AND THEIR IMPLICATIONS
History and examination of abdomen, & pelvis .pptx
15.MENINGITIS AND ENCEPHALITIS-elias.pptx
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
Breast Cancer management for medicsl student.ppt
anaemia in PGJKKKKKKKKKKKKKKKKHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH...
genitourinary-cancers_1.ppt Nursing care of clients with GU cancer

Bio-Statistics in Bio-Medical research

  • 1. Bio-statistics in Bio-medical Research Dr. Shinjan Patra D.M. ( Endocrinology) Resident All India Institute of Medical Sciences Jodhpur
  • 2. Introduction • Collecting, analyzing, and interpreting data are essential components of biomedical research and require biostatistics. • Doing various statistical tests has been made easy by sophisticated computer software.
  • 3. First learning with Bio-statistics
  • 4. Evidence Based!! Clinical Expertise Research Evidence Patient Preferences EBM Are our patients always able to judge what is best for them? Our clinical experience alone may not improve our patient’s lives A lot of research is aimed at improving the lives of patients! 4
  • 5. Basic need for Bio-statistics • To choose the right statistical test for the computer to perform based on the nature of data derived from one’s own research • To understand if an analysis was performed appropriately during review and interpretation of others’ research
  • 6. Drawing Conclusions Designing & Implementing TRUTH IN THE UNIVERSE Research Question TRUTH IN THE STUDY Study Plan FINDINGS IN THE STUDY Actual Study Infer Infer Design Implement Process of Research Project 6
  • 7. Methods of Analysis I. Descriptive Methods : Tables Diagrams Charts II. Inferential Methods :  Estimation  Point Estimation Mean / proportion etc.  Interval Estimation i.e. Confidence interval of point estimate  Hypotheses Testing  Comparison between the treatments  Association 7
  • 8. Overview • Variables • Distribution of Data • Statistical Tests • Hypothesis testing and Error • Confounding/Bias • Measures of Association • Diagnostic Tests • Regression analysis
  • 9. Variables • Any characteristic that can be observed, measured, or categorized • Divided into Two- Categorical Continuous
  • 10. Categorical • Not suitable for quantification; classified into categories Nominal- Named categories, with no implied value- like- blood groups. Existential, no inherent order or superiority. Nominal data with only two groups are referred to as dichotomous or binary (male or female) Ordinal: Named with an order/ superiority like grades of GO
  • 11. Continuous • can have an infinite number of possible values Interval: Equal interval between values but no meaningful zero point ( body temp) Ratio: Equal intervals with a meaningful zero point and all mathematical operations are functional ( amount of secretions from a diabetic wound)
  • 13. 13 Intervening Variable • An intervening variable is on the causal pathway to the outcome
  • 14. Choice of Statistical test • Parametric tests can be used with interval and ratio data but not with nominal or ordinal data • Nonparametric tests can be used with any type of variable, including nominal or ordinal data
  • 16. Central Tendency • Estimates the “center” of the Distribution  Mean- representative of all data points and is the most efficient estimator of the middle of a normal (Gaussian) distribution;  However, inappropriate as a measure of central tendency if data are skewed.  Influenced by outlying values  Commonly used for interval and ratio data.
  • 17. Median- not influenced by outlying values  more appropriate for skewed data  commonly used for ordinal data. Mode- particularly useful while describing data distributed in a bimodal pattern when mean and median are not appropriate. commonly used with nominal data.
  • 18. Normal Distribution • Gaussian. Symmetric bell-shaped frequency distribution, in which mean, median, and mode all have the same value • Appropriate statistical test would be a parametric test such as a t-test or an analysis of variance (ANOVA). • Data are usually represented as mean (SD).
  • 20. Skewed Distribution • Positive/Right and Negative/Left • Appropriate statistical test would be a nonparametric test, such as Wilcoxon test or Mann-Whitney test. • Data are usually represented as median, interquartile range (IQR)
  • 22. Measures of Dispersion • Range- difference between the highest and the lowest values. Range can change drastically when the study is repeated • INTERQUARTILE RANGE (IQR) is the range between the 25th and 75th percentiles or the difference between the medians of the lower half and upper half of the data and comprises the middle 50% of the data
  • 23. • VARIANCE is a measure of dispersion or average deviation from the mean • STANDARD DEVIATION (SD) is the square root of variance and is the most common measure of dispersion used for normally distributed data • STANDARD ERROR OF MEAN (SEM) is calculated by dividing the SD by the square root of n.
  • 24. Statistical tests • Parametric- assume the underlying population to be normally distributed and are based on means and SDs • Non-parametric- No assumption of population distribution
  • 25. t Tests • Student’s t test is a simple, commonly used parametric test to compare two groups of continuous variables • Paired : Each patient/subject serves as his/her own control before and after an intervention • Unpaired: Two groups of patients/subjects are compared with each other
  • 26. Analysis of Variance ( ANOVA) • One-way ANOVA is an extension of the two- sample t test to three or more samples and deals with statistical test on more than two groups • Other methods, such as planned or post hoc comparisons, are conducted to examine specific comparisons among individual means
  • 27. Wilcoxon Rank-Sum test • Mostly for Skewed distribution and Ordinal data. • Others- x2 test (Chi-squared test)- common test used to compare categorical data. Data first entered into a 2*2 contingency table • If the numbers are small (expected value is <5), an alternative test called Fisher’s exact test.
  • 29. Simplified form • Nominal variable- Chi Square / Fischers • Ordinal Variable- Wilcoxon/ Mann-Whitney U • Interval/Ratio- Normal distribution- t test/ANOVA • Interval/Ratio- Skewed distribution- Wilcoxon/ Mann-Whitney U
  • 30. Hypothesis testing • Null hypothesis refers to restating the research hypothesis to one that proposes no difference between groups being compared • An alternative hypothesis proposes an association- One-sided/two-sided
  • 31. Errors • TYPE I ERROR (false-positive, also known as a rejection error) is rejection of a null hypothesis that is actually true in the population • TYPE II ERROR (false-negative, also known as an acceptance error) is failure to reject a null hypothesis that is actually false
  • 33. Interpretation of p Value • p VALUE is the probability of the null hypothesis being true by chance alone. • It is also the probability of committing a type I error. • p value of 0.05 or less is commonly used to denote significance • This value informs the investigator that there is at least a 95% chance that the two samples represent different populations
  • 34. • lower p value (<0.01) indicates a lower likelihood (1%) that the null hypothesis may be true due to chance alone. • lower p value does not infer a higher strength of association or clinical importance of an association • Factors that tend to decrease p value and increase significance are increased sample size, increased difference in control and experimental means and less variance
  • 36. Confidence interval • Range of values that you expect to include the actual mean of the true population • Typically 95% confidence intervals are used in research • Values at either extreme of this range are called confidence limits
  • 37. Sample Size and C.I • The larger the sample size, the narrower the C.I . • p value and C.I together provide the best information about the role of chance • Sample size is an important determinant of the power of the study to detect significant differences
  • 38. Confounding • Distortion in a measure of effect that may arise because we fail to control for other variables that are previously known risk factors for the health outcome being studied • Can lead to the observation of apparent differences between the study groups when they do not truly exist, or conversely, the observation of no difference when they do exist. 38
  • 39. Confounding variable • Independent risk factor (cause) of outcome • Unevenly distributed among exposed and unexposed • Not on the causal pathway between exposure and outcome 39
  • 40. THE DIFFERENCE BETWEEN BIAS AND CONFOUNDING  Bias creates an association that is not true,  Confounding describes an association that is true, but potentially misleading. 40
  • 41. Example • 2 groups of bottle-fed babies and breast-milk fed babies compared for Gastro-enteritis ( GE) • Bias- Finding less GE in bottle-fed babies due to less follow-up to the doctors • Confounding- Better-hygiene and less crowding can prevent GE in any of the groups; but that doesn’t correspond to true protective effects of breast-milk
  • 42. Least square mean changes • When there is study imbalance such that some blocks of experimental units are under- represented relative to other block • Least square means (estimated population marginal means) provide an opportunity to obtain an unbiased estimate of averages in the face of this kind of study imbalance
  • 45. • Absolute risk = a/(a + b) or c/ ( c+ d) • Relative risk/ Risk Ratio = (a/[a + b]) / (c/[c + d]) • Hazard ratio is a measure of relative risk over time in circumstances in which we are interested not only in the total number of events, but in their timing as well.
  • 49. Key points • Sensitivity and specificity are prevalence independent. • Increased prevalence of disease will increase PPV • Reduced disease prevalence increase NPV • Increasing the threshold for a positive test reduces false-positives and increases specificity • Reducing the threshold or cutoff value for a positive test reduces false-negatives and increases sensitivity
  • 51. Gold standard • Unambiguous method of determining whether or not a patient has a particular disease or outcome
  • 52. Regression Analysis • Simple Regression- Many relationships between variables can be fit to a straight line. Like duration of treatment • Logistics Regression- In situations in which the response of interest is dichotomous (binary) rather than continuous. Like outcome of death or alive
  • 53. Summary • Bio-statistics is integral part of understanding any bio-medical research and Evidence-based medicines. • The concept of various variable and understanding the various statistical tests are equally important • Sample size and the confidence limit concepts hold the key
  • 55. CAS-3 & CAS-4 • CAS-3- PPV- 61% • NPV- 60% • Sensitivity- 64% • Specificity- 57% • CAS-4- PPV- 80% • NPV- 64% • Sensitivity- 55% • Specificity- 86% • M. Ph. Mourits et al.