SlideShare a Scribd company logo
Evaluation of a Diagnostic Test
Evaluation of a Diagnostic Test
Accuracy or Validity –
The degree to which a measurement or
study reaches a correct conclusion
determining the ‘True Status’ of the
disease
• Sensitivity and Specificity describe the
validity of the test relative to gold standard
• First step in the evaluation of a test is
determining True status of the disease
using ‘Gold Standard’ for the Test
Definitions
• Sensitivity – the percentage of diseased persons
having positive test
• Specificity – the percentage of Non- diseased
persons having negative test
• Predictive Value – estimation of the probability of
Dis. after test results –PPV & NPV
• PPV – %age of persons with +ve Test, having Dis.
• NPV - %age of persons with -ve Test, not having Dis
• The greater the sensitivity of a test, more likely
the test will detect the disease.
The test with Great Sensitivity are usefull to
rule out presence of a disease bcz -ve test will
virtually exclude the possibility that patient has
the Disease.
• The greater the specificity of a test, the more
likely it is that persons without the disease will
have –ve test.
Very specific tests often are used to confirm the
presence of a disease. If a test is highly specific;
a +ve test result would strongly suggest the
presence of the disease
Surgical Biopsy (Gold Standard- TRUTH)
FNA
results
Positive
Disease
(Biopsy + ve)
No Disease
(Biopsy - ve)
Total
14

8 22
Negative
1
91

92
TOTAL 15 99 114
True Positives
False Positives
False Negatives
True Negatives
 Type I error
 Type II error Power 1 - 
Sensitivity
14/15=93%
Specificity
(91/99 = 92%)
PPV=14/22
= 64%
NPV =
91/92 = 99%
Prev = 13%
(15/114)
How Diagnostic Test help
• Whether the probability of Br. Ca is 13% or
64% :- further workup is required!!!
• But a –ve test result would reduce the
probability that Br. Ca is present to 1%
(100% minus NPV)
So!
Now no Biopsy…but keep watch
14 8
1 91
Without Palpable Mass
Surgical Biopsy
Cancer No Cancer
FNA
Result
+
-
Total 15 99
22
92
114
113 15
8 181
Surgical Biopsy
Cancer No Cancer
FNA
Result
+
-
Total 121 196
128
189
317
With Palpable Mass
Total
Total
Prevalence = 13%
Sensitivity = 93%
Specificity = 92%
PPV = 64%
NPV = 99%
Prevalence = 38%
Sensitivity = 93%
Specificity = 92%
PPV = 88%
NPV = 96%
Two Stage Screening - Net Specificity
Net Sensitivity = 315/500 = 63% ( )
Net Specificity = 7600 + 1710/ 9500 =
98% ( )
Assessing the Validity and Reliability of Diagnostic and Screening Test
PPV-?
Second Method of Applying Dx Test - Simultaneous Testing
Simultaneous Testing
Net Sensitivity
16+144+36
200
=196
200
= 98%
Net Sensitivity =
Simultaneous Testing
Net Specificity
=432
800
= 54%
Net Specificity =
Two Stage vs Simultaneous Testing
• Compared with either test alone, there is :-
– a loss in Net Sensitivity and a gain in Net Specificity
in Two Stage Testing
– a gain in Net Sensitivity and a loss in Net Specificity
in Simultaneous Testing
• Decision to use either Two Stage or Simultaneous
Testing depends on objective and practical
considerations (like reducing hospital stay, cost
and invasiveness ! Insurance coverage)
PPV is affected by
Dis. Prev. & Specificity
Without palpable masses; Prevalence= 13%
With palpable masses; Prevalence= 38%
Prevalence= 13%
Sensitivity = 93%
Specificity = 92%
PV + = 64%
PV - = 99%
Prevalence= 38%
Sensitivity = 93%
Specificity = 92%
PV + = 88%
PV - = 96%
Predictive value and Prev.
Surgical biopsy
FNA
results
positive
Cancer No
Cancer
Total
41 5 46
negative 3 65 68
Total 44 70 114
Surgical biopsy
FNA
results
positive
Cancer No
Cancer
Total
14 8 22
negative 1 91 92
Total 15 99 114
Suspicious FNA results considered positive
Prevalence= 38%
Sensitivity = 93%
Specificity = 92%
PV + = 88%
PV - = 96%
Surgical biopsy
FNA
results
Positive
Cancer No Cancer Total
33 0 33
negative 11 70 81
Total 44 70 114
Suspicious FNA results considered negative
Prevalence= 38%
Sensitivity = 75%
Specificity = 100%
PV + = 100%
PV - = 87%
Specificity & Predictive Value
Surgical biopsy
FNA
results
positive
Cancer No
Cancer
Total
41 5 46
negative 3 65 68
Total 44 70 114
screening and diagnostic  testing
screening and diagnostic  testing
screening and diagnostic  testing
Decide whether to order ESR or directly MRI?
• A 57 yr old man presents with h/o aching low back pain that
persists at rest and is worse by bending and lifting.
Progressively getting worse in last 6 wks-awakening him at
night.
• Within past 10 days he has noticed numbness in Rt buttock
and thigh and weakness in Rt lower limb.
• He had no fever but has lost 10 lb Wt. in last 4 months.
• O/E – Temp. is 99.6F, tenderness in the lower lumber spine,
decrease in sensation over dorso-lateral aspect of Rt. foot,
weakness in Rt ankle aversion. Deep tendon reflexes normal
• ! You suspect that man has 20% chance of spinal malignancy
• ESR ≥20 mm/h has 78% sensitivity & 67% specificity
• MRI has 95% sensitivity AND 95% specificity!!
• Suppose we have 1000 patients
We can use any of he following methods
• 2 X 2 Table Method
• Likelihood Ratio (gives odds)
• Decision Tree method And
• Bayes Theoram
Don’t
Tt
Test Treat
Do
n’t
Tt
Test Treat
Don’t
Tt
Test Treat
No difference
Test is quite accurate
with little Risk
Test is of low accuracy
or Risky
How will the prior probability of 20% change with +ve ESR?
Disease
Test (ESR) D+ D-
T+ (TP) 156 (FP) 264 420
T- (FN) 44 (TN) 536 580
200 800 1000
Predictive Value of a Positive test PPV (PV+) = 156/420 = 0.37
Increased from 20% to 37%
Predictive Value of a Negative test NPV = 536/580 = 0.92
A) - What is the probability that the patient doesn’t have the Dis. though the test is +ve (FP)?
B)- What is the probability that the patient does have the disease though the test is –ve (FN)?
A) - 264/420 = 0.63 (hence minimal use as screening test), and B)- 44/580 = 0.08
MRI has 95% sensitivity AND 95% specificity
And will have PVs as follows:-
Disease
Test (MRI) D+ D-
T+ (TP) 351.5 (FP) 31.5 383
T- (FN) 18.5 (TN) 598.5 617
370 630 1000
Predictive Value of a Positive test PPV (PV+) = 351.5/383 = 0.918
Predictive Value of a Negative test NPV (NPV) = 598.5/617= 0.970
Likelihood Ratio
LR = Chance of picking the Dis. out of total Diseased
Chance of picking the Dis. out of total Dis. Free
= sensitivity/ 1-specificity (FN)
= LR = 0.78/ 1-0.67 = 0.78/0.33 = 2.36
(In case of ESR where sensitivity = 78% & Specificity = 67%);
Pre Test odds = Prior Probability
1- Prior Probability
Post Test odds = LR X Pre test odds = 0.78 X 0.2
0.33 X 0.8
= 2.36 X 0.25 = 0.59
Posterior probability (PPV) = post test odds/ 1+ post test odds
= 0.59/ 1+ 0.59 = 0.37 = 37%
= 0.20/1- 0.2 = 0.25
=0.156/ 0.264
Reliability or Repeatability of a Test
• Factors responsible for variation in the results:
1. Intra subject (within the individual) variation
2. Intra observer variation (variation in the
reading of test result by the same observer)-
greater the subjective element in the reading
more is this error
3. Inter observer variation (variation in the
reading of test result between observers)
Inter observer Variation
Reading No. 1
Reading No. 2 Abnormal Suspect Doubtful Normal
Abnormal A B C D
Suspect E F G H
Doubtful I J K L
Normal M N O P
Percent Agreement =
A + F + K + P
Total readings
X 100
Negative Test Result, & there is likely to be
Considerable agreement on this !
Kappa Statistics
• The extent to which two observers (physician/
nurse/ radiologist etc) agree is an important
Index of good quality of care
• Yet, there is a fraction based ‘solely on chance’
for agreement between two observers
• What we want to know is – to what extent
did the education/ training that the observers
received improve the quality of their
observation (how much increased percent
agreement between them beyond chance! )
Rationale of the kappa statistics
• First we want to know – how much better is the
agreement between the observers’ readings than
would be expected by chance
= (% agreement observed - % agreement expected
by chance alone)
• What is the maximum improvement the
observers can have than expected by chance
100% - % agreement expected by chance alone
• Kappa expresses the extent to which the observed
agreement exceeds chance agreement relative to
maximum that the observer can hope to improve
• Kappa =
[Percent Agreement
Observed]
[Percent Agreement
expected by chance alone]
-
[Percent Agreement
expected by chance alone]
100% -
Landis and Koch suggested that :-
kappa greater than 0.75 = excellent agreement
Kappa of 0.40 to 0.75 = intermediate to good agreement
Histological grading
Of lung cancer by
two pathologist
screening and diagnostic  testing
Relationship between Validity and Reliability
Exercise
• PA was used to screen Br. Ca. in 2,500 women
with biopsy proven adenocarcinoma and in
5,000 age matched control wome. The result
of PA were +ve in 1800 cases and 800 control.
• What is the sensitivity, specificity and PPV of
PA ?
• 72%, 84% and 69%
• A screening test is used in the same way in two
similar Pop. But the proportion of FPs among those
who test +ve in Pop. A is lower than that among
those who test +ve in Pop. B
• What is likely explanation for this finding ?
• Prevalence of disease is higher in Pop. A
• Compare PA and Audiometry test for hearing
problem using sen.itivity and specificity ?
PA
Test
Hearing Prob.
D+ D-
T+ 240 40
T- 60 160
Audiometry
Test
Hearing Prob.
D+ D-
T+ 240 40
T- 60 160
More sensitive and less specific
• Two Pead. Test for streptococcal infection one
(X) using standard culture test which is 90%
Sen. And 96% Sp. While other (Y) uses New
culture test which is 96% Sen. And 96% Sp.
• If 200 patients undergo culture with both tests
which is correct:-
a) X will correctly identify more people
b) X will correctly identify fewer people
c) X will correctly identify more people without infection
d) The Prev. of Strep. Inf. Is needed to know who will
correctly identify more people
e) Ans is b
• In a screening for colon Ca. 50-75 yrs old were
screened with Hemoccult test.
• If Hemoccult test has 70% Sen. & 75% Sp. And
Prev. of Ca. colon is 12/1000, what is the PPV
• 3.13%
• If Hemoccult test is –ve no further testing is
done but if its +ve Hemoccult test II is done .
• If this second test is also +ve for blood in stool,
more extensive tests are done. What is the
effect on Net Sen. & Sp. of this method?
• Net Sen. Is decreased and Net Sp. Increased
• Two physicians were asked to classify X-rays
abnormal or normal independently.
Physician 2
Physician 1 Abnormal Normal Total
Abnormal 40 20 60
Normal 10 30 40
Total 50 50 100
1. What is the simple % agreement between the two physician - …………..
2. The % agreement between the two physician, excluding the X-rays that bot
classified as normal is ………………..
3. The value of kappa is ………….
4. How will you rate this kappa ………….
Test Result
without the disease
with the disease
‘‘-’’ ‘‘+’’
Moving the Threshold: right
Test Result
without the disease
with the disease
‘‘-’’ ‘‘+’’
Moving the Threshold: left
True
Positive
Rate
(sensitivity)
0%
100%
False Positive Rate
(1-specificity)
0% 100%
ROC curve
True
Positive
Rate
0%
100%
False Positive Rate
0
%
100%
True
Positive
Rate
0%
100%
False Positive Rate
0% 100%
A good test: A poor test:
ROC curve comparison
Best Test: Worst test:
True
Positive
Rate
0
%
100%
False Positive Rate
0
%
100
%
True
Positive
Rate
0
%
100%
False Positive
Rate
0
%
100
%
The distributions
don’t overlap at all
The distributions
overlap completely
ROC curve extremes
1 0
%1s
%0s
Say
1
Say
0
Say
1
Say
0
Say
1
Say
0
Say
1
Say
0
Say
0
Say
1
Say
1
Say
0
Say
1
Say
0
Say
0
Say
1
screening and diagnostic  testing
The process
of making
an objective
and
systematic
analysis of
information
from all the
randomized
controlled
trials

More Related Content

PPTX
Likelihood Ratio, ROC and kappa Statistics
PPTX
Screening of Diseases
PPT
Diagnostic testing 2009
PPTX
Diagnotic and screening tests
PPSX
Validity and Reliability
PPTX
Screening of diseases
PPT
Risk assessment
PDF
Bayesian clinical test
Likelihood Ratio, ROC and kappa Statistics
Screening of Diseases
Diagnostic testing 2009
Diagnotic and screening tests
Validity and Reliability
Screening of diseases
Risk assessment
Bayesian clinical test

What's hot (8)

PDF
MRCPsych - How To Analyse Diagnostic Test Studies (May09)
PPTX
Enhancing Treatment Experiences in Castration-Resistant Prostate Cancer: The ...
PPT
Dose selection nyu 012815 2
PPT
05 diagnostic tests cwq
PDF
[Workshop] The science of screening in Psycho-oncology (Oct10)
PDF
Contemporary Management of HIV. New Data From IDWeek 2018 and Other Fall 2018...
PPT
Screening tests
PPT
Cs in Naser Medical Complx- Gaza - Palestine
MRCPsych - How To Analyse Diagnostic Test Studies (May09)
Enhancing Treatment Experiences in Castration-Resistant Prostate Cancer: The ...
Dose selection nyu 012815 2
05 diagnostic tests cwq
[Workshop] The science of screening in Psycho-oncology (Oct10)
Contemporary Management of HIV. New Data From IDWeek 2018 and Other Fall 2018...
Screening tests
Cs in Naser Medical Complx- Gaza - Palestine
Ad

Similar to screening and diagnostic testing (20)

PPTX
Likelihood Ratio, ROC and kappa
PPT
Validity and reliability of screening/ diagnostic tests
PPT
Validity and Screening Test
PPTX
Screening test (basic concepts)
PPTX
evalution for screening test and its importance .pptx
PPTX
Screening and diagnostic testing
PPTX
5 EPIDIMIOLOGY GROUP 5-1.pptx Assessinging
PPT
Cem 2012 rational diagnostician2
PPTX
John Tidy - Adjunctive colposcopic technologies
PPTX
8 screening.pptxscreening.pptxscreening.
PPT
Module 9 clinical labs values edited.ppt
PPTX
Epidemiological method to determine utility of a diagnostic test
PPT
Diagnostic tests - research .-6thoct.ppt
PPT
Validity of a screening test
PPTX
session three epidemiology.pptx
PPTX
session three epidemiology.pptx
PDF
1 - Diagnostic methods.pdf
PPTX
Dr Amit Diagnostic Tests.pptx
PPTX
Testing and Screening Nov 2021-undergraduate.pptx
PPTX
CAT 1 -MPH 5101 - FOUNDATIONS OF EPIDEMIOLOGY (1).pptx
Likelihood Ratio, ROC and kappa
Validity and reliability of screening/ diagnostic tests
Validity and Screening Test
Screening test (basic concepts)
evalution for screening test and its importance .pptx
Screening and diagnostic testing
5 EPIDIMIOLOGY GROUP 5-1.pptx Assessinging
Cem 2012 rational diagnostician2
John Tidy - Adjunctive colposcopic technologies
8 screening.pptxscreening.pptxscreening.
Module 9 clinical labs values edited.ppt
Epidemiological method to determine utility of a diagnostic test
Diagnostic tests - research .-6thoct.ppt
Validity of a screening test
session three epidemiology.pptx
session three epidemiology.pptx
1 - Diagnostic methods.pdf
Dr Amit Diagnostic Tests.pptx
Testing and Screening Nov 2021-undergraduate.pptx
CAT 1 -MPH 5101 - FOUNDATIONS OF EPIDEMIOLOGY (1).pptx
Ad

More from amitakashyap1 (20)

PPT
Tortoise and hare story for leadership
PPT
Leadership
PPTX
Effective public health communication old
PPT
Cohort design
PPT
Case control design
PPTX
Effective public health communication 5th april
PPT
Copper T insertion
PPT
Contraceptive methods updated
PPT
Basic Concepts of PH
PPTX
Basic Concepts of PH
PPT
Basic Concepts of PH
PPT
Basic Concepts PH
PPT
Public Health and Its Role
PPT
Concepts 1 evolution of com med
PPT
Concept of sufficient cause and component causes
PPT
Epidemiology Lectures for UG
PPT
Epidemiology Lectures for UG
PPTX
Epidemiology Lectures for UG
PPT
Epidemiology Lectures for UG
PPT
Epidemiology Lectures for UG
Tortoise and hare story for leadership
Leadership
Effective public health communication old
Cohort design
Case control design
Effective public health communication 5th april
Copper T insertion
Contraceptive methods updated
Basic Concepts of PH
Basic Concepts of PH
Basic Concepts of PH
Basic Concepts PH
Public Health and Its Role
Concepts 1 evolution of com med
Concept of sufficient cause and component causes
Epidemiology Lectures for UG
Epidemiology Lectures for UG
Epidemiology Lectures for UG
Epidemiology Lectures for UG
Epidemiology Lectures for UG

Recently uploaded (20)

PPTX
Clinical approach and Radiotherapy principles.pptx
PPT
ASRH Presentation for students and teachers 2770633.ppt
PPT
1b - INTRODUCTION TO EPIDEMIOLOGY (comm med).ppt
PDF
Copy of OB - Exam #2 Study Guide. pdf
PPT
OPIOID ANALGESICS AND THEIR IMPLICATIONS
PDF
Medical Evidence in the Criminal Justice Delivery System in.pdf
PDF
focused on the development and application of glycoHILIC, pepHILIC, and comm...
PPT
STD NOTES INTRODUCTION TO COMMUNITY HEALT STRATEGY.ppt
PDF
Transcultural that can help you someday.
PDF
Handout_ NURS 220 Topic 10-Abnormal Pregnancy.pdf
PPT
MENTAL HEALTH - NOTES.ppt for nursing students
PPT
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
PPTX
Electrolyte Disturbance in Paediatric - Nitthi.pptx
PPTX
preoerative assessment in anesthesia and critical care medicine
PPTX
Important Obstetric Emergency that must be recognised
PDF
شيت_عطا_0000000000000000000000000000.pdf
PPTX
CHEM421 - Biochemistry (Chapter 1 - Introduction)
PDF
Intl J Gynecology Obste - 2021 - Melamed - FIGO International Federation o...
PDF
Human Health And Disease hggyutgghg .pdf
PDF
Cardiology Pearls for Primary Care Providers
Clinical approach and Radiotherapy principles.pptx
ASRH Presentation for students and teachers 2770633.ppt
1b - INTRODUCTION TO EPIDEMIOLOGY (comm med).ppt
Copy of OB - Exam #2 Study Guide. pdf
OPIOID ANALGESICS AND THEIR IMPLICATIONS
Medical Evidence in the Criminal Justice Delivery System in.pdf
focused on the development and application of glycoHILIC, pepHILIC, and comm...
STD NOTES INTRODUCTION TO COMMUNITY HEALT STRATEGY.ppt
Transcultural that can help you someday.
Handout_ NURS 220 Topic 10-Abnormal Pregnancy.pdf
MENTAL HEALTH - NOTES.ppt for nursing students
Copy-Histopathology Practical by CMDA ESUTH CHAPTER(0) - Copy.ppt
Electrolyte Disturbance in Paediatric - Nitthi.pptx
preoerative assessment in anesthesia and critical care medicine
Important Obstetric Emergency that must be recognised
شيت_عطا_0000000000000000000000000000.pdf
CHEM421 - Biochemistry (Chapter 1 - Introduction)
Intl J Gynecology Obste - 2021 - Melamed - FIGO International Federation o...
Human Health And Disease hggyutgghg .pdf
Cardiology Pearls for Primary Care Providers

screening and diagnostic testing

  • 1. Evaluation of a Diagnostic Test
  • 2. Evaluation of a Diagnostic Test Accuracy or Validity – The degree to which a measurement or study reaches a correct conclusion determining the ‘True Status’ of the disease • Sensitivity and Specificity describe the validity of the test relative to gold standard • First step in the evaluation of a test is determining True status of the disease using ‘Gold Standard’ for the Test
  • 3. Definitions • Sensitivity – the percentage of diseased persons having positive test • Specificity – the percentage of Non- diseased persons having negative test • Predictive Value – estimation of the probability of Dis. after test results –PPV & NPV • PPV – %age of persons with +ve Test, having Dis. • NPV - %age of persons with -ve Test, not having Dis
  • 4. • The greater the sensitivity of a test, more likely the test will detect the disease. The test with Great Sensitivity are usefull to rule out presence of a disease bcz -ve test will virtually exclude the possibility that patient has the Disease. • The greater the specificity of a test, the more likely it is that persons without the disease will have –ve test. Very specific tests often are used to confirm the presence of a disease. If a test is highly specific; a +ve test result would strongly suggest the presence of the disease
  • 5. Surgical Biopsy (Gold Standard- TRUTH) FNA results Positive Disease (Biopsy + ve) No Disease (Biopsy - ve) Total 14  8 22 Negative 1 91  92 TOTAL 15 99 114 True Positives False Positives False Negatives True Negatives  Type I error  Type II error Power 1 -  Sensitivity 14/15=93% Specificity (91/99 = 92%) PPV=14/22 = 64% NPV = 91/92 = 99% Prev = 13% (15/114)
  • 6. How Diagnostic Test help • Whether the probability of Br. Ca is 13% or 64% :- further workup is required!!! • But a –ve test result would reduce the probability that Br. Ca is present to 1% (100% minus NPV) So! Now no Biopsy…but keep watch
  • 7. 14 8 1 91 Without Palpable Mass Surgical Biopsy Cancer No Cancer FNA Result + - Total 15 99 22 92 114 113 15 8 181 Surgical Biopsy Cancer No Cancer FNA Result + - Total 121 196 128 189 317 With Palpable Mass Total Total Prevalence = 13% Sensitivity = 93% Specificity = 92% PPV = 64% NPV = 99% Prevalence = 38% Sensitivity = 93% Specificity = 92% PPV = 88% NPV = 96%
  • 8. Two Stage Screening - Net Specificity Net Sensitivity = 315/500 = 63% ( ) Net Specificity = 7600 + 1710/ 9500 = 98% ( ) Assessing the Validity and Reliability of Diagnostic and Screening Test PPV-?
  • 9. Second Method of Applying Dx Test - Simultaneous Testing
  • 12. Two Stage vs Simultaneous Testing • Compared with either test alone, there is :- – a loss in Net Sensitivity and a gain in Net Specificity in Two Stage Testing – a gain in Net Sensitivity and a loss in Net Specificity in Simultaneous Testing • Decision to use either Two Stage or Simultaneous Testing depends on objective and practical considerations (like reducing hospital stay, cost and invasiveness ! Insurance coverage)
  • 13. PPV is affected by Dis. Prev. & Specificity
  • 14. Without palpable masses; Prevalence= 13% With palpable masses; Prevalence= 38% Prevalence= 13% Sensitivity = 93% Specificity = 92% PV + = 64% PV - = 99% Prevalence= 38% Sensitivity = 93% Specificity = 92% PV + = 88% PV - = 96% Predictive value and Prev. Surgical biopsy FNA results positive Cancer No Cancer Total 41 5 46 negative 3 65 68 Total 44 70 114 Surgical biopsy FNA results positive Cancer No Cancer Total 14 8 22 negative 1 91 92 Total 15 99 114
  • 15. Suspicious FNA results considered positive Prevalence= 38% Sensitivity = 93% Specificity = 92% PV + = 88% PV - = 96% Surgical biopsy FNA results Positive Cancer No Cancer Total 33 0 33 negative 11 70 81 Total 44 70 114 Suspicious FNA results considered negative Prevalence= 38% Sensitivity = 75% Specificity = 100% PV + = 100% PV - = 87% Specificity & Predictive Value Surgical biopsy FNA results positive Cancer No Cancer Total 41 5 46 negative 3 65 68 Total 44 70 114
  • 19. Decide whether to order ESR or directly MRI? • A 57 yr old man presents with h/o aching low back pain that persists at rest and is worse by bending and lifting. Progressively getting worse in last 6 wks-awakening him at night. • Within past 10 days he has noticed numbness in Rt buttock and thigh and weakness in Rt lower limb. • He had no fever but has lost 10 lb Wt. in last 4 months. • O/E – Temp. is 99.6F, tenderness in the lower lumber spine, decrease in sensation over dorso-lateral aspect of Rt. foot, weakness in Rt ankle aversion. Deep tendon reflexes normal • ! You suspect that man has 20% chance of spinal malignancy • ESR ≥20 mm/h has 78% sensitivity & 67% specificity • MRI has 95% sensitivity AND 95% specificity!! • Suppose we have 1000 patients
  • 20. We can use any of he following methods • 2 X 2 Table Method • Likelihood Ratio (gives odds) • Decision Tree method And • Bayes Theoram Don’t Tt Test Treat Do n’t Tt Test Treat Don’t Tt Test Treat No difference Test is quite accurate with little Risk Test is of low accuracy or Risky
  • 21. How will the prior probability of 20% change with +ve ESR? Disease Test (ESR) D+ D- T+ (TP) 156 (FP) 264 420 T- (FN) 44 (TN) 536 580 200 800 1000 Predictive Value of a Positive test PPV (PV+) = 156/420 = 0.37 Increased from 20% to 37% Predictive Value of a Negative test NPV = 536/580 = 0.92 A) - What is the probability that the patient doesn’t have the Dis. though the test is +ve (FP)? B)- What is the probability that the patient does have the disease though the test is –ve (FN)? A) - 264/420 = 0.63 (hence minimal use as screening test), and B)- 44/580 = 0.08
  • 22. MRI has 95% sensitivity AND 95% specificity And will have PVs as follows:- Disease Test (MRI) D+ D- T+ (TP) 351.5 (FP) 31.5 383 T- (FN) 18.5 (TN) 598.5 617 370 630 1000 Predictive Value of a Positive test PPV (PV+) = 351.5/383 = 0.918 Predictive Value of a Negative test NPV (NPV) = 598.5/617= 0.970
  • 23. Likelihood Ratio LR = Chance of picking the Dis. out of total Diseased Chance of picking the Dis. out of total Dis. Free = sensitivity/ 1-specificity (FN) = LR = 0.78/ 1-0.67 = 0.78/0.33 = 2.36 (In case of ESR where sensitivity = 78% & Specificity = 67%); Pre Test odds = Prior Probability 1- Prior Probability Post Test odds = LR X Pre test odds = 0.78 X 0.2 0.33 X 0.8 = 2.36 X 0.25 = 0.59 Posterior probability (PPV) = post test odds/ 1+ post test odds = 0.59/ 1+ 0.59 = 0.37 = 37% = 0.20/1- 0.2 = 0.25 =0.156/ 0.264
  • 24. Reliability or Repeatability of a Test • Factors responsible for variation in the results: 1. Intra subject (within the individual) variation 2. Intra observer variation (variation in the reading of test result by the same observer)- greater the subjective element in the reading more is this error 3. Inter observer variation (variation in the reading of test result between observers)
  • 25. Inter observer Variation Reading No. 1 Reading No. 2 Abnormal Suspect Doubtful Normal Abnormal A B C D Suspect E F G H Doubtful I J K L Normal M N O P Percent Agreement = A + F + K + P Total readings X 100
  • 26. Negative Test Result, & there is likely to be Considerable agreement on this !
  • 27. Kappa Statistics • The extent to which two observers (physician/ nurse/ radiologist etc) agree is an important Index of good quality of care • Yet, there is a fraction based ‘solely on chance’ for agreement between two observers • What we want to know is – to what extent did the education/ training that the observers received improve the quality of their observation (how much increased percent agreement between them beyond chance! )
  • 28. Rationale of the kappa statistics • First we want to know – how much better is the agreement between the observers’ readings than would be expected by chance = (% agreement observed - % agreement expected by chance alone) • What is the maximum improvement the observers can have than expected by chance 100% - % agreement expected by chance alone • Kappa expresses the extent to which the observed agreement exceeds chance agreement relative to maximum that the observer can hope to improve
  • 29. • Kappa = [Percent Agreement Observed] [Percent Agreement expected by chance alone] - [Percent Agreement expected by chance alone] 100% - Landis and Koch suggested that :- kappa greater than 0.75 = excellent agreement Kappa of 0.40 to 0.75 = intermediate to good agreement
  • 30. Histological grading Of lung cancer by two pathologist
  • 32. Relationship between Validity and Reliability
  • 33. Exercise • PA was used to screen Br. Ca. in 2,500 women with biopsy proven adenocarcinoma and in 5,000 age matched control wome. The result of PA were +ve in 1800 cases and 800 control. • What is the sensitivity, specificity and PPV of PA ? • 72%, 84% and 69%
  • 34. • A screening test is used in the same way in two similar Pop. But the proportion of FPs among those who test +ve in Pop. A is lower than that among those who test +ve in Pop. B • What is likely explanation for this finding ? • Prevalence of disease is higher in Pop. A • Compare PA and Audiometry test for hearing problem using sen.itivity and specificity ? PA Test Hearing Prob. D+ D- T+ 240 40 T- 60 160 Audiometry Test Hearing Prob. D+ D- T+ 240 40 T- 60 160 More sensitive and less specific
  • 35. • Two Pead. Test for streptococcal infection one (X) using standard culture test which is 90% Sen. And 96% Sp. While other (Y) uses New culture test which is 96% Sen. And 96% Sp. • If 200 patients undergo culture with both tests which is correct:- a) X will correctly identify more people b) X will correctly identify fewer people c) X will correctly identify more people without infection d) The Prev. of Strep. Inf. Is needed to know who will correctly identify more people e) Ans is b
  • 36. • In a screening for colon Ca. 50-75 yrs old were screened with Hemoccult test. • If Hemoccult test has 70% Sen. & 75% Sp. And Prev. of Ca. colon is 12/1000, what is the PPV • 3.13% • If Hemoccult test is –ve no further testing is done but if its +ve Hemoccult test II is done . • If this second test is also +ve for blood in stool, more extensive tests are done. What is the effect on Net Sen. & Sp. of this method? • Net Sen. Is decreased and Net Sp. Increased
  • 37. • Two physicians were asked to classify X-rays abnormal or normal independently. Physician 2 Physician 1 Abnormal Normal Total Abnormal 40 20 60 Normal 10 30 40 Total 50 50 100 1. What is the simple % agreement between the two physician - ………….. 2. The % agreement between the two physician, excluding the X-rays that bot classified as normal is ……………….. 3. The value of kappa is …………. 4. How will you rate this kappa ………….
  • 38. Test Result without the disease with the disease ‘‘-’’ ‘‘+’’ Moving the Threshold: right
  • 39. Test Result without the disease with the disease ‘‘-’’ ‘‘+’’ Moving the Threshold: left
  • 41. True Positive Rate 0% 100% False Positive Rate 0 % 100% True Positive Rate 0% 100% False Positive Rate 0% 100% A good test: A poor test: ROC curve comparison
  • 42. Best Test: Worst test: True Positive Rate 0 % 100% False Positive Rate 0 % 100 % True Positive Rate 0 % 100% False Positive Rate 0 % 100 % The distributions don’t overlap at all The distributions overlap completely ROC curve extremes
  • 52. The process of making an objective and systematic analysis of information from all the randomized controlled trials

Editor's Notes

  • #3: In a perfect world medical test would always be correct +ve test will mean disease and –Ve test will mean ‘no disease’ but in actuality that is not the case!!!! Internal Validity – the extent to which the results reflect the true situation. To improve on it :- Restricting the type of subjects (Inclusion/Exclusion Criteria) The environment in which the study is performed External Validity – Generalizability (sample size!)
  • #5: The greater the sensitivity of a test, more likely the test will detect the disease. For FNA 93% of all breast cancer patients had positive results Negative result with test having High Sensitivity would virtually exclude the possibility that patient has the Disease The greater the specificity of a test, the more likely it is that persons without the disease will be excluded from the consideration of having the disease
  • #6: Patient Profile extended – 114 such patients were tested with both biopsy and FNA test
  • #9: PPV-? – 15.6% to 63%
  • #13: If we do not want False Positives – two stage testing If we do not want False Negatives – Simultaneous testing
  • #14: Prev. = 50%, Sen. = 50%, Specif. = 90% PPV = 250/330 = 76% Prev. = 50%, Sen. = 90%, Specif. = 50% PPV = 450/8500 = 53%
  • #20: A decision must be reached on whether to order ESR or proceed directly with lumber MRI. It depends on – prior probability of Spinal Malignancy the accuracy of ESR in detecting malignancy among who have it (sensitivity) its ability to label a person not having disease among disease free people (specificity)
  • #21: Application of probabilistic and statistical principles to individual patient is evidence based medicine evaluating new diagnostic procedures determining most cost effective approach evaluating available Tt options A physician’s best guess (index of suspicion that pt has the disease – prior probability) depends on knowledge of prev. and is revised upwards or downwards depending up on S/S and other characteristics like race/ age/ sex etc Decision to do test &/ or Tt depends on the risk of the Diagnostic test, the benefits of Tt to patient, the risk of Tt to patient with and without disease and the accuracy of the Test.
  • #28: If we ask two roadside persons to mark a set of X-rays as positive and negative indepently there will be some agreement between their readings just by chance Kappa statistics proposed by Cohen in 1960
  • #52: The ROC plot of a given test is obtained by calculating the sensitivity and specificity of every observed value, and then plotting sensitivity (on the Y axis) against 1 - specificity (on the X axis). A test that does not discriminate between normal and abnormal would give a diagonal straight line from the bottom left corner to the top right corner. All points on such a line represent a 1:1 ratio of true to false positives. An ideal test would give a rectangular plot passing from the origin at the bottom left hand corner towards top left hand corner at first and thence to the top right hand corner. In reality the ROC curves of many of the tests in common use fall in between these extremes. The cut-off point for deciding between normal and abnormal is selected arbitrarily where the ROC curve changes direction from being vertical to horizontal. The more the ROC curve arches into the upper left hand corner away from the diagonal, the better the test.
  • #53: The overall odds ratio is then calculated by pooling the data of all the studies. This is also called "typical" odds ratio. It is calculated by the difference between number of deaths in the treatment group (i.e. the number observed) and the number of deaths in this group if the treatment were ineffective (i.e. number expected). This gives the Observed minus the Expected statistic. The confidence interval of O  E is also calculated The outcome in the case of each study can be estimated separately by calculating the O  E value. If the observed number (O) differs systematically from the expected number (E), there is clear evidence of effect. The totaled O  E gives a measure of the overall statistical significance and the effect size