SlideShare a Scribd company logo
© 2013 Springer Publishing Company, LLC.
Chapter 16
Interpreting Test Scores
&Oermann Gaberson
Evaluation and Testing in Nursing Education
4th edition
© 2013 Springer Publishing Company, LLC.
Interpreting Test Scores
♦ A test produces a score
– Number with no intrinsic meaning
– Must be compared with something that has
meaning
♦ Interpretations can be norm- or criterion-
referenced
2
© 2013 Springer Publishing Company, LLC.
Test Score Distributions
♦ Scoring a test produces a collection of raw
scores, recorded by student name or number
– Difficult to interpret characteristics of the scores
♦ Arrange in rank order, highest to lowest
– Reveals range of scores
– Still difficult to judge how a typical student
performed on the test or other characteristics of
the obtained scores
3
© 2013 Springer Publishing Company, LLC.
Test Score Distributions
♦ Frequency distribution
– Remove student names or numbers
– List each score once
– Tally number of times each score occurs
– Identify how well the group of students performed on the
exam more easily
– Can represent graphically as a histogram or frequency
polygon
• Display scores that occurred most frequently, score distribution
shape, range
4
© 2013 Springer Publishing Company, LLC.
Characteristics of
Score Distributions
♦ Symmetry
♦ Skewness
♦ Modality
♦ Kurtosis
5
© 2013 Springer Publishing Company, LLC.
Symmetry
♦ Symmetric distribution or curve
– Equal halves, mirror images of each other
♦ Nonsymmetric or asymmetric distribution
or curve
– Scores cluster at one end, tail toward other end
– Most nursing test score distributions
6
© 2013 Springer Publishing Company, LLC.
Skewness
♦ Skew—direction in which the tail extends
– Positive skew—tail toward the right (in the
direction of positive numbers on a scale)
• Positively skewed distribution—cluster of scores at
low end
– Negative skew—tail toward the left (in the
direction of negative numbers)
• Cluster of scores at the high end
• Most nursing test score distributions
7
© 2013 Springer Publishing Company, LLC.
Modality
♦ Number of peaks (cluster of scores) in the
distribution
♦ Mode
– Most frequently occurring score in the distribution
♦ Unimodal—one peak
♦ Bimodal—two peaks
♦ Multimodal—many peaks
8
© 2013 Springer Publishing Company, LLC.
Kurtosis
♦ Relative flatness or peakedness of the curve
♦ Platykurtic—relatively flat, gently curved
♦ Mesokurtic—moderately curved
♦ Leptokurtic—sharply peaked
9
© 2013 Springer Publishing Company, LLC.
“Curving” Grades
♦ Not appropriate if scores lack characteristics
of a normal curve
– Bell-shaped: symmetric, unimodal, mesokurtic
© 2013 Springer Publishing Company, LLC.
“Curving” Grades
♦ Most score distributions from teacher-made
tests not normally distributed
♦ Shape of distribution affected by:
– Test characteristics
• Difficult test → positively skewed curve
– Ability of students
• Nursing content knowledge not normally distributed
– Students admitted to nursing program not representative
of general population
11
© 2013 Springer Publishing Company, LLC.
Measures of Central Tendency
♦ Ways of indicating the score that is most
characteristic or typical of the distribution
♦ “Middle” of a distribution, scores tend to
cluster around it
♦ Three measures
– Mode
– Median
– Mean
12
© 2013 Springer Publishing Company, LLC.
Mode
♦ Most frequently occurring score in a distribution
♦ Must be an actual obtained score
♦ Identified from frequency distribution or graphic
display without mathematical calculation
♦ Rough indication of central tendency
♦ Least stable measure of central tendency
– Can fluctuate considerably among samples drawn from
the same population
13
© 2013 Springer Publishing Company, LLC.
Median
♦ Point that divides a score distribution into equal halves
♦ 50th percentile—50% of scores are above and 50% are below
♦ Does not have to be an actual obtained score
– Even number of scores—median is halfway between the two
middle scores
– Odd number of scores—median is the middle score
♦ Index of location—not influenced by the value of each score
– Good for skewed distribution
14
© 2013 Springer Publishing Company, LLC.
Mean
♦ Mathematical average of all scores
– Computed by summing individual scores and dividing by
the total number of scores
– Does not have to be an actual obtained score
♦ Value of the mean is affected by every score in the
distribution
– Influenced by extremely high or low scores
– Not the most accurate measure of central tendency in
highly skewed distributions
15
© 2013 Springer Publishing Company, LLC.
Selecting a Measure of
Central Tendency
♦ Relationship between shape of a distribution
and locations of measures of central tendency
– Normal distribution
• Mean, median, and mode have the same value
– Positively skewed distribution
• Mean is highest, mode is lowest
– Negatively skewed distribution
• Mode is highest, mean is lowest
16
© 2013 Springer Publishing Company, LLC.
Measures of Variability
♦ Used to determine how similar or different
the test scores are
♦ Score distributions may have similar measures
of central tendency and different degrees of
variability
♦ Most common measures
– Range
– Standard deviation
17
© 2013 Springer Publishing Company, LLC.
Range
♦ Simplest measure of variability
♦ Difference between the highest and lowest scores in
the distribution
– Sometimes expressed as highest and lowest scores, rather
than a difference score (e.g., 42 to 60)
♦ Can be highly unstable—based on only two values
♦ Tends to increase with number of scores
– Wider range of test scores from large group of students
because of likelihood of an extreme score
18
© 2013 Springer Publishing Company, LLC.
Standard Deviation (SD)
♦ Most common and useful measure of variability
♦ Takes every score in the distribution into
consideration
♦ Based on differences between each score and the
mean
♦ Represents average amount by which scores differ
from the mean
– Smaller if scores cluster tightly around the mean
– Larger if scores widely scattered over large range
19
© 2013 Springer Publishing Company, LLC.
Interpreting an Individual Score
♦ Scores on teacher-made tests
– Norm-referenced interpretations
• Use mean and SD to interpret individual scores
– Criterion-referenced interpretations
• Used in most nursing education settings
• Scores are compared to a preset standard
• Example: percentage-correct score
– Comparison of a student’s score with the maximum
possible score
20
© 2013 Springer Publishing Company, LLC.
Percentage-Correct Scores
♦ Derived (not raw) score
♦ Often used as a basis for assigning grades
♦ Determined more by test item difficulty than by
quality of performance
– If test is more difficult than expected, teacher may
want to adjust the raw scores before calculating the
percentage correct
♦ Not to be confused with percentile score
– Norm-referenced interpretation
21
© 2013 Springer Publishing Company, LLC.
Interpreting an Individual Score
♦ Scores on standardized tests
– Usually used to make norm-referenced
interpretations
– More relevant to general rather than specific
instructional goals
• Should not be used to determine course grades
– Usually reported in derived scores
• Percentile ranks
• Standard scores
• Norm-group scores
(cont’d)
22
© 2013 Springer Publishing Company, LLC.
Interpreting an Individual Score
♦ Scores on standardized tests (cont’d)
– Important to specify an appropriate norm group
for comparison
– User’s manual includes norm tables with
descriptions of each norm group
– Teacher should select the norm group that most
closely matches the group of students
• Examples: type of nursing program, public or private
23

More Related Content

PPTX
Six steps for avoiding misinterpretations
PPT
Standardized tests
PPT
Interpreting Test Scores
PPTX
Types of Scores & Types of Standard Scores
PPTX
Interpretation of test Scores
PPTX
Interpreting Assessment and Reporting
PPT
Interpreting scores from Test
PPTX
Utilization of assessment data
Six steps for avoiding misinterpretations
Standardized tests
Interpreting Test Scores
Types of Scores & Types of Standard Scores
Interpretation of test Scores
Interpreting Assessment and Reporting
Interpreting scores from Test
Utilization of assessment data

What's hot (20)

PPT
Chapter 15 ppt eval & testing 4e formatted 01.10 kg edits
PPT
Test scores
PPTX
Test standardization
PPT
Norms[1]
PPTX
Interpretation of scores and the grading system in public schools
PPTX
Administering,scoring and reporting a test ppt
PPTX
Norm Reference Test
PPTX
Nature of Interpretation: Norm referenced, Criterion referenced
PPTX
Administering, Scoring, and Analyzing Test
PPTX
Spe 501 class 11
PPTX
Test interpretation
PPTX
Test appraisal
PPT
PDF
Norm & Criterion referenced tests
PPTX
measurment, testing & eveluation
PDF
Types of tests in measurement and evaluation
PDF
Standardized test and non standarded test
PPTX
Administration of the Test and Analysis of Students’ Performance
PPT
Assessment Jeopardy
PDF
Test standardization and norming
Chapter 15 ppt eval & testing 4e formatted 01.10 kg edits
Test scores
Test standardization
Norms[1]
Interpretation of scores and the grading system in public schools
Administering,scoring and reporting a test ppt
Norm Reference Test
Nature of Interpretation: Norm referenced, Criterion referenced
Administering, Scoring, and Analyzing Test
Spe 501 class 11
Test interpretation
Test appraisal
Norm & Criterion referenced tests
measurment, testing & eveluation
Types of tests in measurement and evaluation
Standardized test and non standarded test
Administration of the Test and Analysis of Students’ Performance
Assessment Jeopardy
Test standardization and norming
Ad

Similar to Chapter 16 ppt eval & testing 4e formatted 01.10 kg edits (20)

PDF
Statistics in nursing research
PPTX
Education Assessment in Learnings 1.pptx
PPT
Statistics for interpreting test scores
PPTX
measures of central tendency statistics.pptx
PPTX
measures of central tendency statistics.pptx
PPTX
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
PPT
Refresher Course Psychological Statistics
PPTX
3_Statistics-Refresher.pptx Pschological Assessment
PDF
Describing and Interpreting Test Scores: Frequency Distribution to Measures o...
PPT
ppt for the normal distribution Nominal ordinal
PPTX
Statistics(Basic)
PPT
The_Measures_of_position_for_grade_10_week3
PDF
Statistics Refresher.pdf
PPT
Descriptive_Statistics_PPT.ppt
PPT
Descriptive statistics ppt
PPT
Coun 224 Session2
PPTX
Norms and the Meaning of Test Scores
PPT
EEX 501 Assessment
PPT
EEX 501 Assess Ch4,5,6,7,All
PPTX
Measures of Central Tendency and Measures of Variability
Statistics in nursing research
Education Assessment in Learnings 1.pptx
Statistics for interpreting test scores
measures of central tendency statistics.pptx
measures of central tendency statistics.pptx
LESSON-8-ANALYSIS-INTERPRETATION-AND-USE-OF-TEST-DATA.pptx
Refresher Course Psychological Statistics
3_Statistics-Refresher.pptx Pschological Assessment
Describing and Interpreting Test Scores: Frequency Distribution to Measures o...
ppt for the normal distribution Nominal ordinal
Statistics(Basic)
The_Measures_of_position_for_grade_10_week3
Statistics Refresher.pdf
Descriptive_Statistics_PPT.ppt
Descriptive statistics ppt
Coun 224 Session2
Norms and the Meaning of Test Scores
EEX 501 Assessment
EEX 501 Assess Ch4,5,6,7,All
Measures of Central Tendency and Measures of Variability
Ad

More from stanbridge (20)

PPTX
Micro Lab 3 Lecture
PPTX
Creating a poster v2
PPTX
Creating a poster
PPTX
Sample poster
PPTX
OT 5018 Thesis Dissemination
PPTX
Ot5101 005 week 5
PPTX
Ot5101 005 week4
PPTX
Compliance, motivation, and health behaviors
PPTX
Ch 5 developmental stages of the learner
PPTX
OT 5101 week2 theory policy
PPTX
OT 5101 week3 planning needs assessment
PPTX
Ot5101 week1
PPT
NUR 304 Chapter005
PPT
NUR 3043 Chapter007
PPT
NUR 3043 Chapter006
PPT
NUR 3043 Chapter004
PPT
3043 Chapter009
PPT
3043 Chapter008
PPT
Melnyk ppt chapter_21
PPT
Melnyk ppt chapter_22
Micro Lab 3 Lecture
Creating a poster v2
Creating a poster
Sample poster
OT 5018 Thesis Dissemination
Ot5101 005 week 5
Ot5101 005 week4
Compliance, motivation, and health behaviors
Ch 5 developmental stages of the learner
OT 5101 week2 theory policy
OT 5101 week3 planning needs assessment
Ot5101 week1
NUR 304 Chapter005
NUR 3043 Chapter007
NUR 3043 Chapter006
NUR 3043 Chapter004
3043 Chapter009
3043 Chapter008
Melnyk ppt chapter_21
Melnyk ppt chapter_22

Chapter 16 ppt eval & testing 4e formatted 01.10 kg edits

  • 1. © 2013 Springer Publishing Company, LLC. Chapter 16 Interpreting Test Scores &Oermann Gaberson Evaluation and Testing in Nursing Education 4th edition
  • 2. © 2013 Springer Publishing Company, LLC. Interpreting Test Scores ♦ A test produces a score – Number with no intrinsic meaning – Must be compared with something that has meaning ♦ Interpretations can be norm- or criterion- referenced 2
  • 3. © 2013 Springer Publishing Company, LLC. Test Score Distributions ♦ Scoring a test produces a collection of raw scores, recorded by student name or number – Difficult to interpret characteristics of the scores ♦ Arrange in rank order, highest to lowest – Reveals range of scores – Still difficult to judge how a typical student performed on the test or other characteristics of the obtained scores 3
  • 4. © 2013 Springer Publishing Company, LLC. Test Score Distributions ♦ Frequency distribution – Remove student names or numbers – List each score once – Tally number of times each score occurs – Identify how well the group of students performed on the exam more easily – Can represent graphically as a histogram or frequency polygon • Display scores that occurred most frequently, score distribution shape, range 4
  • 5. © 2013 Springer Publishing Company, LLC. Characteristics of Score Distributions ♦ Symmetry ♦ Skewness ♦ Modality ♦ Kurtosis 5
  • 6. © 2013 Springer Publishing Company, LLC. Symmetry ♦ Symmetric distribution or curve – Equal halves, mirror images of each other ♦ Nonsymmetric or asymmetric distribution or curve – Scores cluster at one end, tail toward other end – Most nursing test score distributions 6
  • 7. © 2013 Springer Publishing Company, LLC. Skewness ♦ Skew—direction in which the tail extends – Positive skew—tail toward the right (in the direction of positive numbers on a scale) • Positively skewed distribution—cluster of scores at low end – Negative skew—tail toward the left (in the direction of negative numbers) • Cluster of scores at the high end • Most nursing test score distributions 7
  • 8. © 2013 Springer Publishing Company, LLC. Modality ♦ Number of peaks (cluster of scores) in the distribution ♦ Mode – Most frequently occurring score in the distribution ♦ Unimodal—one peak ♦ Bimodal—two peaks ♦ Multimodal—many peaks 8
  • 9. © 2013 Springer Publishing Company, LLC. Kurtosis ♦ Relative flatness or peakedness of the curve ♦ Platykurtic—relatively flat, gently curved ♦ Mesokurtic—moderately curved ♦ Leptokurtic—sharply peaked 9
  • 10. © 2013 Springer Publishing Company, LLC. “Curving” Grades ♦ Not appropriate if scores lack characteristics of a normal curve – Bell-shaped: symmetric, unimodal, mesokurtic
  • 11. © 2013 Springer Publishing Company, LLC. “Curving” Grades ♦ Most score distributions from teacher-made tests not normally distributed ♦ Shape of distribution affected by: – Test characteristics • Difficult test → positively skewed curve – Ability of students • Nursing content knowledge not normally distributed – Students admitted to nursing program not representative of general population 11
  • 12. © 2013 Springer Publishing Company, LLC. Measures of Central Tendency ♦ Ways of indicating the score that is most characteristic or typical of the distribution ♦ “Middle” of a distribution, scores tend to cluster around it ♦ Three measures – Mode – Median – Mean 12
  • 13. © 2013 Springer Publishing Company, LLC. Mode ♦ Most frequently occurring score in a distribution ♦ Must be an actual obtained score ♦ Identified from frequency distribution or graphic display without mathematical calculation ♦ Rough indication of central tendency ♦ Least stable measure of central tendency – Can fluctuate considerably among samples drawn from the same population 13
  • 14. © 2013 Springer Publishing Company, LLC. Median ♦ Point that divides a score distribution into equal halves ♦ 50th percentile—50% of scores are above and 50% are below ♦ Does not have to be an actual obtained score – Even number of scores—median is halfway between the two middle scores – Odd number of scores—median is the middle score ♦ Index of location—not influenced by the value of each score – Good for skewed distribution 14
  • 15. © 2013 Springer Publishing Company, LLC. Mean ♦ Mathematical average of all scores – Computed by summing individual scores and dividing by the total number of scores – Does not have to be an actual obtained score ♦ Value of the mean is affected by every score in the distribution – Influenced by extremely high or low scores – Not the most accurate measure of central tendency in highly skewed distributions 15
  • 16. © 2013 Springer Publishing Company, LLC. Selecting a Measure of Central Tendency ♦ Relationship between shape of a distribution and locations of measures of central tendency – Normal distribution • Mean, median, and mode have the same value – Positively skewed distribution • Mean is highest, mode is lowest – Negatively skewed distribution • Mode is highest, mean is lowest 16
  • 17. © 2013 Springer Publishing Company, LLC. Measures of Variability ♦ Used to determine how similar or different the test scores are ♦ Score distributions may have similar measures of central tendency and different degrees of variability ♦ Most common measures – Range – Standard deviation 17
  • 18. © 2013 Springer Publishing Company, LLC. Range ♦ Simplest measure of variability ♦ Difference between the highest and lowest scores in the distribution – Sometimes expressed as highest and lowest scores, rather than a difference score (e.g., 42 to 60) ♦ Can be highly unstable—based on only two values ♦ Tends to increase with number of scores – Wider range of test scores from large group of students because of likelihood of an extreme score 18
  • 19. © 2013 Springer Publishing Company, LLC. Standard Deviation (SD) ♦ Most common and useful measure of variability ♦ Takes every score in the distribution into consideration ♦ Based on differences between each score and the mean ♦ Represents average amount by which scores differ from the mean – Smaller if scores cluster tightly around the mean – Larger if scores widely scattered over large range 19
  • 20. © 2013 Springer Publishing Company, LLC. Interpreting an Individual Score ♦ Scores on teacher-made tests – Norm-referenced interpretations • Use mean and SD to interpret individual scores – Criterion-referenced interpretations • Used in most nursing education settings • Scores are compared to a preset standard • Example: percentage-correct score – Comparison of a student’s score with the maximum possible score 20
  • 21. © 2013 Springer Publishing Company, LLC. Percentage-Correct Scores ♦ Derived (not raw) score ♦ Often used as a basis for assigning grades ♦ Determined more by test item difficulty than by quality of performance – If test is more difficult than expected, teacher may want to adjust the raw scores before calculating the percentage correct ♦ Not to be confused with percentile score – Norm-referenced interpretation 21
  • 22. © 2013 Springer Publishing Company, LLC. Interpreting an Individual Score ♦ Scores on standardized tests – Usually used to make norm-referenced interpretations – More relevant to general rather than specific instructional goals • Should not be used to determine course grades – Usually reported in derived scores • Percentile ranks • Standard scores • Norm-group scores (cont’d) 22
  • 23. © 2013 Springer Publishing Company, LLC. Interpreting an Individual Score ♦ Scores on standardized tests (cont’d) – Important to specify an appropriate norm group for comparison – User’s manual includes norm tables with descriptions of each norm group – Teacher should select the norm group that most closely matches the group of students • Examples: type of nursing program, public or private 23