SlideShare a Scribd company logo
2
Most read
3
Most read
7
Most read
SETTING CUTSCORES
FOR CERTIFICATION
EXAMS
Nathan A. Thompson, Ph.D.
Vice President, ASC
Adjunct Faculty, University of Cincinnati
Why are cutscores necessary?
 As Glaser (1963) pointed out, the reason for
the existence of many tests is to make
decisions about people
 Mastery: Pass/Fail educational content
 Credentialing: Award/not professional credential
(certification, certificate, license)
 Pre-employment: Hire/not for job (or eligible as
candidate)
 University selection: Admission/not to university
or program
 From Livingston (1980), discussing the
rationale for cutscores:
Why are cutscores necessary?
 What does that mean?
 That most of what we want to measure is in a
continuum (knowledge, intelligence) and not
naturally in “states” (e.g., male/female)
 So we need to set a cutscore (or cutscores) on
the continuum to sort examinees into groups
that reflect interpretations and meanings that
are useful to us
 Pass is “qualified” and Fail is “unqualified”
How do we set a cutscore?
 As the Livingston excerpt notes, all cutscores
involve a level of subjectivity or arbitrariness
 The higher the stakes of the exam, the more
we need to reduce the arbitrariness
 Standard setting methods differ in their level of
objectivity
 A more objective method provides an anchor
to validity and defensibility
How do we set a cutscore?
Approach Example Arbitrarine
ss
Arbitrary round
number
70% of items MOST
Quota Whatever passes 85%
of people (z=-1.0)
MOST
Examinee-based Borderline,
Contrasting Groups
LEAST
Content-based Angoff, Bookmark LEAST
Examinee-based methods
 Borderline Method
 Experts familiar with content AND all examinees
identify those examinees they consider
“borderline”
 The mean or median score for those examinees
is the cutscore
 Contrasting Groups Method
 Experts familiar with content AND all examinees
sort examinees into Pass and Fail Groups (or
external criterion is used)
 The point where the two score distributions cross
is the cutscore
Examinee-based methods
 Are conceptually appealing but have two large
disadvantages:
 Require examinees to take the test first, so
pass/fail decisions cannot be made after they
finish the test
 Require a way to assign examinees into groups
WITHOUT test scores – either experts that are
familiar with all examinees or some sort of “gold
standard”
 Example: For a practice test, results on the real test
can be used as a gold standard to set cutscore
Content-based methods
 The Angoff and Bookmark methods require
experts to look at items rather than candidates
 Bookmark: pilot all items, analyze difficulty
statistics, order the items by difficulty in a
booklet, and ask experts to place a bookmark
 Angoff: All experts provide a rating 0 to 100 for
each item, average serves as cutscore
Content-based methods
 The Angoff method is the most commonly
used approach in certification testing and
therefore quite legally defensible
 Biggest advantage: does not require test to be
administered for data
 Can use data too, with Beuk Compromise, to
incorporate examinee-based aspects
 The drawback is that it requires a group of
subject matter experts to rate all items, which
can take time
Content-based methods
 The Bookmark method has the advantage that
a rating is not required for every item from
every expert (which takes a lot of time)
 The drawback is that it requires all items to be
delivered to a decent-sized sample in order to
obtain item difficulty statistics (might not be
feasible)

More Related Content

PPTX
Angoff method ppt
PPTX
Criterion referenced test
PPT
Standard Setting In Medical Exams
PPTX
Aligning tests to standards
PPTX
Item writing
PPTX
Reliability
PPTX
Standardized and non standardized tests
PPTX
Standardized Testing
Angoff method ppt
Criterion referenced test
Standard Setting In Medical Exams
Aligning tests to standards
Item writing
Reliability
Standardized and non standardized tests
Standardized Testing

What's hot (20)

PPTX
Scoring and grading ppt
PPTX
Factors in Test Administration
PPTX
Testing (Guidance and Counseling)
PPT
subjective test
PPTX
Test, measurement, assessment, evaluation and testing
PPTX
PPTX
Item analysis in education
PPTX
Test administration (Educational Assessment)
PPTX
Criterion-Referenced Assessment Review
PPT
Characteristics of a good test
PPTX
Norms Referenced and Criteria Referenced Evaluation
PPTX
New grading and student evaluation.ppt
PPTX
Oral test
PPTX
achievement test
PPTX
Objective Types of test...
PPT
Formative Assessment vs. Summative Assessment
PPT
Item Analysis
PDF
Norm Referenced and Criterion Referenced
PPTX
Educational assessment
PPTX
Concept of Evaluation
Scoring and grading ppt
Factors in Test Administration
Testing (Guidance and Counseling)
subjective test
Test, measurement, assessment, evaluation and testing
Item analysis in education
Test administration (Educational Assessment)
Criterion-Referenced Assessment Review
Characteristics of a good test
Norms Referenced and Criteria Referenced Evaluation
New grading and student evaluation.ppt
Oral test
achievement test
Objective Types of test...
Formative Assessment vs. Summative Assessment
Item Analysis
Norm Referenced and Criterion Referenced
Educational assessment
Concept of Evaluation
Ad

Similar to Introduction to standard setting (cutscores) (20)

PPTX
Angoff method ppt
PPTX
educatiinar.pptx
PPT
Two phases of establishing cutoff score
PPTX
maryjean-grace-josear-report.pptx assessment
PPTX
maryjean-grace-josear-report.pptx curriculum
PPTX
standaedized test community.......1.pptx
PPTX
7.1 assessment and the cefr (1)
PPTX
PPTX
Administering,scoring and reporting a test ppt
PPTX
Characteristics of a Good Test
PPTX
7.1 assessment and the cefr (1)
PPT
Assess score evaluate (1)
PPTX
Program evaluation
PPTX
Unit 2.pptx
PPTX
Test appraisal
PPTX
grading.pptx
PPTX
grading (1).pptx,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
PPT
Types of Test for Assessing Learning.ppt
PPT
assessment scoring and evaluation.ppt
PPTX
BASIC OF MEASUREMENT & EVALUATION
Angoff method ppt
educatiinar.pptx
Two phases of establishing cutoff score
maryjean-grace-josear-report.pptx assessment
maryjean-grace-josear-report.pptx curriculum
standaedized test community.......1.pptx
7.1 assessment and the cefr (1)
Administering,scoring and reporting a test ppt
Characteristics of a Good Test
7.1 assessment and the cefr (1)
Assess score evaluate (1)
Program evaluation
Unit 2.pptx
Test appraisal
grading.pptx
grading (1).pptx,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,,
Types of Test for Assessing Learning.ppt
assessment scoring and evaluation.ppt
BASIC OF MEASUREMENT & EVALUATION
Ad

More from Nathan Thompson (8)

PPTX
Best Practices in Item Writing
PPTX
Implementing Item Response Theory
PPTX
Using Item Response Theory to Improve Assessment
PPTX
Introduction to Item Response Theory
PPTX
Leveraging Tech Enhanced Items without Sacrificing Psychometrics
PPTX
Developing a Computerized Adaptive Test
PPTX
Introduction to Computerized Adaptive Testing (CAT)
PPTX
Statistical detection of test fraud (data forensics) - where do I start?
Best Practices in Item Writing
Implementing Item Response Theory
Using Item Response Theory to Improve Assessment
Introduction to Item Response Theory
Leveraging Tech Enhanced Items without Sacrificing Psychometrics
Developing a Computerized Adaptive Test
Introduction to Computerized Adaptive Testing (CAT)
Statistical detection of test fraud (data forensics) - where do I start?

Recently uploaded (20)

DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PDF
Weekly quiz Compilation Jan -July 25.pdf
PPTX
Cell Structure & Organelles in detailed.
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
Chinmaya Tiranga quiz Grand Finale.pdf
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
01-Introduction-to-Information-Management.pdf
PDF
O7-L3 Supply Chain Operations - ICLT Program
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
Orientation - ARALprogram of Deped to the Parents.pptx
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
RMMM.pdf make it easy to upload and study
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PDF
Trump Administration's workforce development strategy
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
Abdominal Access Techniques with Prof. Dr. R K Mishra
PDF
Yogi Goddess Pres Conference Studio Updates
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
Weekly quiz Compilation Jan -July 25.pdf
Cell Structure & Organelles in detailed.
Microbial diseases, their pathogenesis and prophylaxis
Chinmaya Tiranga quiz Grand Finale.pdf
Microbial disease of the cardiovascular and lymphatic systems
01-Introduction-to-Information-Management.pdf
O7-L3 Supply Chain Operations - ICLT Program
Final Presentation General Medicine 03-08-2024.pptx
Supply Chain Operations Speaking Notes -ICLT Program
Orientation - ARALprogram of Deped to the Parents.pptx
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
RMMM.pdf make it easy to upload and study
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
Trump Administration's workforce development strategy
Anesthesia in Laparoscopic Surgery in India
2.FourierTransform-ShortQuestionswithAnswers.pdf
Abdominal Access Techniques with Prof. Dr. R K Mishra
Yogi Goddess Pres Conference Studio Updates

Introduction to standard setting (cutscores)

  • 1. SETTING CUTSCORES FOR CERTIFICATION EXAMS Nathan A. Thompson, Ph.D. Vice President, ASC Adjunct Faculty, University of Cincinnati
  • 2. Why are cutscores necessary?  As Glaser (1963) pointed out, the reason for the existence of many tests is to make decisions about people  Mastery: Pass/Fail educational content  Credentialing: Award/not professional credential (certification, certificate, license)  Pre-employment: Hire/not for job (or eligible as candidate)  University selection: Admission/not to university or program
  • 3.  From Livingston (1980), discussing the rationale for cutscores:
  • 4. Why are cutscores necessary?  What does that mean?  That most of what we want to measure is in a continuum (knowledge, intelligence) and not naturally in “states” (e.g., male/female)  So we need to set a cutscore (or cutscores) on the continuum to sort examinees into groups that reflect interpretations and meanings that are useful to us  Pass is “qualified” and Fail is “unqualified”
  • 5. How do we set a cutscore?  As the Livingston excerpt notes, all cutscores involve a level of subjectivity or arbitrariness  The higher the stakes of the exam, the more we need to reduce the arbitrariness  Standard setting methods differ in their level of objectivity  A more objective method provides an anchor to validity and defensibility
  • 6. How do we set a cutscore? Approach Example Arbitrarine ss Arbitrary round number 70% of items MOST Quota Whatever passes 85% of people (z=-1.0) MOST Examinee-based Borderline, Contrasting Groups LEAST Content-based Angoff, Bookmark LEAST
  • 7. Examinee-based methods  Borderline Method  Experts familiar with content AND all examinees identify those examinees they consider “borderline”  The mean or median score for those examinees is the cutscore  Contrasting Groups Method  Experts familiar with content AND all examinees sort examinees into Pass and Fail Groups (or external criterion is used)  The point where the two score distributions cross is the cutscore
  • 8. Examinee-based methods  Are conceptually appealing but have two large disadvantages:  Require examinees to take the test first, so pass/fail decisions cannot be made after they finish the test  Require a way to assign examinees into groups WITHOUT test scores – either experts that are familiar with all examinees or some sort of “gold standard”  Example: For a practice test, results on the real test can be used as a gold standard to set cutscore
  • 9. Content-based methods  The Angoff and Bookmark methods require experts to look at items rather than candidates  Bookmark: pilot all items, analyze difficulty statistics, order the items by difficulty in a booklet, and ask experts to place a bookmark  Angoff: All experts provide a rating 0 to 100 for each item, average serves as cutscore
  • 10. Content-based methods  The Angoff method is the most commonly used approach in certification testing and therefore quite legally defensible  Biggest advantage: does not require test to be administered for data  Can use data too, with Beuk Compromise, to incorporate examinee-based aspects  The drawback is that it requires a group of subject matter experts to rate all items, which can take time
  • 11. Content-based methods  The Bookmark method has the advantage that a rating is not required for every item from every expert (which takes a lot of time)  The drawback is that it requires all items to be delivered to a decent-sized sample in order to obtain item difficulty statistics (might not be feasible)