Vertically Moderated Standard Setting (VMSS)
Presented By: Tariq Mehmood
Presented To: Dr Muhammad Iqbal
University of Education
Lahore
After completing this unit, students will be able to:
• Define vertically moderated standard setting (VMSS).
• Describe the history of VMSS.
• Elaborate on the approaches to VMSS.
• Describe applications of VMSS.
Vertically moderated standard setting (VMSS), sometimes referred to as a process of vertical articulation of standards, is a new and developing concept.
• Put simply, VMSS is a procedure or set of procedures, typically carried out after individual standards have been set.
• Reasonable expectations are typically stated in terms of percentages of students at or above an important performance level, such as Proficient. To illustrate VMSS, consider the following scenario for a typical state testing program: six groups of standard setters have gathered to set standards for the state mathematics tests in Grades 3–8.
Table 14-1 Hypothetical Results of Independent Standard Settings Across Six Grade Levels

Grade   Percentage of Students Classified as At or Above Proficient Performance Level
3       37
4       41
5       34
6       43
7       29
8       42
• Given the placement of the cut scores for the Proficient category, the percentages of students who would be classified as Proficient or above are shown, by grade level, in Table 14-1.
• In this example, the percentage of students considered Proficient or better (e.g., Proficient or Advanced) goes up from Grade 3 to Grade 4, drops back down at Grade 5, goes up again at Grade 6, drops again at Grade 7, and rises again at Grade 8.
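The up-and-down pattern in Table 14-1 can be made explicit with a short calculation. The following is only an illustrative sketch: the function names are hypothetical, and the percentages are the hypothetical values from Table 14-1.

```python
# Percentages at or above Proficient, taken from Table 14-1 (hypothetical values).
percent_proficient = {3: 37, 4: 41, 5: 34, 6: 43, 7: 29, 8: 42}


def grade_to_grade_changes(percentages):
    """Return {(lower_grade, upper_grade): change in percentage points}."""
    grades = sorted(percentages)
    return {
        (lower, upper): percentages[upper] - percentages[lower]
        for lower, upper in zip(grades, grades[1:])
    }


def count_reversals(changes):
    """Count how often the direction of change flips between adjacent grade pairs."""
    deltas = list(changes.values())
    return sum(1 for a, b in zip(deltas, deltas[1:]) if a * b < 0)


changes = grade_to_grade_changes(percent_proficient)
for (lower, upper), delta in changes.items():
    print(f"Grade {lower} -> {upper}: {delta:+d} points")
print("Direction reversals:", count_reversals(changes))
```

Every adjacent pair of changes has opposite signs, which is the kind of grade-to-grade fluctuation that VMSS is meant to address.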
• Each of the standard-setting methods we have described is routinely applied in contexts where the need is for a single cut score or a set of performance levels for a single test.
• These contexts have included setting a single cut score on a licensing examination, deriving three performance levels (e.g., Basic, Proficient, Advanced) on a single end-of-grade subject area test, and obtaining multiple cut scores on a set of four or five different subject tests used to determine eligibility for high school graduation.
• In each instance, there was no apparent need to link cut scores on one test to those on another.
• Creating meaningful systems of performance standards across grades and subject areas poses challenges.
• One approach has been proposed by Lissitz and Huynh (2003), who introduced the concept of VMSS in a background paper prepared for the Arkansas Department of Education.
• The paper is titled “Vertical Equating for the Arkansas ACTAAP Assessments: Issues and Solutions in Determination of Adequate Yearly Progress and School Accountability.”
As an alternative to vertical scaling (equating), Lissitz and Huynh (2003b) recommended VMSS. Specifically, they recommended that “new cut scores for each test be set for all grades such that
• each achievement level has the same (generic) meaning across all grades, and
• the proportion of students in each achievement level follow a growth curve trend across these grades.”

They then offered a list of specific recommendations to carry out the vertical moderation of a set of performance standards across Grades 3–8, including an annual validation of the process.
• It remains to be seen whether VMSS will provide satisfactory solutions in the contexts to which it is currently being applied, and how adaptable it will be to other contexts.
VMSS typically focuses on percentages of students at various proficiency levels.
A simple solution to the problem of different percentages of students reaching a given performance level (say, Proficient) at different grades would be to set all standards at the same score point, or to set them by fiat so that equal percentages of students are classified as Proficient at each grade level.
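To make the equal-percentages idea concrete, here is a minimal sketch that picks, for each grade, the observed score whose percent-at-or-above is closest to one common target. The score lists, the 40% target, and the function names are hypothetical and chosen only for illustration; an operational program would work from its own score distributions.

```python
def percent_at_or_above(scores, cut):
    """Percentage of scores that are at or above the cut score."""
    return 100.0 * sum(s >= cut for s in scores) / len(scores)


def cut_score_for_target(scores, target_percent):
    """Pick the observed score whose percent-at-or-above is closest to the target."""
    candidates = sorted(set(scores))
    return min(
        candidates,
        key=lambda cut: abs(percent_at_or_above(scores, cut) - target_percent),
    )


# Hypothetical score distributions for two grades (10 students each).
grade_3_scores = [12, 15, 18, 20, 22, 25, 27, 30, 33, 35]
grade_4_scores = [14, 16, 21, 23, 26, 28, 31, 34, 36, 39]

target = 40  # same percentage imposed at every grade (an assumed value)
for label, scores in [("Grade 3", grade_3_scores), ("Grade 4", grade_4_scores)]:
    cut = cut_score_for_target(scores, target)
    print(label, "cut score:", cut, "percent at or above:", percent_at_or_above(scores, cut))
```

The cut scores differ by grade, but the percentage classified as Proficient is identical, which is exactly what this approach imposes.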
An alternative would be to set standards only for the lowest and highest grades and then align the percentages of Proficient students in the intermediate grades accordingly. In the preceding example, we would simply take the 37% figure for Grade 3 and the 42% figure for Grade 8 and set cut scores for Grades 4–7 so that the resulting percentages of students at or above Proficient fall on a straight line between 37% and 42%, as shown in Table 14-2. If complete standard-setting procedures were used to establish the cut scores for Grades 3 and 8, then there is some reason to have confidence in at least the 37% and 42% values for the anchor grades.
Table 14-2 Results of Smoothing Standard-Setting Results Across Six Grade Levels

Grade   Percentage of Students Classified as At or Above Proficient Performance Level
3       37
4       38
5       39
6       40
7       41
8       42
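The straight-line alignment between the two anchor grades amounts to linear interpolation. The sketch below reproduces the Table 14-2 values from the Grade 3 and Grade 8 anchors; the function name is hypothetical.

```python
def interpolate_percentages(low_grade, low_pct, high_grade, high_pct):
    """Linearly interpolate percent-Proficient targets for every grade in the range."""
    step = (high_pct - low_pct) / (high_grade - low_grade)
    return {
        grade: low_pct + step * (grade - low_grade)
        for grade in range(low_grade, high_grade + 1)
    }


# Anchor values from the example: 37% at Grade 3 and 42% at Grade 8.
targets = interpolate_percentages(3, 37, 8, 42)
for grade, pct in targets.items():
    print(f"Grade {grade}: {pct:.0f}%")
```

Cut scores for Grades 4–7 would then be chosen so that each grade's percentage at or above Proficient matches its interpolated target.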
According to these authors, assumptions for standard setting are based on the intersection of three growth models and four expected growth amounts, as depicted in Table 14-3. The three growth models may be summarized as follows:
(1) Linear growth, which assumes that the proficiency of all examinees increases by a fixed amount, and examinees retain their positions relative to one another;
(2) Remediation, which assumes that the proficiency of examinees at the lower end of the score distribution increases more than that of examinees at the upper end; and
(3) Acceleration, which assumes that the proficiency of examinees in the upper portion of the score distribution increases at a greater rate than that of examinees at the lower end of the score distribution.
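The three growth models can be illustrated with a toy calculation. This is a sketch under stated assumptions, not the authors' formulation: the scores and gain amounts are hypothetical, and the gain under remediation and acceleration is assumed, purely for illustration, to vary linearly with an examinee's position in the score range.

```python
def linear_growth(scores, gain):
    """Every examinee gains the same fixed amount, so relative positions are kept."""
    return [s + gain for s in scores]


def graded_growth(scores, gain_at_bottom, gain_at_top):
    """Gain varies linearly from the lowest score to the highest (an assumption)."""
    lowest, highest = min(scores), max(scores)
    span = (highest - lowest) or 1  # avoid division by zero if all scores are equal
    return [
        s + gain_at_bottom + (gain_at_top - gain_at_bottom) * (s - lowest) / span
        for s in scores
    ]


scores = [10, 20, 30, 40, 50]  # hypothetical proficiency scores

linear = linear_growth(scores, 5)
remediation = graded_growth(scores, gain_at_bottom=8, gain_at_top=2)   # low end gains more
acceleration = graded_growth(scores, gain_at_bottom=2, gain_at_top=8)  # high end gains more

print("Linear:      ", linear)
print("Remediation: ", remediation)
print("Acceleration:", acceleration)
```

Under remediation the spread of scores narrows, under acceleration it widens, and under linear growth it stays the same.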
Because VMSS is so new, results of VMSS approaches have only recently been presented in the psychometric literature.
As we noted
Table 14-3 Growth Trajectories by Growth Model Type (Linear, Remediation, Acceleration)

Negative growth
• Linear: All groups show decline over time.
• Remediation: Overall group shows negative growth, but remedial group fares better than nonremedial group.
• Acceleration: Overall group shows negative growth, but nonremedial group fares better than remedial group.

No growth
• Linear: All groups show no growth over time.
• Remediation: Overall group shows no growth, but remedial group fares better than nonremedial group.
• Acceleration: Overall group shows no growth, but nonremedial group fares better than remedial group.

Low growth
• Linear: All groups show low growth over time.
• Remediation: Overall group shows low growth, with most gain coming from remedial group.
• Acceleration: Overall group shows low growth, with most gain coming from nonremedial group.

Moderate growth
• Linear: All groups show moderate growth over time.
• Remediation: Overall group shows moderate growth, with most gain coming from remedial group.
• Acceleration: Overall group shows moderate growth, with most gain coming from nonremedial group.
