Topic 4B
Test Construction
The test construction process
• Defining the Test
• Selecting Scaling Method
• Constructing the items
• Testing the items
• Revising the test
• Publishing the test
Defining the Test
• State the purpose of the test explicitly and
propose a fresh focus for what the test
intends to measure, for example,
intelligence.
• Example: Kaufman Assessment Battery for Children (K-ABC)
Selecting Scaling Method
• The immediate purpose of psychological
testing is to assign numbers to responses
on a test so that the examinee can be
judged to have more or less of the
characteristic measured.
• Levels of Measurement: Stevens (1946)
• Nominal scale, ordinal scale, interval
scale, ratio scale
Selecting Scaling Method
• Nominal scale: allows for categorizing
• Ordinal scale: allows for ranking
• Interval scale: uses equal intervals
• Ratio scale: possesses a true zero point
Representative scaling methods
• Expert rankings
• Method of equal-appearing intervals
• Likert scales
• Method of empirical keying
• Rational scale construction (internal
consistency)
Constructing the Items
• Should item content be homogeneous or
varied?
• What range of difficulty should the items
cover?
• How many initial items should be
constructed?
• Which cognitive processes and item
domains should be tapped?
• What kind of test item should be used?
Testing the Items
• In conducting a thorough item analysis,
the test developer might use an
item-difficulty index, an item-reliability index,
an item-validity index, an item-characteristic
curve, and an index of item discrimination.
Item-difficulty index
• Generally, item difficulties that hover
around 0.5, ranging between 0.3 and 0.7,
maximize the information the test provides
about differences between examinees.
• However, this rule of thumb is subject to
one important qualification and one very
significant exception.
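As a minimal sketch of the rule of thumb above, the item-difficulty index can be computed as the proportion of examinees who pass an item. The function names and the 0/1 scoring of responses are assumptions made for illustration; the 0.3 to 0.7 band is taken from the slide.

```python
def item_difficulty(item_scores):
    """Proportion of examinees who passed the item.

    item_scores: list of 0/1 scores, one per examinee (assumed scoring).
    """
    return sum(item_scores) / len(item_scores)


def in_target_band(p, low=0.3, high=0.7):
    """True if difficulty p falls inside the rule-of-thumb band."""
    return low <= p <= high


# Hypothetical responses of four examinees to two items.
easy_item = [1, 1, 1, 0]
medium_item = [1, 0, 1, 0]

p_easy = item_difficulty(easy_item)
p_medium = item_difficulty(medium_item)
print(p_easy, in_target_band(p_easy))
print(p_medium, in_target_band(p_medium))
```

Note that a higher value means an easier item, since the index counts passes rather than failures.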
Item-reliability index
• The item-reliability index is the product of
the item's standard deviation and the
point-biserial correlation between the item
score and the total test score.
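The product described above can be sketched in a few lines. This is an illustrative sketch, not a reference implementation: it assumes items are scored 0/1, uses the total test score as the second variable, and relies on the fact that the point-biserial correlation is the Pearson correlation computed with one dichotomous variable. The function names and the sample data are hypothetical.

```python
from math import sqrt
from statistics import mean, pstdev


def pearson_r(x, y):
    """Pearson correlation; equals the point-biserial when x is 0/1."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ssx = sum((a - mx) ** 2 for a in x)
    ssy = sum((b - my) ** 2 for b in y)
    return cov / sqrt(ssx * ssy)


def item_reliability_index(item_scores, total_scores):
    """Item standard deviation times the item-total correlation."""
    s_i = pstdev(item_scores)  # equals sqrt(p * (1 - p)) for a 0/1 item
    r_it = pearson_r(item_scores, total_scores)
    return s_i * r_it


# Hypothetical data: four examinees, one item, and their total scores.
item = [1, 1, 0, 0]
totals = [10, 8, 6, 4]
print(round(item_reliability_index(item, totals), 4))
```

Because the standard deviation of a 0/1 item peaks at a difficulty of 0.5, items of middling difficulty can reach the largest index values for a given correlation.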
Item-validity index
• The point-biserial correlation between the
item score and the score on the criterion
variable is computed first.
• The item-validity index is then the
product of the item's standard deviation and
this point-biserial correlation.
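As a worked illustration of this product, assume a dichotomously scored item. The symbols are introduced here for illustration and do not appear in the slides: $p_i$ is the proportion of examinees passing item $i$, $s_i$ is the item's standard deviation, and $r_{iC}$ is the point-biserial correlation between the item score and the criterion score.

```latex
s_i = \sqrt{p_i\,(1 - p_i)}, \qquad
\text{item-validity index} = s_i \, r_{iC}

% Hypothetical values: p_i = 0.5 and r_{iC} = 0.40
s_i = \sqrt{0.5 \times 0.5} = 0.5, \qquad
s_i \, r_{iC} = 0.5 \times 0.40 = 0.20
```

The only difference from the item-reliability index is the second variable in the correlation: an external criterion here, the total test score there.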
Item-characteristic curve
• Figure 4.8
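The figure itself is not reproduced in this transcript. As a rough sketch of what an item-characteristic curve plots (the probability of passing an item as a function of the level of the trait), the snippet below assumes a two-parameter logistic form. This is one common parametric choice and not necessarily the curve shown in Figure 4.8; the function name and the parameters `a` (discrimination) and `b` (difficulty) are illustrative.

```python
from math import exp


def icc(theta, a=1.0, b=0.0):
    """Probability of a correct response at trait level theta.

    Assumed two-parameter logistic model:
    a controls the slope, b is the trait level where P = 0.5.
    """
    return 1.0 / (1.0 + exp(-a * (theta - b)))


# Tabulate the curve for a hypothetical item at a few trait levels.
for theta in (-2.0, -1.0, 0.0, 1.0, 2.0):
    print(theta, round(icc(theta, a=1.5, b=0.0), 3))
```

A steeper curve (larger `a`) separates examinees just below and just above `b` more sharply.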
Item-discrimination index
• An ideal test item is one that most of the
high scorers pass and most of the low
scorers fail.
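One common way to quantify this idea is to compare the proportion passing the item in an upper-scoring group with the proportion passing in a lower-scoring group. The sketch below assumes 0/1 item scores and splits on total score using the top and bottom 27% of examinees, a conventional cutoff that is an assumption here and is not stated in the slides. The function name and the data are hypothetical.

```python
def discrimination_index(item_scores, total_scores, fraction=0.27):
    """Proportion passing in the upper group minus the lower group.

    Groups are the top and bottom `fraction` of examinees by total score.
    """
    n = len(total_scores)
    k = max(1, round(n * fraction))  # examinees per extreme group
    order = sorted(range(n), key=lambda i: total_scores[i])
    lower, upper = order[:k], order[-k:]
    p_upper = sum(item_scores[i] for i in upper) / k
    p_lower = sum(item_scores[i] for i in lower) / k
    return p_upper - p_lower


# Hypothetical data for ten examinees.
item = [1, 1, 1, 0, 1, 0, 0, 0, 1, 0]
totals = [10, 9, 8, 7, 6, 5, 4, 3, 2, 1]
print(round(discrimination_index(item, totals), 3))
```

Values near +1 match the ideal described above, values near 0 indicate an item that does not separate the groups, and negative values flag an item that low scorers pass more often than high scorers.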
Revising the Test
• Cross Validation
• Validity Shrinkage
• Feedback from Examinees
Publishing the Test
• Production of Testing Materials
• Technical Manual and User’s Manual
• Testing is Big Business
