More on Assessment Part 2: Technical and Methodological Principles
Chapter 3 Norms and the Meaning of Test Scores Raw scores and percentages are useless in isolation Scores have meaning only when compared to Norms Norms come from a sample that tells us what individuals in a “representative group” would do Where does an individual fall in the distribution?
Norms: Types and Purposes Purposes Compare different areas within a person Place a person within a group Types Developmental level obtained Position within a group
Important Statistical Concepts Frequency distribution Distribution curve Histogram Frequency Polygon Normal Curve
Measures of Central Tendency Mean Median Mode
Measures of Central Tendency 1,  4,  7,  3,  3,  2,  5,  6,  4,  3,  5,  5,  2,  4,  6,  4 Frequency Distributions
Measures of Central Tendency Mean, Median, and Mode all equal 4 Normal distribution
Variability Range Deviation From the mean Variance Standard Deviation Square root of variance
Standard Deviation Allows for easiest comparison of scores Jane is 1.5 standard deviations from the mean John is .5 standard deviations bellow the mean Jim is 1 standard deviation above the mean in math and 2 standard deviations above the mean in verbal ability Figure 3.3 on page 54
Developmental Norms Mental Age Grade Equivalents Ordinal Scales for child development for babies…less judgment imposed…more descriptive
Within-Group Norms Percentiles Every 10% of population is a percentile Radiates out from mean—fig. 3.4 on pg. 59 Standard Scores Using formula on pg. 61, Mean=0 and SD=1 May be modified to avoid negative numbers Deviation IQs A standard score Mean of 100 SD of 16
Within-Group Norms Stanines Divides the normal curve into nine categories Table 3.4 on pg. 63 Interrelationships of Within-Group Scores Figure 3.6 on pg. 67
Relativity of Norms Normative Sample Population vs Sample Another warning about representation Interest Comparisons Keep in mind that, even if two tests measure the same thing, different norms mean that 118 may equal 112 or 145 National Anchor norms-tables to compare tests Specific Norms—admit limitations and norm test to narrower groups Fixed reference groups—norms never change (SAT)
Computers and Test Scores Allows for complex methods of testing Allows for narrative interpretations Allows for easier scoring MMPI templates Over reliance on computer interpretations Compatibility of Scores
Domain Referenced Interpretation What has been mastered instead of how does this individual compare to others.
Minimum Qualifications and Cutoff Scores Should be based on Criterion Comparison Remember false positives and false negatives Table 3.6 pg. 82
Chapter 4 Reliability All about consistency Test score = true score + or – error Measured variance= TV + or - EV
Correlation Coefficients
Correlation Coefficients What do they tell us? Statistical Significance Confidence Intervals
Reliability Coefficients Are Correlation Coefficients Types of Reliability Coefficients Test-Retest Reliability Alternate Form Reliability Split-Half Reliability Spearmand-Brown Formula on pg. 96 Interitem Reliability Scorer Reliability Speed Tests vs. Power Tests
Reliability and Sample Greater variance in sample allows for higher reliability to be demonstrated Fig. 4.5 on pg. 106
Chapter 5 Validity: Basic Concepts Content Validity Face Validity Criterion-Prediction Procedures Concurrent Validity (diagnostic) Predictive Validity Construct Validity Theoretically derived
More on Construct Validity Developmental Changes Correlations with other tests Factor Analysis Internal Consistency Convergent and Discriminant Validation Interventions
Chapter 6 Validity: Measurement and Interpretation Test selection Test interpretation
Validity Coefficient Validity Coefficient is a correlation coefficient of assessment and criterion Dependent on the group Valid for 9 th  grade does not mean valid for 12 th   Acceptable magnitude will depend on purpose Decision Theory—false accepts and false rejects Sequential testing to improve accuracy Moderator Variables Variables that change the validity
Combining Information from Tests Multiple Regression pg. 157 Provides a single value Allows for compensation Profile Analysis with cutoff scores Requires review of multiple values Does not allow for compensation
Uses of Tests Selection Placement Classification Test Bias pg. 167 Slope bias Intercept bias
Chapter 7 Item Analysis Useful both in evaluation tests and in creating tests (including classroom exams and quizzes) Can allow for a shorter test to be more reliable and valid than a longer one by removing useless items
Item Analysis Item Difficulty Distribution of Test Scores Item Discrimination Use of extreme groups Item Response Theory (IRT) pgs. 189 and 190 Item Analysis of Speeded Tests Cross Validation Differential Item Functioning
Item Analysis Cross Validation Use one (or more than one) sample for item selection Use different samples to validate the test Keeps error random instead of allowing it to become bias Differential Item Functioning

More Related Content

PPTX
Utilization of assessment data
PPTX
Types of Scores & Types of Standard Scores
PPTX
Measurement and instrumentaion
PPTX
Six steps for avoiding misinterpretations
PPT
Interpreting Test Scores
PPT
Chapter 16 ppt eval & testing 4e formatted 01.10 kg edits
PPT
Test scores
PPT
Statistics for interpreting test scores
Utilization of assessment data
Types of Scores & Types of Standard Scores
Measurement and instrumentaion
Six steps for avoiding misinterpretations
Interpreting Test Scores
Chapter 16 ppt eval & testing 4e formatted 01.10 kg edits
Test scores
Statistics for interpreting test scores

What's hot (19)

PPT
Norms[1]
PPTX
Spe 501 class 11
PPTX
Test item analysis
PPTX
Administering, analyzing, and improving the test or assessment
PDF
Item Analysis - Discrimination and Difficulty Index
PPTX
Qualitative item analysis
PPTX
Item analysis
PPTX
Administering, Scoring, and Analyzing Test
PPT
Item and Distracter Analysis
PPTX
Item analysis report
DOCX
Administering, Analyzing, and Improving the Test or Assessment
PPTX
Interpretation of test Scores
PPTX
ITEM ANALYSIS
PPT
Analyzing and using test item data
PPTX
Item Analysis and Validation
PPTX
Item analysis
PPTX
Analyzing and Using Test Item Data
PPTX
Quantitative Item Analysis
PPTX
Improving the test items
Norms[1]
Spe 501 class 11
Test item analysis
Administering, analyzing, and improving the test or assessment
Item Analysis - Discrimination and Difficulty Index
Qualitative item analysis
Item analysis
Administering, Scoring, and Analyzing Test
Item and Distracter Analysis
Item analysis report
Administering, Analyzing, and Improving the Test or Assessment
Interpretation of test Scores
ITEM ANALYSIS
Analyzing and using test item data
Item Analysis and Validation
Item analysis
Analyzing and Using Test Item Data
Quantitative Item Analysis
Improving the test items
Ad

Viewers also liked (20)

PDF
Heyday Holiday 2010 photoshoot
PPT
8th Grade Chapter 1 Lesson 5
PPS
Feliz Cumple Romi
DOC
Works Cited Rad
PDF
Reporte industria 5
PDF
Reporte de la industria 4
PDF
“O Futuro Faz-se Hoje” é o mote para a celebração do Dia do Porto de Lisboa (...
DOC
CHRISTINE E. ANDREWS EA
PPT
7th Grade Chapter 2 Lesson 5
PPTX
Evaluation question 7
DOC
rajesh GAYATRI
PDF
Cob 20090304 1
DOCX
latest_resume_shakir_1 (1)
PDF
Reporte industria 2
PPTX
Final Project Sriram A R-IITM 2016
PPT
9th Grade Chapter 4 Lesson 4
DOCX
BSCS Course Description
DOC
Beyond Media Media Kit
DOCX
BSCS Course Description
PPT
Factor theorem solving cubic equations
Heyday Holiday 2010 photoshoot
8th Grade Chapter 1 Lesson 5
Feliz Cumple Romi
Works Cited Rad
Reporte industria 5
Reporte de la industria 4
“O Futuro Faz-se Hoje” é o mote para a celebração do Dia do Porto de Lisboa (...
CHRISTINE E. ANDREWS EA
7th Grade Chapter 2 Lesson 5
Evaluation question 7
rajesh GAYATRI
Cob 20090304 1
latest_resume_shakir_1 (1)
Reporte industria 2
Final Project Sriram A R-IITM 2016
9th Grade Chapter 4 Lesson 4
BSCS Course Description
Beyond Media Media Kit
BSCS Course Description
Factor theorem solving cubic equations
Ad

Similar to Coun 224 Session2 (20)

PPTX
Chapter 2 The Science of Psychological Measurement (Alivio, Ansula).pptx
PDF
psychological-assessment-summaries-pdf.pdf
PPT
EEX 501 Assessment
PPT
EEX 501 Assess Ch4,5,6,7,All
PPTX
Psychological testing report sample.pptx
PPTX
Practical Language Testing by Fulcher (2010)
PDF
Psychological testing
PPTX
Chapter 4: Of Tests and Testing
PPTX
Norms and Interpretation of Test Scores.pptx
PPT
Quantitative measurement
PPT
instrument development and psychometrics
PDF
Testing & measurement
PDF
Nature-and-use of psychological in testing
PPTX
Test standardization
DOCX
Criteria to consider when constructing good tests
DOCX
Criteria to Consider when Constructing Good tests
PPTX
Development of Norms this is presentation.pptx
PDF
Tools construction and validation of Tools
PPTX
Norms and the Meaning of Test Scores
PPTX
test construction masters statistics.pptx
Chapter 2 The Science of Psychological Measurement (Alivio, Ansula).pptx
psychological-assessment-summaries-pdf.pdf
EEX 501 Assessment
EEX 501 Assess Ch4,5,6,7,All
Psychological testing report sample.pptx
Practical Language Testing by Fulcher (2010)
Psychological testing
Chapter 4: Of Tests and Testing
Norms and Interpretation of Test Scores.pptx
Quantitative measurement
instrument development and psychometrics
Testing & measurement
Nature-and-use of psychological in testing
Test standardization
Criteria to consider when constructing good tests
Criteria to Consider when Constructing Good tests
Development of Norms this is presentation.pptx
Tools construction and validation of Tools
Norms and the Meaning of Test Scores
test construction masters statistics.pptx

Recently uploaded (20)

PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
FORM 1 BIOLOGY MIND MAPS and their schemes
PDF
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
PPTX
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
PPTX
A powerpoint presentation on the Revised K-10 Science Shaping Paper
PPTX
Education and Perspectives of Education.pptx
PDF
My India Quiz Book_20210205121199924.pdf
PDF
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
PDF
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
PDF
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
PPTX
What’s under the hood: Parsing standardized learning content for AI
PDF
Hazard Identification & Risk Assessment .pdf
PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
PDF
Race Reva University – Shaping Future Leaders in Artificial Intelligence
PDF
International_Financial_Reporting_Standa.pdf
PDF
Environmental Education MCQ BD2EE - Share Source.pdf
PPTX
Core Concepts of Personalized Learning and Virtual Learning Environments
PDF
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
PPTX
Share_Module_2_Power_conflict_and_negotiation.pptx
PDF
Empowerment Technology for Senior High School Guide
Paper A Mock Exam 9_ Attempt review.pdf.
FORM 1 BIOLOGY MIND MAPS and their schemes
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
A powerpoint presentation on the Revised K-10 Science Shaping Paper
Education and Perspectives of Education.pptx
My India Quiz Book_20210205121199924.pdf
LIFE & LIVING TRILOGY - PART (3) REALITY & MYSTERY.pdf
LIFE & LIVING TRILOGY- PART (1) WHO ARE WE.pdf
Vision Prelims GS PYQ Analysis 2011-2022 www.upscpdf.com.pdf
What’s under the hood: Parsing standardized learning content for AI
Hazard Identification & Risk Assessment .pdf
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
Race Reva University – Shaping Future Leaders in Artificial Intelligence
International_Financial_Reporting_Standa.pdf
Environmental Education MCQ BD2EE - Share Source.pdf
Core Concepts of Personalized Learning and Virtual Learning Environments
CISA (Certified Information Systems Auditor) Domain-Wise Summary.pdf
Share_Module_2_Power_conflict_and_negotiation.pptx
Empowerment Technology for Senior High School Guide

Coun 224 Session2

  • 1. More on Assessment Part 2: Technical and Methodological Principles
  • 2. Chapter 3 Norms and the Meaning of Test Scores Raw scores and percentages are useless in isolation Scores have meaning only when compared to Norms Norms come from a sample that tells us what individuals in a “representative group” would do Where does an individual fall in the distribution?
  • 3. Norms: Types and Purposes Purposes Compare different areas within a person Place a person within a group Types Developmental level obtained Position within a group
  • 4. Important Statistical Concepts Frequency distribution Distribution curve Histogram Frequency Polygon Normal Curve
  • 5. Measures of Central Tendency Mean Median Mode
  • 6. Measures of Central Tendency 1, 4, 7, 3, 3, 2, 5, 6, 4, 3, 5, 5, 2, 4, 6, 4 Frequency Distributions
  • 7. Measures of Central Tendency Mean, Median, and Mode all equal 4 Normal distribution
  • 8. Variability Range Deviation From the mean Variance Standard Deviation Square root of variance
  • 9. Standard Deviation Allows for easiest comparison of scores Jane is 1.5 standard deviations from the mean John is .5 standard deviations bellow the mean Jim is 1 standard deviation above the mean in math and 2 standard deviations above the mean in verbal ability Figure 3.3 on page 54
  • 10. Developmental Norms Mental Age Grade Equivalents Ordinal Scales for child development for babies…less judgment imposed…more descriptive
  • 11. Within-Group Norms Percentiles Every 10% of population is a percentile Radiates out from mean—fig. 3.4 on pg. 59 Standard Scores Using formula on pg. 61, Mean=0 and SD=1 May be modified to avoid negative numbers Deviation IQs A standard score Mean of 100 SD of 16
  • 12. Within-Group Norms Stanines Divides the normal curve into nine categories Table 3.4 on pg. 63 Interrelationships of Within-Group Scores Figure 3.6 on pg. 67
  • 13. Relativity of Norms Normative Sample Population vs Sample Another warning about representation Interest Comparisons Keep in mind that, even if two tests measure the same thing, different norms mean that 118 may equal 112 or 145 National Anchor norms-tables to compare tests Specific Norms—admit limitations and norm test to narrower groups Fixed reference groups—norms never change (SAT)
  • 14. Computers and Test Scores Allows for complex methods of testing Allows for narrative interpretations Allows for easier scoring MMPI templates Over reliance on computer interpretations Compatibility of Scores
  • 15. Domain Referenced Interpretation What has been mastered instead of how does this individual compare to others.
  • 16. Minimum Qualifications and Cutoff Scores Should be based on Criterion Comparison Remember false positives and false negatives Table 3.6 pg. 82
  • 17. Chapter 4 Reliability All about consistency Test score = true score + or – error Measured variance= TV + or - EV
  • 19. Correlation Coefficients What do they tell us? Statistical Significance Confidence Intervals
  • 20. Reliability Coefficients Are Correlation Coefficients Types of Reliability Coefficients Test-Retest Reliability Alternate Form Reliability Split-Half Reliability Spearmand-Brown Formula on pg. 96 Interitem Reliability Scorer Reliability Speed Tests vs. Power Tests
  • 21. Reliability and Sample Greater variance in sample allows for higher reliability to be demonstrated Fig. 4.5 on pg. 106
  • 22. Chapter 5 Validity: Basic Concepts Content Validity Face Validity Criterion-Prediction Procedures Concurrent Validity (diagnostic) Predictive Validity Construct Validity Theoretically derived
  • 23. More on Construct Validity Developmental Changes Correlations with other tests Factor Analysis Internal Consistency Convergent and Discriminant Validation Interventions
  • 24. Chapter 6 Validity: Measurement and Interpretation Test selection Test interpretation
  • 25. Validity Coefficient Validity Coefficient is a correlation coefficient of assessment and criterion Dependent on the group Valid for 9 th grade does not mean valid for 12 th Acceptable magnitude will depend on purpose Decision Theory—false accepts and false rejects Sequential testing to improve accuracy Moderator Variables Variables that change the validity
  • 26. Combining Information from Tests Multiple Regression pg. 157 Provides a single value Allows for compensation Profile Analysis with cutoff scores Requires review of multiple values Does not allow for compensation
  • 27. Uses of Tests Selection Placement Classification Test Bias pg. 167 Slope bias Intercept bias
  • 28. Chapter 7 Item Analysis Useful both in evaluation tests and in creating tests (including classroom exams and quizzes) Can allow for a shorter test to be more reliable and valid than a longer one by removing useless items
  • 29. Item Analysis Item Difficulty Distribution of Test Scores Item Discrimination Use of extreme groups Item Response Theory (IRT) pgs. 189 and 190 Item Analysis of Speeded Tests Cross Validation Differential Item Functioning
  • 30. Item Analysis Cross Validation Use one (or more than one) sample for item selection Use different samples to validate the test Keeps error random instead of allowing it to become bias Differential Item Functioning