Coun 224 Session2

More on Assessment Part 2: Technical and Methodological Principles

Chapter 3 Norms and the Meaning of Test Scores Raw scores and percentages are useless in isolation Scores have meaning only when compared to Norms Norms come from a sample that tells us what individuals in a “representative group” would do Where does an individual fall in the distribution?

Norms: Types and Purposes Purposes Compare different areas within a person Place a person within a group Types Developmental level obtained Position within a group

Important Statistical Concepts Frequency distribution Distribution curve Histogram Frequency Polygon Normal Curve

Measures of Central Tendency Mean Median Mode

Measures of Central Tendency 1, 4, 7, 3, 3, 2, 5, 6, 4, 3, 5, 5, 2, 4, 6, 4 Frequency Distributions

Measures of Central Tendency Mean, Median, and Mode all equal 4 Normal distribution

Variability Range Deviation From the mean Variance Standard Deviation Square root of variance

Standard Deviation Allows for easiest comparison of scores Jane is 1.5 standard deviations from the mean John is .5 standard deviations bellow the mean Jim is 1 standard deviation above the mean in math and 2 standard deviations above the mean in verbal ability Figure 3.3 on page 54

Developmental Norms Mental Age Grade Equivalents Ordinal Scales for child development for babies…less judgment imposed…more descriptive

Within-Group Norms Percentiles Every 10% of population is a percentile Radiates out from mean—fig. 3.4 on pg. 59 Standard Scores Using formula on pg. 61, Mean=0 and SD=1 May be modified to avoid negative numbers Deviation IQs A standard score Mean of 100 SD of 16

Within-Group Norms Stanines Divides the normal curve into nine categories Table 3.4 on pg. 63 Interrelationships of Within-Group Scores Figure 3.6 on pg. 67

Relativity of Norms Normative Sample Population vs Sample Another warning about representation Interest Comparisons Keep in mind that, even if two tests measure the same thing, different norms mean that 118 may equal 112 or 145 National Anchor norms-tables to compare tests Specific Norms—admit limitations and norm test to narrower groups Fixed reference groups—norms never change (SAT)

Computers and Test Scores Allows for complex methods of testing Allows for narrative interpretations Allows for easier scoring MMPI templates Over reliance on computer interpretations Compatibility of Scores

Domain Referenced Interpretation What has been mastered instead of how does this individual compare to others.

Minimum Qualifications and Cutoff Scores Should be based on Criterion Comparison Remember false positives and false negatives Table 3.6 pg. 82

Chapter 4 Reliability All about consistency Test score = true score + or – error Measured variance= TV + or - EV

Correlation Coefficients What do they tell us? Statistical Significance Confidence Intervals

Reliability Coefficients Are Correlation Coefficients Types of Reliability Coefficients Test-Retest Reliability Alternate Form Reliability Split-Half Reliability Spearmand-Brown Formula on pg. 96 Interitem Reliability Scorer Reliability Speed Tests vs. Power Tests

Reliability and Sample Greater variance in sample allows for higher reliability to be demonstrated Fig. 4.5 on pg. 106

Chapter 5 Validity: Basic Concepts Content Validity Face Validity Criterion-Prediction Procedures Concurrent Validity (diagnostic) Predictive Validity Construct Validity Theoretically derived

More on Construct Validity Developmental Changes Correlations with other tests Factor Analysis Internal Consistency Convergent and Discriminant Validation Interventions

Chapter 6 Validity: Measurement and Interpretation Test selection Test interpretation

Validity Coefficient Validity Coefficient is a correlation coefficient of assessment and criterion Dependent on the group Valid for 9 th grade does not mean valid for 12 th Acceptable magnitude will depend on purpose Decision Theory—false accepts and false rejects Sequential testing to improve accuracy Moderator Variables Variables that change the validity

Combining Information from Tests Multiple Regression pg. 157 Provides a single value Allows for compensation Profile Analysis with cutoff scores Requires review of multiple values Does not allow for compensation

Uses of Tests Selection Placement Classification Test Bias pg. 167 Slope bias Intercept bias

Chapter 7 Item Analysis Useful both in evaluation tests and in creating tests (including classroom exams and quizzes) Can allow for a shorter test to be more reliable and valid than a longer one by removing useless items

Item Analysis Item Difficulty Distribution of Test Scores Item Discrimination Use of extreme groups Item Response Theory (IRT) pgs. 189 and 190 Item Analysis of Speeded Tests Cross Validation Differential Item Functioning

Item Analysis Cross Validation Use one (or more than one) sample for item selection Use different samples to validate the test Keeps error random instead of allowing it to become bias Differential Item Functioning

Coun 224 Session2

More Related Content

What's hot (19)

Viewers also liked (20)

Similar to Coun 224 Session2 (20)

Recently uploaded (20)

Coun 224 Session2