Test Validity




Introduction
Validity is arguably the most important criterion for the quality of a test. The term validity
refers to whether or not the test measures what it claims to measure. On a test with high
validity the items will be closely linked to the test’s intended focus. For many certification
and licensure tests this means that the items will be highly related to a specific job or
occupation. If a test has poor validity then it does not measure the job-related content and
competencies it ought to. When this is the case, there is no justification for using the test
results for their intended purpose. There are several ways to estimate the validity of a test,
including content validity, concurrent validity, and predictive validity. The face validity of a
test is sometimes also mentioned.


Types of Validity

Content Validity
While there are several types of validity, the most important type for most certification
and licensure programs is probably that of content validity. Content validity is a logical
process where connections between the test items and the job-related tasks are
established. If a thorough test development process was followed, a job analysis was
properly conducted, an appropriate set of test specifications was developed, and item
writing guidelines were carefully followed, then the content validity of the test is likely to
be very high. Content validity is typically estimated by gathering a group of subject
matter experts (SMEs) together to review the test items. Specifically, these SMEs are
given the list of content areas specified in the test blueprint, along with the test items
intended to be based on each content area. The SMEs are then asked to indicate whether
or not they agree that each item is appropriately matched to the content area indicated.
Any items that the SMEs identify as being inadequately matched to the test blueprint, or
flawed in any other way, are either revised or dropped from the test.
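
The SME review described above can be tallied in a few lines. The following is a minimal sketch, not a prescribed procedure: the function name flag_items, the item IDs, the sample votes, and the 0.75 agreement threshold are all illustrative assumptions, since the text does not specify how much agreement is required.

```python
from typing import Dict, List


def flag_items(ratings: Dict[str, List[bool]], min_agreement: float = 0.75) -> List[str]:
    """Return the IDs of items whose SME agreement is below min_agreement.

    ratings maps an item ID to one vote per SME: True means the SME agrees
    that the item matches its assigned content area in the test blueprint.
    The 0.75 default is a hypothetical threshold chosen for illustration.
    """
    flagged = []
    for item_id, votes in ratings.items():
        if not votes:
            raise ValueError(f"no SME votes recorded for {item_id}")
        agreement = sum(votes) / len(votes)  # proportion of SMEs who agree
        if agreement < min_agreement:
            flagged.append(item_id)
    return flagged


# Hypothetical votes from a panel of five SMEs.
ratings = {
    "item_01": [True, True, True, True, True],
    "item_02": [True, False, False, True, False],
    "item_03": [True, True, True, True, False],
}
flagged = flag_items(ratings)
print(flagged)  # items to revise or drop
```

Items returned by the function are the candidates for revision or removal; the decision itself remains with the SMEs.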

Concurrent Validity
Another important method for investigating the validity of a test is concurrent validity.
Concurrent validity is a statistical method using correlation, rather than a logical method.
Examinees who are known to be either masters or non-masters on the content measured
by the test are identified, and the test is administered to them under realistic exam
conditions. Once the tests have been scored, the relationship is estimated between the
examinees’ known status as either masters or non-masters and their classification as
masters or non-masters (i.e., pass or fail) based on the test. This type of validity provides
evidence that the test is classifying examinees correctly. The stronger the correlation,
the greater the concurrent validity of the test.
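
Because both the known status and the test classification are two-category variables, one common choice of correlation here is the phi coefficient computed from the 2x2 table of outcomes. The text does not name a specific statistic, so treat this as one reasonable option; the function name and the sample data are hypothetical.

```python
import math
from typing import Sequence


def phi_coefficient(known_master: Sequence[bool], passed: Sequence[bool]) -> float:
    """Phi coefficient between known mastery status and pass/fail on the test."""
    if len(known_master) != len(passed):
        raise ValueError("both sequences must have one entry per examinee")
    pairs = list(zip(known_master, passed))
    a = sum(1 for k, p in pairs if k and p)          # masters who passed
    b = sum(1 for k, p in pairs if k and not p)      # masters who failed
    c = sum(1 for k, p in pairs if not k and p)      # non-masters who passed
    d = sum(1 for k, p in pairs if not k and not p)  # non-masters who failed
    denom = math.sqrt((a + b) * (c + d) * (a + c) * (b + d))
    if denom == 0:
        raise ValueError("phi is undefined when a row or column of the table is empty")
    return (a * d - b * c) / denom


# Hypothetical data: four known masters, four known non-masters.
known = [True, True, True, True, False, False, False, False]
passed = [True, True, True, False, False, False, False, True]
phi = phi_coefficient(known, passed)
print(phi)  # 0.5
```

A value near 1 indicates that the test classifies examinees the same way as their known status; a value near 0 indicates little relationship.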

Predictive Validity
Another statistical approach to validity is predictive validity. This approach is similar to
concurrent validity, in that it measures the relationship between examinees'
performances on the test and their actual status as masters or non-masters. However,
with predictive validity, it is the relationship of test scores to an examinee's future
performance as a master or non-master that is estimated. In other words, predictive
validity considers the question, "How well does the test predict examinees' future status
as masters or non-masters?" For this type of validity, the correlation that is computed is
between the examinees' classifications as master or non-master based on the test and
their later performance, perhaps on the job. This type of validity is especially useful for
test purposes such as selection or admissions.
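
As a sketch of the computation, suppose later performance is recorded as a numeric job rating; this is an assumption, since the text says only "later performance, perhaps on the job." The Pearson correlation between a pass/fail classification (coded 1/0) and such a rating is then the point-biserial correlation. The function name and the data below are hypothetical.

```python
import math
from typing import Sequence


def pearson_r(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation between two equal-length numeric sequences."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length with at least 2 values")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined when either variable is constant")
    return sxy / math.sqrt(sxx * syy)


# Hypothetical data: test classification at the time of testing (1 = pass,
# 0 = fail) and a supervisor rating on a 1-5 scale collected later.
test_passed = [1, 1, 1, 0, 0, 0]
later_rating = [4, 5, 3, 2, 3, 1]
r = pearson_r(test_passed, later_rating)
print(round(r, 3))
```

The only difference from the concurrent case is timing: the criterion measure is collected after the test rather than known beforehand.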

Face Validity
One additional type of validity that you may hear mentioned is face validity. Like content
validity, face validity is determined by a review of the items and not through the use of
statistical analyses. Unlike content validity, face validity is not investigated through
formal procedures and is not determined by subject matter experts. Instead, anyone who
looks over the test, including examinees and other stakeholders, may develop an informal
opinion as to whether or not the test is measuring what it is supposed to measure. While
it is clearly of some value to have the test appear to be valid, face validity alone is
insufficient for establishing that the test is measuring what it claims to measure. A
well-developed exam program will include formal studies into other, more substantive types of
validity.

Summary
The validity of a test is critical because, without sufficient validity, test scores have no
meaning. The evidence you collect and document about the validity of your test is also
your best legal defense should the exam program ever be challenged in a court of law.
While there are several ways to estimate validity, for many certification and licensure
exam programs the most important type of validity to establish is content validity.



                                                                      Professional Testing Inc.
                                                                                   © PTI 2006
