SlideShare a Scribd company logo
COMPUTER
ADAPTIVE TEST
(CAT)
SYED TABRAIZ BUKHARI
20
Computers as tools in testing
Generating Stimulation
Test Administration
Scoring
Interpretation
Minimize Scoring Error
Saves Time
CAPA and CAT
CAPA
Computer Assisted Psychological Assessment
CAT
Computer Adaptive Testing
Computer Adaptive Testing?
 Computerized adaptive testing (CAT) is a form of computer-based test that adapts to the
examinee's ability level.
 It has also been called tailored testing.
 In other words, it is a form of computer-administered test in which the next item or set of items
selected to be administered depends on the correctness of the test taker's responses to the
most recent items administered.
Origin of CAT
 Generally, computerized testing originated in the early 1970s (Drasgow, 2002; Wainer, 1990).
 Specifically, the first Computer Adaptive Test (CAT) was created by Larson and Madsen (1985) at
Brigham Young University, in the USA.
 Besides, After Larson and Madsen (1985), several scholars (e.g., Kaya-Carton, Carton & Dandonoli,
1991; Burston & Monville-Burston, 1995; Brown & Iwashita, 1996; Young, Shermis, Brutten &
Perkins, 1996) were motivated to construct and develop more computer adapted tests throughout
the 1990s.
 In 1998, the Test of English as a Foreign Language (TOEFL) began to use computer-adaptive
testing format
How does CAT Works?
 CAT successively selects questions for the purpose of maximizing the precision of the exam based on
what is known about the examinee from previous questions.
 From the examinee's perspective, the difficulty of the exam seems to tailor itself to their level of
ability. For example, if an examinee performs well on an item of intermediate difficulty, they will then
be presented with a more difficult question. Or, if they performed poorly, they would be presented
with a simpler question.
 Computer-adaptive tests require fewer test items to arrive at equally accurate scores. (Weiss and
Kingsbury,1984)
The basic computer-adaptive testing method is an iterative algorithm with the following steps:
 The pool of available items is searched for the optimal item, based on the current estimate of the
examinee's ability
 The chosen item is presented to the examinee, who then answers it correctly or incorrectly
 The ability estimate is updated, based upon all prior answers
 Steps 1–3 are repeated until a termination criterion is met
(Thissen & Mislevy, 2000).
Item response theory
 Item response theory is a probabilistic model that attempts to explain the response of a person to an
item.
 In its simplest form, item response theory posits that the probability of a random person j with ability θj
answering a random item i with difficulty bi correctly is conditioned upon the ability of the person and
the difficulty of the item. In other words, if a person has a high ability in a particular field, he or she will
probably get an easy item correct. Conversely, if a person has a low ability and the item is difficult, he or
she will probably get the item wrong.
Item Response Theory
 For example, we can expect someone with a large vocabulary to respond that they know easy words like
„smile‟ and „beautiful‟ but we should not expect someone with a small vocabulary to know words like
„subsidy‟ or „dissipate.‟
 When we analyze item responses, we are trying to answer the question,
“What is the probability of a person with a given ability responding correctly to an item with a given
difficulty?”
Components of CAT
A CAT procedure consists of the following components:
(a) Calibrated item pool,
(b) an entry level,
(c) an item selection rule,
(d) a scoring method, and
(e) a termination criterion.
A number of options are available for each of these components in the implementation of CAT for a
specific purpose.
Calibrated Item Pool
A pool of items must be available for the CAT to choose from. Such items can be created in the traditional
way (i.e., manually) or through Automatic Item Generation. The pool must be calibrated with a psychometric
model, which is used as a basis for the remaining four components.
Typically, item response theory is employed as the psychometric model. There are no specific guidelines for
the appropriate size of item pools; pools of 100 items can provide satisfactory results. What is essential to all
applications of CAT is that the difficulty levels of the items in the pool must span the full range of trait levels
in the population. Most efficient measurment results from using items with high discriminations.
Entry Level
In adaptive testing, it is possible to begin with items at different levels of difficulty for different students.
For example, if a student's achievement level is thought to be high, testing can begin with a relatively
difficult item. Because the level of difficulty of the items that are chosen will move to the student's trait
level as the test progresses, an erroneous entry level will not seriously affect the results, but accurate entry
levels will reduce the number of items required to achieve precise measurement. It will be shown later that
the choice of entry level should depend on the purpose of the test.
Item Selection Procedure
Two efficient item selection procedures are available.--- maximum information (Weiss, 1982) and Bayesian
(Owen, 1969, 1975). Both procedures involve searching the entire pool of administered items for a single
item: In maximum information item selection, it is the one that provides the maximum amount of item
information at the examinee's last trait level (0) estimate, and in Bayesian item selection, it is the one that
minimizes the expected posterior variance of the 0 estimate. Because information and Bayesian posterior
variance are related functions, in many cases a similar subset of items will be selected for a given individual
(Sympson, Weiss, & Ree, 1982).
Scoring Method
After an item is administered, the CAT updates its estimate of the examinee's ability level. If the examinee
answered the item correctly, the CAT will likely estimate their ability to be somewhat higher, and vice versa.
This is done by using the item response function to obtain a likelihood function of the examinee's ability.
Termination Criteria
Depends primarily on purpose of the test: point estimation or classification?
Point estimation: we want an accurate score for each student
Classification: we do NOT need an accurate score, just a classification into pass/fail etc.
Change: to see if score has gone up/down a certain amount
Advantages of CAT
Frequent retesting: the high number of permutations also enables more frequent receives a few weeks of
instruction, by the time they take a CAT again their ability has increased somewhat, and they will receive a
completely different test.
More precision: because CATs are more efficient, the organization has the option to design the CAT to
actually be more precise than a conventional test while still using fewer items.
Shorter tests: CAT can reduce testing time by 50% or more. This can obviously translate into huge financial
benefits.
Examinee experience: A CAT will provide an appropriate challenge for each examinee. Low examinees are not
discouraged or intimidated. High examinees enjoy receiving difficult items.
Advantages (Cont.)
Increased motivation: because of the better experience, there is likely an increase in examinee motivation.
Low examinees feel better, and high examinees feel challenged. Both will try harder than with a
conventional test.
Equiprecision: CATs can be designed so that examines are all measured with the same level of precision,
even though they all potentially see different items. This makes test extremely fair from a psychometric
perspective.
Security: because the CAT algorithm is very flexible and can adapt with potentially millions of
permutations, there is much greater security than if everyone was administered the same set of 200 items.
Disadvantages of CAT
 Recovery of poor starts: CATs are susceptible to issues with examinee test anxiety, as the elimination of
item review prevents someone from going back to the first few items. If they answered all those items
incorrectly due to severe test anxiety, the test cannot correct itself.
 Requirements: CAT require large sample size and extensive expertise. Item exposure: CATs are designed
to select the best items in the bank, and these items often become overexposed if a control algorithm is
not implemented.
 No review: CATs rarely allow for examinees to return to items already administered as the CAT has since
adapted and it cannot updated.
 Public relation: because of the complexity and the departure from the familiarity of the traditional exam
paradigm, an organization must put forth more effort into public relations, explaining CAT and the
reasons for using it.
Tests based on CAT
 Test of English as a Foreign Language (TOEFL)
 Graduate Record Examinations (GRE)
 Graduate Management Admission Test (GMAT)
 Scholastic Aptitude Test
 Microsoft’s qualifications
 Computerized Adaptive Assessment of Personality Disorder: Introducing the CAT–PD Project
Who is Using CAT?
 U.S. military has pioneered basic and applied research in CATs. One step in this research program is the
development of a computerized version of the Armed Services Vocational Aptitude Battery
(ASVAB). Administered to roughly a half million applicants each year, the paper-and-pencil version of
the ASVAB takes three hours to complete while the experimental CAT version takes about 90 minutes.
 Another test developed by military research laboratories -- the Computerized Adaptive Screening
Test (CAST) -- was implemented in 1984. CAST was the first nationwide use of CAT. This 15-minute
screening test gives prospects a quick but accurate estimate of their chances of passing the full ASVAB
and of qualifying for enlistment bonuses.
Who is Using CAT?
 Two public school systems are forerunners in using CATs in the educational arena.
 In Portland (OR) Public Schools, CATs have been well received by examinees, test administrators, and
test users.
 Montgomery County (MD) Public Schools has asked for approval from the State Board of Education to
make its mathematics and reading CATs available to students as an alternative to the state-sponsored
high school graduation examinations.
Who is Using CAT?
Smarter Balanced Assessment Consortium
http://www.smarterbalanced.org/
Partnership for Assessment of Readiness for College and Careers (PARCC)
https://parcc-assessment.org/
Businesses Involved in CAT
 Assessment Systems Corporation markets the MicroCAT system, which runs on IBM-PCs and compatibles.
MicroCAT is a complete authoring and administration system and includes routines for item analysis and
item-pool development. The Montgomery County Schools CAT program is based on MicroCAT.
 WICAT markets software to support CAT developers, a battery of 45 tests, and custom CAT computer
systems. Schools use Wicat's battery of CATs to screen and identify gifted and talented students.
 The American Institutes for Research recently completed a major revision of the Army's Computerized
Applicant Screening Test (CAST). The CAST item pool was expanded, fairness analyses were conducted, item
selection procedures were modified to increase accuracy at key points, and the feedback provided to
examinees and recruiters was significantly improved.
Businesses involved in CAT
 The Psychological Corporation markets a CAT version of the popular Differential Aptitude Test (DAT) to
junior and senior high schools. It has versions for the IBM-PC and Apple II computers.
 American College Testing Program (ACT) is working on several computerized adaptive tests. ACT is
developing training CATs for the Marine Corps and for college placement mathematics. It is also
researching the development of a multidimensional CAT.
 The Educational Testing Service is working with the College Entrance Examination Board to develop and
refine a CAT to aid in college placement. An initial version of the system is being used by about 20
colleges across the country.
CAT In Pakistan
 Measuring risk literacy: The Berlin Numeracy Test
Cokely, E.T., Galesic, M., Schulz, E., Ghazal, S., & Garcia-Retamero, R. (2012).
Measuring risk literacy: The Berlin Numeracy Test. Judgment and Decision
Making, 7, 25-47.
Resources
International Association for Computerized Adaptive Testing
http://www.iacat.org/
Concerto: Open-source CAT Platform
https://www.psychometrics.cam.ac.uk/newconcerto
References
 Weiss, D. J.; Kingsbury, G. G. (1984). "Application of computerized adaptive
testing to educational problems". Journal of Educational Measurement. 21 (4):
361–375. doi:10.1111/j.1745-3984.1984.tb01040.x.
 Thissen, D., & Mislevy, R.J. (2000). Testing Algorithms. In Wainer, H. (Ed.)
Computerized Adaptive Testing: A Primer. Mahwah, NJ: Lawrence Erlbaum
Associates.
 Kaya-Carton, E., Carton, A. S. & Dandonoli, P. (1991). Developing a computer-
adaptive test of French reading proficiency. In P. Dunkel (Ed.), Computer-
assisted language learning and testing: Research issues and practice (pp. 259-
84) .New York: Newbury House.
 Burston, J. & Monville-Burston, M. (1995). Practical design and implementation
considerations of a computer-adaptive foreign language test: The Monash/
Melbourne French, CALICO Journal, 13(1), 26-46.CAT

More Related Content

PPT
ASSESSMENT: DISCRETE POINT TEST, INTEGRATIVE TESTING, PERFORMANCE-BASED ASSES...
PDF
Cyber Psychology & Cyberbullying
PPTX
E content
PPT
Chapter 3(designing classroom language tests)
PPTX
Atomic Habits-Book Summary
PDF
EsP Grade 3 Q1.pdf
PPTX
LAS PROFESIONES EN INGLES
PPTX
Mga Uri ng Tayutay
ASSESSMENT: DISCRETE POINT TEST, INTEGRATIVE TESTING, PERFORMANCE-BASED ASSES...
Cyber Psychology & Cyberbullying
E content
Chapter 3(designing classroom language tests)
Atomic Habits-Book Summary
EsP Grade 3 Q1.pdf
LAS PROFESIONES EN INGLES
Mga Uri ng Tayutay

What's hot (20)

PPTX
Computer adaptive testing
PPTX
Achievement tests
PPTX
Characteristics of a good test
PPTX
PPT
Essay type tests
PPT
Qualities of a Good Test
PPTX
Constructing Objective and Essay Type Test
PPTX
Assembling The Test
PPTX
Test item formats: definition, types, pros and cons
PPTX
Objective and subjective type test
PPTX
Criteria of a good test
PPTX
Comparison of criterion referenced and norm referenced assessment
PPTX
Test construction
PPTX
Norm Referenced and Criterion Referenced
PPTX
Achievement test
PPT
stages of test construction
PPTX
Objective test
PPTX
Distinction among measurement, assessment and evaluation
PPTX
types of test items
PPTX
Assessment techniques
Computer adaptive testing
Achievement tests
Characteristics of a good test
Essay type tests
Qualities of a Good Test
Constructing Objective and Essay Type Test
Assembling The Test
Test item formats: definition, types, pros and cons
Objective and subjective type test
Criteria of a good test
Comparison of criterion referenced and norm referenced assessment
Test construction
Norm Referenced and Criterion Referenced
Achievement test
stages of test construction
Objective test
Distinction among measurement, assessment and evaluation
types of test items
Assessment techniques
Ad

Similar to Computer Adaptive Test (cat) (20)

PDF
An Adaptive Evaluation System to Test Student Caliber using Item Response Theory
PDF
PPTX
Presentation on supervised learning
PPT
Machine Learning presentation.
PPTX
Developing a Computerized Adaptive Test
PPT
De carlo rizk 2010 icelw
PPTX
Introduction to Computerized Adaptive Testing (CAT)
PPT
Analyzing Performance Test Data
PDF
Top 10 Data Science Practitioner Pitfalls
PPT
Ensemble Learning Featuring the Netflix Prize Competition and ...
PDF
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
PPTX
reliability and validity psychology 1234
PPT
final report (ppt)
PDF
MACHINE LEARNING TOOLBOX
PDF
Thompson (2009) -_classical_item_analysis_with_citas
PDF
De carlo rizk 2010 icelw
PDF
Ijcatr04051005
PDF
Machine Learning for Aerospace Training
PDF
Applying the pythagorean model to derive a correction factor for estimating m...
PDF
ICELW Conference Slides
An Adaptive Evaluation System to Test Student Caliber using Item Response Theory
Presentation on supervised learning
Machine Learning presentation.
Developing a Computerized Adaptive Test
De carlo rizk 2010 icelw
Introduction to Computerized Adaptive Testing (CAT)
Analyzing Performance Test Data
Top 10 Data Science Practitioner Pitfalls
Ensemble Learning Featuring the Netflix Prize Competition and ...
A New Active Learning Technique Using Furthest Nearest Neighbour Criterion fo...
reliability and validity psychology 1234
final report (ppt)
MACHINE LEARNING TOOLBOX
Thompson (2009) -_classical_item_analysis_with_citas
De carlo rizk 2010 icelw
Ijcatr04051005
Machine Learning for Aerospace Training
Applying the pythagorean model to derive a correction factor for estimating m...
ICELW Conference Slides
Ad

Recently uploaded (20)

PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PDF
O5-L3 Freight Transport Ops (International) V1.pdf
PDF
Microbial disease of the cardiovascular and lymphatic systems
PDF
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
PDF
Insiders guide to clinical Medicine.pdf
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
01-Introduction-to-Information-Management.pdf
PDF
102 student loan defaulters named and shamed – Is someone you know on the list?
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
PPTX
Lesson notes of climatology university.
PPTX
Cell Types and Its function , kingdom of life
PDF
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
Supply Chain Operations Speaking Notes -ICLT Program
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PPTX
Cell Structure & Organelles in detailed.
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
RMMM.pdf make it easy to upload and study
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
O5-L3 Freight Transport Ops (International) V1.pdf
Microbial disease of the cardiovascular and lymphatic systems
Black Hat USA 2025 - Micro ICS Summit - ICS/OT Threat Landscape
Insiders guide to clinical Medicine.pdf
human mycosis Human fungal infections are called human mycosis..pptx
01-Introduction-to-Information-Management.pdf
102 student loan defaulters named and shamed – Is someone you know on the list?
Module 4: Burden of Disease Tutorial Slides S2 2025
Physiotherapy_for_Respiratory_and_Cardiac_Problems WEBBER.pdf
Lesson notes of climatology university.
Cell Types and Its function , kingdom of life
ANTIBIOTICS.pptx.pdf………………… xxxxxxxxxxxxx
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
Supply Chain Operations Speaking Notes -ICLT Program
Microbial diseases, their pathogenesis and prophylaxis
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
Cell Structure & Organelles in detailed.
O7-L3 Supply Chain Operations - ICLT Program
RMMM.pdf make it easy to upload and study

Computer Adaptive Test (cat)

  • 2. Computers as tools in testing Generating Stimulation Test Administration Scoring Interpretation Minimize Scoring Error Saves Time
  • 3. CAPA and CAT CAPA Computer Assisted Psychological Assessment CAT Computer Adaptive Testing
  • 4. Computer Adaptive Testing?  Computerized adaptive testing (CAT) is a form of computer-based test that adapts to the examinee's ability level.  It has also been called tailored testing.  In other words, it is a form of computer-administered test in which the next item or set of items selected to be administered depends on the correctness of the test taker's responses to the most recent items administered.
  • 5. Origin of CAT  Generally, computerized testing originated in the early 1970s (Drasgow, 2002; Wainer, 1990).  Specifically, the first Computer Adaptive Test (CAT) was created by Larson and Madsen (1985) at Brigham Young University, in the USA.  Besides, After Larson and Madsen (1985), several scholars (e.g., Kaya-Carton, Carton & Dandonoli, 1991; Burston & Monville-Burston, 1995; Brown & Iwashita, 1996; Young, Shermis, Brutten & Perkins, 1996) were motivated to construct and develop more computer adapted tests throughout the 1990s.  In 1998, the Test of English as a Foreign Language (TOEFL) began to use computer-adaptive testing format
  • 6. How does CAT Works?  CAT successively selects questions for the purpose of maximizing the precision of the exam based on what is known about the examinee from previous questions.  From the examinee's perspective, the difficulty of the exam seems to tailor itself to their level of ability. For example, if an examinee performs well on an item of intermediate difficulty, they will then be presented with a more difficult question. Or, if they performed poorly, they would be presented with a simpler question.  Computer-adaptive tests require fewer test items to arrive at equally accurate scores. (Weiss and Kingsbury,1984)
  • 7. The basic computer-adaptive testing method is an iterative algorithm with the following steps:  The pool of available items is searched for the optimal item, based on the current estimate of the examinee's ability  The chosen item is presented to the examinee, who then answers it correctly or incorrectly  The ability estimate is updated, based upon all prior answers  Steps 1–3 are repeated until a termination criterion is met (Thissen & Mislevy, 2000).
  • 8. Item response theory  Item response theory is a probabilistic model that attempts to explain the response of a person to an item.  In its simplest form, item response theory posits that the probability of a random person j with ability θj answering a random item i with difficulty bi correctly is conditioned upon the ability of the person and the difficulty of the item. In other words, if a person has a high ability in a particular field, he or she will probably get an easy item correct. Conversely, if a person has a low ability and the item is difficult, he or she will probably get the item wrong.
  • 9. Item Response Theory  For example, we can expect someone with a large vocabulary to respond that they know easy words like „smile‟ and „beautiful‟ but we should not expect someone with a small vocabulary to know words like „subsidy‟ or „dissipate.‟  When we analyze item responses, we are trying to answer the question, “What is the probability of a person with a given ability responding correctly to an item with a given difficulty?”
  • 10. Components of CAT A CAT procedure consists of the following components: (a) Calibrated item pool, (b) an entry level, (c) an item selection rule, (d) a scoring method, and (e) a termination criterion. A number of options are available for each of these components in the implementation of CAT for a specific purpose.
  • 11. Calibrated Item Pool A pool of items must be available for the CAT to choose from. Such items can be created in the traditional way (i.e., manually) or through Automatic Item Generation. The pool must be calibrated with a psychometric model, which is used as a basis for the remaining four components. Typically, item response theory is employed as the psychometric model. There are no specific guidelines for the appropriate size of item pools; pools of 100 items can provide satisfactory results. What is essential to all applications of CAT is that the difficulty levels of the items in the pool must span the full range of trait levels in the population. Most efficient measurment results from using items with high discriminations.
  • 12. Entry Level In adaptive testing, it is possible to begin with items at different levels of difficulty for different students. For example, if a student's achievement level is thought to be high, testing can begin with a relatively difficult item. Because the level of difficulty of the items that are chosen will move to the student's trait level as the test progresses, an erroneous entry level will not seriously affect the results, but accurate entry levels will reduce the number of items required to achieve precise measurement. It will be shown later that the choice of entry level should depend on the purpose of the test.
  • 13. Item Selection Procedure Two efficient item selection procedures are available.--- maximum information (Weiss, 1982) and Bayesian (Owen, 1969, 1975). Both procedures involve searching the entire pool of administered items for a single item: In maximum information item selection, it is the one that provides the maximum amount of item information at the examinee's last trait level (0) estimate, and in Bayesian item selection, it is the one that minimizes the expected posterior variance of the 0 estimate. Because information and Bayesian posterior variance are related functions, in many cases a similar subset of items will be selected for a given individual (Sympson, Weiss, & Ree, 1982).
  • 14. Scoring Method After an item is administered, the CAT updates its estimate of the examinee's ability level. If the examinee answered the item correctly, the CAT will likely estimate their ability to be somewhat higher, and vice versa. This is done by using the item response function to obtain a likelihood function of the examinee's ability.
  • 15. Termination Criteria Depends primarily on purpose of the test: point estimation or classification? Point estimation: we want an accurate score for each student Classification: we do NOT need an accurate score, just a classification into pass/fail etc. Change: to see if score has gone up/down a certain amount
  • 16. Advantages of CAT Frequent retesting: the high number of permutations also enables more frequent receives a few weeks of instruction, by the time they take a CAT again their ability has increased somewhat, and they will receive a completely different test. More precision: because CATs are more efficient, the organization has the option to design the CAT to actually be more precise than a conventional test while still using fewer items. Shorter tests: CAT can reduce testing time by 50% or more. This can obviously translate into huge financial benefits. Examinee experience: A CAT will provide an appropriate challenge for each examinee. Low examinees are not discouraged or intimidated. High examinees enjoy receiving difficult items.
  • 17. Advantages (Cont.) Increased motivation: because of the better experience, there is likely an increase in examinee motivation. Low examinees feel better, and high examinees feel challenged. Both will try harder than with a conventional test. Equiprecision: CATs can be designed so that examines are all measured with the same level of precision, even though they all potentially see different items. This makes test extremely fair from a psychometric perspective. Security: because the CAT algorithm is very flexible and can adapt with potentially millions of permutations, there is much greater security than if everyone was administered the same set of 200 items.
  • 18. Disadvantages of CAT  Recovery of poor starts: CATs are susceptible to issues with examinee test anxiety, as the elimination of item review prevents someone from going back to the first few items. If they answered all those items incorrectly due to severe test anxiety, the test cannot correct itself.  Requirements: CAT require large sample size and extensive expertise. Item exposure: CATs are designed to select the best items in the bank, and these items often become overexposed if a control algorithm is not implemented.  No review: CATs rarely allow for examinees to return to items already administered as the CAT has since adapted and it cannot updated.  Public relation: because of the complexity and the departure from the familiarity of the traditional exam paradigm, an organization must put forth more effort into public relations, explaining CAT and the reasons for using it.
  • 19. Tests based on CAT  Test of English as a Foreign Language (TOEFL)  Graduate Record Examinations (GRE)  Graduate Management Admission Test (GMAT)  Scholastic Aptitude Test  Microsoft’s qualifications  Computerized Adaptive Assessment of Personality Disorder: Introducing the CAT–PD Project
  • 20. Who is Using CAT?  U.S. military has pioneered basic and applied research in CATs. One step in this research program is the development of a computerized version of the Armed Services Vocational Aptitude Battery (ASVAB). Administered to roughly a half million applicants each year, the paper-and-pencil version of the ASVAB takes three hours to complete while the experimental CAT version takes about 90 minutes.  Another test developed by military research laboratories -- the Computerized Adaptive Screening Test (CAST) -- was implemented in 1984. CAST was the first nationwide use of CAT. This 15-minute screening test gives prospects a quick but accurate estimate of their chances of passing the full ASVAB and of qualifying for enlistment bonuses.
  • 21. Who is Using CAT?  Two public school systems are forerunners in using CATs in the educational arena.  In Portland (OR) Public Schools, CATs have been well received by examinees, test administrators, and test users.  Montgomery County (MD) Public Schools has asked for approval from the State Board of Education to make its mathematics and reading CATs available to students as an alternative to the state-sponsored high school graduation examinations.
  • 22. Who is Using CAT? Smarter Balanced Assessment Consortium http://www.smarterbalanced.org/ Partnership for Assessment of Readiness for College and Careers (PARCC) https://parcc-assessment.org/
  • 23. Businesses Involved in CAT  Assessment Systems Corporation markets the MicroCAT system, which runs on IBM-PCs and compatibles. MicroCAT is a complete authoring and administration system and includes routines for item analysis and item-pool development. The Montgomery County Schools CAT program is based on MicroCAT.  WICAT markets software to support CAT developers, a battery of 45 tests, and custom CAT computer systems. Schools use Wicat's battery of CATs to screen and identify gifted and talented students.  The American Institutes for Research recently completed a major revision of the Army's Computerized Applicant Screening Test (CAST). The CAST item pool was expanded, fairness analyses were conducted, item selection procedures were modified to increase accuracy at key points, and the feedback provided to examinees and recruiters was significantly improved.
  • 24. Businesses involved in CAT  The Psychological Corporation markets a CAT version of the popular Differential Aptitude Test (DAT) to junior and senior high schools. It has versions for the IBM-PC and Apple II computers.  American College Testing Program (ACT) is working on several computerized adaptive tests. ACT is developing training CATs for the Marine Corps and for college placement mathematics. It is also researching the development of a multidimensional CAT.  The Educational Testing Service is working with the College Entrance Examination Board to develop and refine a CAT to aid in college placement. An initial version of the system is being used by about 20 colleges across the country.
  • 25. CAT In Pakistan  Measuring risk literacy: The Berlin Numeracy Test Cokely, E.T., Galesic, M., Schulz, E., Ghazal, S., & Garcia-Retamero, R. (2012). Measuring risk literacy: The Berlin Numeracy Test. Judgment and Decision Making, 7, 25-47.
  • 26. Resources International Association for Computerized Adaptive Testing http://www.iacat.org/ Concerto: Open-source CAT Platform https://www.psychometrics.cam.ac.uk/newconcerto
  • 27. References  Weiss, D. J.; Kingsbury, G. G. (1984). "Application of computerized adaptive testing to educational problems". Journal of Educational Measurement. 21 (4): 361–375. doi:10.1111/j.1745-3984.1984.tb01040.x.  Thissen, D., & Mislevy, R.J. (2000). Testing Algorithms. In Wainer, H. (Ed.) Computerized Adaptive Testing: A Primer. Mahwah, NJ: Lawrence Erlbaum Associates.  Kaya-Carton, E., Carton, A. S. & Dandonoli, P. (1991). Developing a computer- adaptive test of French reading proficiency. In P. Dunkel (Ed.), Computer- assisted language learning and testing: Research issues and practice (pp. 259- 84) .New York: Newbury House.  Burston, J. & Monville-Burston, M. (1995). Practical design and implementation considerations of a computer-adaptive foreign language test: The Monash/ Melbourne French, CALICO Journal, 13(1), 26-46.CAT