8
Most read
9
Most read
11
Most read
Principles of test design
Objectives, measurement qualities, challenges
KAYSERI, 29-31 JANUARY 2014
1.2 principles of test design: plenary CTS-Academic
Aims

! 

Definining our assessment objectives

! 

Measurement qualities: ‚test usefulness

! 

Testing challenges

! 

Considering impact
Testing: a definition

! 

“the systematic gathering of information for
the purpose of making decisions” (Weiss
1972)

! 

“the collection of reliable and relevant
information” (Bachman 1990)

! 

What information?

! 

What decisions?
Defining our test objectives

1) 

What kind of test is it going to be?

2) 

What decisions do we need information for?

3) 

What abilities are tested?

4) 

How much detail do we need?

5) 

How accurate should our results be?

6) 

Is there any washback effect?

7) 

What are the practical constraints?
Assessment in the classroom
!  examinations
!  formal
!  oral

tests

tests

!  home
!  term

assignments

reports

!  continuous

assessment

!  projectwork,
!  student

research work

self-evaluation
Purposes of tests
! 

for placement or level testing: to decide on the level of
learners

! 

for diagnostic testing: to identify individual strengths or
weaknesses

! 

to evaluate progress

! 

to evaluate skills for specific needs

! 

to give marks for performance

! 

to evaluate and update syllabus and objectives

! 

also to prepare/train for exams
Types of tests (Madsen, 1983)
Contrasting categories of ESL tests
Knowledge

Performance (or skills)

Subjective

Objective

Productive

Receptive

Language sub-skills
Norm-referenced
Discrete-point
Proficiency

Communication skills
Criterion-referenced
Integrative
Achievement (or progress)
Key principles of testing

" 

A correspondence between language test
performance and language use
◦ 

" 

“In order for a particular language test to be useful for its
intended purposes, test performance must correspond in
demonstrable ways to language use in non-test situations.”

A clear and explicit definition of qualities of test
usefulness
◦ 

“Usefulness = Reliability + Construct validity + Authenticity +
Interactiveness + Impact + Practicality”
(Bachman & Palmer, 1996)
Qualities of test usefulness:
reliability
" 

consistent across populations

" 

consistent within the same test
irrespective of:
! 

setting

! 

marker

! 

item set
When conditions of the test remain unchanged, it always produces
essentially the same results.
Qualities of test usefulness:
validity
!  Types

of validity (Alderson et al, 1995)

!  internal
!  face

validity

validity

!  content

validity

!  response

!  external

validity

validity

!  concurrent
!  predictive

!  construct
! 

validity

validity

validity

meaningful? appropriate? representative?
Construct validity
CHECKLIST FOR VALID TESTS
Tests what it claims to test?
Best format for what you want to test?
Content relevant to student’s real-life needs?
Items typical/representative of learner use?
Items don’t accidentally test other things?
Test skill, NOT knowledge, creativity or logic?
Clear and appropriate marking criteria?
Testing competence
from Language Testing in Practice by Bachman & Palmer (1996,
OUP)

Misconceptions
! 

one best test for a given
situation

! 

misunderstanding the nature
of testing and test
development

! 

unreasonable expectations

! 

blind faith in measurement

Resulting problems
! 

inappropriate tests

! 

failure to meet specific needs

! 

uninformed use of popular testing
methods

! 

frustration about not finding the
perfect test

! 

loss of faith in self ! testing can
only be done by experts

! 

defending the indefensible
(stakeholders s expectations)
Challenges in testing
" 
" 
" 
" 
" 

" 

testing = “real life”? “authentic”?
“communicative language testing”?
defining abilities in a test
selecting tasks and items ! competence?
measurement principles:
can “unidimensional scores for locally independent
test items reflect authentic language use”?
(Bachman)
outside factors ! performance?
Thank you!

zoltan.rezmuves@digimedia.hu
consonantvoiced.blogspot.com

More Related Content

PDF
Testing grammar and vocabulary.pdf
PPTX
Designing classroom language test
PPTX
Common test techniques
PPTX
Validity, reliablility, washback
PPT
Describing learners
PPTX
Testing grammar and vocabulary
PPTX
language testing: testing writing
Testing grammar and vocabulary.pdf
Designing classroom language test
Common test techniques
Validity, reliablility, washback
Describing learners
Testing grammar and vocabulary
language testing: testing writing

What's hot (20)

PPT
Testing Speaking and Writing
PPT
Reliability in Language Testing
PPTX
Language Assessment : Kinds of tests and testing
PPTX
Kinds of tests and testing
PPTX
The role of materials
PPTX
Testing Grammar and Vocabulary Skill
PPTX
Language Testing: Approaches and Techniques
PPT
Chapter 3(designing classroom language tests)
PPTX
Assessing writing
PPTX
Language Assessment - Standardized Testing by EFL Learners
PPTX
Testing writing
PPTX
Test techniques and testing overall ability
PPTX
Testing Writing and Rubrics
PPTX
Unit1(testing, assessing, and teaching)
PPTX
Language teaching methodology
PPTX
Task based syllabus
PPTX
Goals and objectives in Language Teaching
PPTX
Approaches to Course Design
PPTX
Oral Ability Testing
PPTX
Assessing listening
Testing Speaking and Writing
Reliability in Language Testing
Language Assessment : Kinds of tests and testing
Kinds of tests and testing
The role of materials
Testing Grammar and Vocabulary Skill
Language Testing: Approaches and Techniques
Chapter 3(designing classroom language tests)
Assessing writing
Language Assessment - Standardized Testing by EFL Learners
Testing writing
Test techniques and testing overall ability
Testing Writing and Rubrics
Unit1(testing, assessing, and teaching)
Language teaching methodology
Task based syllabus
Goals and objectives in Language Teaching
Approaches to Course Design
Oral Ability Testing
Assessing listening
Ad

Viewers also liked (20)

PDF
2.2.test design test writing: plenary CTS-Academic
PPT
40 Hot Tips To Better Position Your Education In A Down Economy
PPT
A3 slideshow
PDF
Comparative Chart (Brown - Bachman)
PPT
SWAN/SIOC: Aligning Scientific Discourse Representation and Social Semantics
PPT
Media & Generations: use of media in generational discourse
PDF
A Media Frame and Political Discourse Analysis of the Lehman Shock in Japan -...
PPT
Culture of Adversarialism
PPTX
What is-discourse (2)
PDF
Online version 20151003 main issues in language testing
PDF
Bachman style
PPTX
Testing backto basics
PDF
Xác trị slide 1 - validation basics
PDF
Glossary of L2 evaluation terms
PPTX
Ideology and textbooks
PPTX
Qualitative research
PDF
Some Reflections on Task-based Language Performance Assessment
PPTX
Communicative Testing
PPT
communicative languag
PPTX
What is the difference and similarity between qualitative and quantitative re...
2.2.test design test writing: plenary CTS-Academic
40 Hot Tips To Better Position Your Education In A Down Economy
A3 slideshow
Comparative Chart (Brown - Bachman)
SWAN/SIOC: Aligning Scientific Discourse Representation and Social Semantics
Media & Generations: use of media in generational discourse
A Media Frame and Political Discourse Analysis of the Lehman Shock in Japan -...
Culture of Adversarialism
What is-discourse (2)
Online version 20151003 main issues in language testing
Bachman style
Testing backto basics
Xác trị slide 1 - validation basics
Glossary of L2 evaluation terms
Ideology and textbooks
Qualitative research
Some Reflections on Task-based Language Performance Assessment
Communicative Testing
communicative languag
What is the difference and similarity between qualitative and quantitative re...
Ad

Similar to 1.2 principles of test design: plenary CTS-Academic (20)

PDF
3.2 test development cycle: plenary CTS-Academic
PPTX
Presentaion about Requirements of a good test.ptx
PPT
Applied linguistics: Assessment for language teachers
PPTX
7.1 assessment and the cefr (1)
PPTX
SEMINAR ON- TOOLS AND TECHNIQUES OF ASSESSMENT-Sibi (1).pptx
PPTX
01 introducción a la evaluación del aprendizaje de idiomas
PPTX
topic_3 Language Assessment (Principles of Assessment).pptx
PPTX
3 basic-principles_of_assessment
PPT
Evaluating a Good Language Test
PPTX
principle and types of assessment 12.pptx
PPTX
CPUT Formal Assessment Online Workshop March 2018
PPTX
CRITERIA AND VALIDITY PRESENTATIONS.pptx
PPT
lecture2_20111.ppt
PPT
A presentation on Principles of Assessment.ppt
PPTX
7.1 assessment and the cefr (1)
PPTX
Introdcution to Language Assessment including with its aspects.
PPTX
Test appraisal
PPTX
ELTLAE Group 2.pptx
PPTX
Principles of Language Assessment (Aranda & Lingcallo).pptx
PPTX
Validation
3.2 test development cycle: plenary CTS-Academic
Presentaion about Requirements of a good test.ptx
Applied linguistics: Assessment for language teachers
7.1 assessment and the cefr (1)
SEMINAR ON- TOOLS AND TECHNIQUES OF ASSESSMENT-Sibi (1).pptx
01 introducción a la evaluación del aprendizaje de idiomas
topic_3 Language Assessment (Principles of Assessment).pptx
3 basic-principles_of_assessment
Evaluating a Good Language Test
principle and types of assessment 12.pptx
CPUT Formal Assessment Online Workshop March 2018
CRITERIA AND VALIDITY PRESENTATIONS.pptx
lecture2_20111.ppt
A presentation on Principles of Assessment.ppt
7.1 assessment and the cefr (1)
Introdcution to Language Assessment including with its aspects.
Test appraisal
ELTLAE Group 2.pptx
Principles of Language Assessment (Aranda & Lingcallo).pptx
Validation

More from SeltAcademy (18)

PPT
CTS-Academic: Module 2 session 9 cognitive processes
PPT
CTS-Academic: Module 2 session 8 psycholinguistics
PPT
CTS-Academic: Module 2 session 6 classroom sla
PPT
CTS-Academic: Module 2 session 5 sla research
PPT
CTS-Academic: Module 2 session 3 theories of language learning
PPT
CTS-Academic: Module 2 session 2 theories of language learning
PPT
CTS-Academic: Module 2 session 1 theories of language
PPT
CTS-Academic: Module 2 session 10 lesson shapes
PDF
1.3 tests of grammar and vocabulary: workshop CTS-Academic
PDF
3.1 curriculum and assessment: plenary CTS-Academic
PDF
2.3 tests of receptive skills: workshop CTS-Academic
PDF
3.3 tests of productive skills: workshop CTS-Academic
PDF
2.1 applying standards to testing: plenary CTS-Academic
PDF
1.1 language assessment in Turkey: plenary CTS-Academic
PPTX
Part 3 great opportunities great expectations
PPTX
Part 2 great opportunities great expectations
PPTX
Part 1 great opportunities great expectations
PPTX
Types of stories for children
CTS-Academic: Module 2 session 9 cognitive processes
CTS-Academic: Module 2 session 8 psycholinguistics
CTS-Academic: Module 2 session 6 classroom sla
CTS-Academic: Module 2 session 5 sla research
CTS-Academic: Module 2 session 3 theories of language learning
CTS-Academic: Module 2 session 2 theories of language learning
CTS-Academic: Module 2 session 1 theories of language
CTS-Academic: Module 2 session 10 lesson shapes
1.3 tests of grammar and vocabulary: workshop CTS-Academic
3.1 curriculum and assessment: plenary CTS-Academic
2.3 tests of receptive skills: workshop CTS-Academic
3.3 tests of productive skills: workshop CTS-Academic
2.1 applying standards to testing: plenary CTS-Academic
1.1 language assessment in Turkey: plenary CTS-Academic
Part 3 great opportunities great expectations
Part 2 great opportunities great expectations
Part 1 great opportunities great expectations
Types of stories for children

Recently uploaded (20)

PDF
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
PPTX
A powerpoint presentation on the Revised K-10 Science Shaping Paper
PPTX
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
PDF
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
PPTX
20th Century Theater, Methods, History.pptx
PDF
Paper A Mock Exam 9_ Attempt review.pdf.
PDF
FORM 1 BIOLOGY MIND MAPS and their schemes
PPTX
Introduction to pro and eukaryotes and differences.pptx
PDF
Environmental Education MCQ BD2EE - Share Source.pdf
PPTX
Computer Architecture Input Output Memory.pptx
PDF
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
PDF
HVAC Specification 2024 according to central public works department
DOC
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
PDF
What if we spent less time fighting change, and more time building what’s rig...
PDF
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 2).pdf
PDF
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
PDF
International_Financial_Reporting_Standa.pdf
PPTX
B.Sc. DS Unit 2 Software Engineering.pptx
PDF
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
PPTX
Share_Module_2_Power_conflict_and_negotiation.pptx
Τίμαιος είναι φιλοσοφικός διάλογος του Πλάτωνα
A powerpoint presentation on the Revised K-10 Science Shaping Paper
ELIAS-SEZIURE AND EPilepsy semmioan session.pptx
ChatGPT for Dummies - Pam Baker Ccesa007.pdf
20th Century Theater, Methods, History.pptx
Paper A Mock Exam 9_ Attempt review.pdf.
FORM 1 BIOLOGY MIND MAPS and their schemes
Introduction to pro and eukaryotes and differences.pptx
Environmental Education MCQ BD2EE - Share Source.pdf
Computer Architecture Input Output Memory.pptx
احياء السادس العلمي - الفصل الثالث (التكاثر) منهج متميزين/كلية بغداد/موهوبين
HVAC Specification 2024 according to central public works department
Soft-furnishing-By-Architect-A.F.M.Mohiuddin-Akhand.doc
What if we spent less time fighting change, and more time building what’s rig...
BP 704 T. NOVEL DRUG DELIVERY SYSTEMS (UNIT 2).pdf
A GUIDE TO GENETICS FOR UNDERGRADUATE MEDICAL STUDENTS
International_Financial_Reporting_Standa.pdf
B.Sc. DS Unit 2 Software Engineering.pptx
1.3 FINAL REVISED K-10 PE and Health CG 2023 Grades 4-10 (1).pdf
Share_Module_2_Power_conflict_and_negotiation.pptx

1.2 principles of test design: plenary CTS-Academic

  • 1. Principles of test design Objectives, measurement qualities, challenges KAYSERI, 29-31 JANUARY 2014
  • 3. Aims !  Definining our assessment objectives !  Measurement qualities: ‚test usefulness !  Testing challenges !  Considering impact
  • 4. Testing: a definition !  “the systematic gathering of information for the purpose of making decisions” (Weiss 1972) !  “the collection of reliable and relevant information” (Bachman 1990) !  What information? !  What decisions?
  • 5. Defining our test objectives 1)  What kind of test is it going to be? 2)  What decisions do we need information for? 3)  What abilities are tested? 4)  How much detail do we need? 5)  How accurate should our results be? 6)  Is there any washback effect? 7)  What are the practical constraints?
  • 6. Assessment in the classroom !  examinations !  formal !  oral tests tests !  home !  term assignments reports !  continuous assessment !  projectwork, !  student research work self-evaluation
  • 7. Purposes of tests !  for placement or level testing: to decide on the level of learners !  for diagnostic testing: to identify individual strengths or weaknesses !  to evaluate progress !  to evaluate skills for specific needs !  to give marks for performance !  to evaluate and update syllabus and objectives !  also to prepare/train for exams
  • 8. Types of tests (Madsen, 1983) Contrasting categories of ESL tests Knowledge Performance (or skills) Subjective Objective Productive Receptive Language sub-skills Norm-referenced Discrete-point Proficiency Communication skills Criterion-referenced Integrative Achievement (or progress)
  • 9. Key principles of testing "  A correspondence between language test performance and language use ◦  "  “In order for a particular language test to be useful for its intended purposes, test performance must correspond in demonstrable ways to language use in non-test situations.” A clear and explicit definition of qualities of test usefulness ◦  “Usefulness = Reliability + Construct validity + Authenticity + Interactiveness + Impact + Practicality” (Bachman & Palmer, 1996)
  • 10. Qualities of test usefulness: reliability "  consistent across populations "  consistent within the same test irrespective of: !  setting !  marker !  item set When conditions of the test remain unchanged, it always produces essentially the same results.
  • 11. Qualities of test usefulness: validity !  Types of validity (Alderson et al, 1995) !  internal !  face validity validity !  content validity !  response !  external validity validity !  concurrent !  predictive !  construct !  validity validity validity meaningful? appropriate? representative?
  • 12. Construct validity CHECKLIST FOR VALID TESTS Tests what it claims to test? Best format for what you want to test? Content relevant to student’s real-life needs? Items typical/representative of learner use? Items don’t accidentally test other things? Test skill, NOT knowledge, creativity or logic? Clear and appropriate marking criteria?
  • 13. Testing competence from Language Testing in Practice by Bachman & Palmer (1996, OUP) Misconceptions !  one best test for a given situation !  misunderstanding the nature of testing and test development !  unreasonable expectations !  blind faith in measurement Resulting problems !  inappropriate tests !  failure to meet specific needs !  uninformed use of popular testing methods !  frustration about not finding the perfect test !  loss of faith in self ! testing can only be done by experts !  defending the indefensible (stakeholders s expectations)
  • 14. Challenges in testing "  "  "  "  "  "  testing = “real life”? “authentic”? “communicative language testing”? defining abilities in a test selecting tasks and items ! competence? measurement principles: can “unidimensional scores for locally independent test items reflect authentic language use”? (Bachman) outside factors ! performance?