

A" Research Methods Reliability and validity

  • 1. Validity! We need to find out if our research is sound. Do our tests measure what they claim to measure?
  • 2. Are the techniques used to collect data in tests, questionnaires, interviews and observations measuring what is claimed? For example, was the Strange Situation really measuring attachment style?
  • 3. Reliability: we need to be able to measure or observe something time after time and produce the same or similar results.
  • 4. I want to measure intelligence. If the same person sits the test on several occasions and the results change each time, then that test lacks reliability.
  • 5. The test also arguably lacks validity, because the scores are meaningless.
  • 6. If I test my participants again several months later and their scores remain consistent, I can say the test is reliable, but it might still lack validity.
  • 7. Is an A level in Psychology a valid and reliable assessment of your performance in Psychology?
  • 8. External reliability: this measures consistency from one occasion to another. The same result should be found on different days, in different labs, observations or interviews, and by different researchers. (Cartoon) “I exposed these teenage brain cells to 1000 PowerPoint slides last Monday and they’re all dead.” “I thought that was a fluke, but they seem to be shrivelling after only five minutes!”
  • 9. Test-retest: participants take the same test on different occasions. A high correlation between test scores indicates the test has good external reliability. Timing is crucial. Why? (Cartoon: January, then June) “I hope that’s the right answer this time.”
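The test-retest check on slide 9 can be sketched as a correlation between two sittings of the same test. This is a minimal illustration rather than a prescribed procedure: the scores are invented, and the names `pearson_r`, `january_scores` and `june_scores` are hypothetical.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length score lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two equal-length lists with at least two scores")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    # Sum of cross-products and sums of squares of the deviations from the mean
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined when all scores are identical")
    return cov / sqrt(var_x * var_y)


# Hypothetical scores for the same five participants on two occasions
january_scores = [12, 15, 9, 20, 17]
june_scores = [13, 14, 10, 19, 18]

test_retest_r = pearson_r(january_scores, june_scores)
print(f"test-retest r = {test_retest_r:.2f}")
```

A coefficient close to +1 suggests that participants kept roughly the same rank order across the two sittings. What counts as "high enough" is a judgement that the slides leave open.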
  • 10. This refers to the consistency of a researcher’s behaviour. A researcher should produce similar test results, make similar observations or carry out interviews in the same way on more than one occasion. (Cartoon) “Thanks for taking part today. Any problems and I’ll be right over. Take your time.” “Right. Let’s get on. Fast as you can.” “How much longer before I can get in the pub and relax my facial muscles?”
  • 11. In observational studies this is known as inter-observer reliability: observers have to agree on what they see and carry out the same procedure. Consistency between different researchers working on the same study is very important for reliability.
  • 12. Improving reliability: (1) standardise instructions; (2) carry out a pilot study to improve procedures and materials; (3) train researchers thoroughly in the use of materials and procedures before the study takes place.
  • 13. Internal reliability: this measures the extent to which a test or procedure is consistent within itself, i.e. questionnaire items or interview questions should all be measuring the same thing. (Cartoon) “Do you like to keep to deadlines?” “Do you get impatient driving?” “Do you like cheese?” “Do you like doing several tasks at once?” “Do you like chocolate?” “Do you get easily irritated?” “Are you competitive?” This interviewer seems a little confused about Type A personality traits.
  • 14. Split-half method (odds/evens or top/bottom): compares a participant’s performance on two halves of a test or questionnaire. There should be a close correlation between scores on both halves of the test. Questions in both halves should be of equal quality for good internal reliability.
  • 15. Would you see this as bullying or horseplay in the playground? You would see this from your own subjective viewpoint: we’re biased by experience and expectation. Observers must agree about what they are observing, so they need to use standardised behavioural categories.
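One simple way to check whether two observers agree is to have both code the same events with the same behavioural categories and count the proportion of matching codes. The sketch below assumes two hypothetical observers and invented codings. Percent agreement is only one possible index and does not correct for agreement that would occur by chance.

```python
def percent_agreement(observer_a, observer_b):
    """Proportion of events that two observers placed in the same category."""
    if len(observer_a) != len(observer_b) or not observer_a:
        raise ValueError("both observers must code the same non-empty set of events")
    matches = sum(a == b for a, b in zip(observer_a, observer_b))
    return matches / len(observer_a)


# Hypothetical codings of the same eight playground incidents
observer_a = ["bullying", "horseplay", "horseplay", "bullying",
              "horseplay", "bullying", "horseplay", "horseplay"]
observer_b = ["bullying", "horseplay", "bullying", "bullying",
              "horseplay", "bullying", "horseplay", "horseplay"]

agreement = percent_agreement(observer_a, observer_b)
print(f"inter-observer agreement = {agreement:.0%}")
```

Here the observers disagree on one incident out of eight. A low figure would suggest the behavioural categories need tighter definitions or the observers need more training.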
  • 16. Measuring reliability: match each method of estimating reliability to its description and conclusion. Methods: test-retest reliability; split-half reliability; inter-rater reliability. Descriptions: (a) if the measure depends upon interpretation of behaviour, we can compare the results from two or more raters; (b) splitting a test into two halves and comparing the scores in both halves; (c) the measure is administered to the same group of people twice. Conclusions: (d) if the results in the two halves are similar, we can assume the test is reliable; (e) if the results on the two tests are similar, we can assume the test is reliable; (f) if there is high agreement between the raters, the measure is reliable.
  • 17. Internal validity = the tool is measuring what it is intended to measure. External validity = the findings can be generalised beyond the context of the research situation.
  • 18. Face validity: does our measuring tool appear to be doing what it should? One or more judges assess whether the test seems appropriate and suggest changes if necessary.
  • 19. Content validity: does the content of a test cover everything in the area of interest? This is more rigorous: experts in the field systematically examine the tool’s components and compare them with set standards. They have to agree that the content is appropriate.
  • 20. Improving internal validity: (1) single blind procedure, which reduces demand characteristics; (2) double blind procedure ….
  • 21. Population validity: can we generalise findings from our research participants to other population groups?
  • 22. Ecological validity: can we apply our findings to other contexts and situations outside of the research setting?
  • 23. Improving external validity: (1) the sample must be representative of the target population and be unbiased….. (2) the research situation must reflect a real-life situation, e.g. debate over Milgram….Strange Situation