BY
M. SANTHOSH KUMAR
III EEE
MAIN CONTENTS
 INTRODUCTION
 19TH CENTURY FOUNDATION
 VICTORIAN STREAM
 GERMAN STREAM
 20TH CENTURY
 DEFINITION OF MEASUREMENT IN THE SOCIAL SCIENCES
 KEY CONCEPTS
INTRODUCTION
 Psychometrics is the field concerned with the theory
and technique of psychological measurement.
 It includes the measurement of knowledge, abilities,
attitudes, personality traits, and educational achievement.
 It has two major research tasks:
1. The construction of instruments and procedures for
measurement.
2. The development and refinement of theoretical approaches
to measurement.
 People who practice psychometrics are known as
psychometricians.
 All psychometricians possess a specific qualification.
 The field is primarily concerned with the construction and
validation of measurement instruments.
 These instruments include questionnaires, tests, and
personality assessments.
19th century foundation
 Psychological testing has come from two streams of thought.
 The first is from Darwin, Galton, and Cattell, on the
measurement of individual differences.
 The second one is from Herbart, Weber, Fechner and
Wundt and their psychophysical measurements of a similar
construct.
 The second set of individuals and their research led to the
development of psychology and standardized testing.
 From the points above we can conclude that these two
streams mark the beginning of psychology and of
psychological testing.
1. VICTORIAN STREAM
 Charles Darwin was the inspiration behind Sir Francis
Galton, whose work led to the creation of psychometrics.
 In 1859, Charles Darwin published his book “On the Origin of
Species”, which pertained to individual differences in
animals.
 The book discussed how individual members of a species
differ and how some are more adaptive than others.
 The more adaptive members are the ones that survive and
give rise to the next generation.
 This idea of studying animals led to Galton's interest in
studying human beings and how they differ from one
another, and importantly, how to measure those
differences.
 Galton wrote a book entitled “Hereditary Genius” about
different characteristics that people possess and how those
characteristics make them more “fit” than others.
 Today these differences, such as sensory and motor
functioning (reaction time, visual acuity, and physical
strength), are important domains of scientific psychology.
 Much of the theoretical and applied work in psychometrics
was undertaken in an attempt to measure intelligence.
 Francis Galton is often called “the father of psychometrics”.
2. GERMAN STREAM
 The origin of psychometrics also has connections to the
related field of psychophysics.
 Around the same time that Darwin, Galton, and Cattell
were making their discoveries, J.E. Herbart was also
interested in "unlocking the mysteries of human
consciousness" through the scientific method
(Kaplan & Saccuzzo, 2010).
 Herbart was responsible for creating mathematical models
of the mind, which were influential in educational
practices in years to come.
 Following Herbart, E.H. Weber built upon Herbart's work
and tried to prove the existence of a psychological
threshold, saying that a minimum stimulus was necessary
to activate a sensory system.
 After Weber, G.T. Fechner expanded upon the knowledge
he gleaned from Herbart and Weber, to devise the law that
the strength of a sensation grows as the logarithm of the
stimulus intensity.
 A follower of Weber and Fechner, Wilhelm Wundt is
credited with founding the science of psychology.
 It is Wundt's influence that paved the way for others
to develop psychological testing.
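Fechner's law, stated in words above, is commonly written in the form below. The symbols are notation assumed here and do not come from the slides: S is the perceived strength of the sensation, I the stimulus intensity, I_0 the threshold intensity (the minimum stimulus in Weber's sense), and k a constant that depends on the sensory modality.

```latex
S = k \,\ln\!\left(\frac{I}{I_0}\right), \qquad I \ge I_0
```

In this form, doubling the stimulus intensity adds the constant amount k ln 2 to the sensation rather than doubling it.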
20th century
 The psychometrician L. L. Thurstone, founder and first
president of the Psychometric Society in 1936, developed
and applied a theoretical approach to measurement
referred to as the law of comparative judgment, an
approach that has close connections to the
psychophysical theory of Ernst Heinrich Weber
and Gustav Fechner.
 In addition, Spearman and Thurstone both made
important contributions to the theory and application
of factor analysis, a statistical method developed and
used extensively in psychometrics.
 More recently, psychometric theory has been applied in
the measurement of personality, attitudes, and beliefs,
and academic achievement.
 Measurement of these unobservable phenomena is
difficult, and much of the research and accumulated
science in this discipline has been developed in an
attempt to properly define and quantify such
phenomena.
 Critics, including practitioners in the physical
sciences and social activists, have argued that such
definition and quantification is impossibly difficult,
and that such measurements are often misused, such as
with psychometric personality tests used in
employment procedures.
 In the late 1950s, Leopold Szondi made a
historical and epistemological assessment of the
impact of statistical thinking on psychology
during the previous few decades: “in the last decades,
the specifically psychological thinking has been
almost completely suppressed and removed, and
replaced by a statistical thinking. Precisely here we
see the cancer of testology and testomania of today.”
Definition of measurement in the social sciences
 The definition of measurement in the social sciences has
a long history.
 A currently widespread definition, proposed by Stanley
Smith Stevens (1946), is that measurement is "the
assignment of numerals to objects or events according to
some rule."
 This definition was introduced in the paper in which
Stevens proposed four levels of measurement.
 Although widely adopted, this definition differs in
important respects from the more classical definition of
measurement adopted in the physical sciences, which is
that measurement is the estimation and expression of the
magnitude of one quantity relative to another
(Michell, 1997).
 Indeed, Stevens's definition of measurement was
put forward in response to the British Ferguson
Committee, whose chair, A. Ferguson, was a
physicist.
 The committee was appointed in 1932 by the
British Association for the Advancement of
Science to investigate the possibility of
quantitatively estimating sensory events.
 Although its chair and other members were
physicists, the committee also included several
psychologists.
 The committee's report highlighted the
importance of the definition of measurement.
 While Stevens's response was to propose a new
definition, which has had considerable influence in
the field, this was by no means the only response to
the report.
 Another, notably different, response was to accept
the classical definition, as reflected in the following
statement by Reese:
 "Measurement in psychology and physics are in no
sense different. Physicists can measure when they can
find the operations by which they may meet the necessary
criteria; psychologists have but to do the same. They
need not worry about the mysterious differences between
the meaning of measurement in the two sciences."
 These divergent responses are reflected in
alternative approaches to measurement.
 For example, methods based on covariance
matrices are typically employed on the premise
that numbers, such as raw scores derived from
assessments, are measurements.
 Such approaches implicitly entail Stevens's
definition of measurement, which requires only
that numbers are assigned according to some
rule.
 The main research task, then, is generally
considered to be the discovery of associations
between scores, and of factors posited to
underlie such associations.
 On the other hand, when measurement models
such as the Rasch model are employed, numbers
are not assigned based on a rule.
 Instead, in keeping with Reese's statement above,
specific criteria for measurement are stated, and
the goal is to construct procedures or operations
that provide data that meet the relevant criteria.
 Measurements are estimated based on the
models, and tests are conducted to ascertain
whether the relevant criteria have been met.
Instruments and procedures
 The first psychometric instruments were designed to
measure the concept of intelligence.
 The best known historical approach involved
the Stanford-Binet IQ test, developed originally by the
French psychologist Alfred Binet.
 Intelligence tests are useful tools for various purposes.
 An alternative conception of intelligence is that
cognitive capacities within individuals are a
manifestation of a general component, or general
intelligence factor, as well as cognitive capacity specific
to a given domain.
 Psychometrics is applied widely in educational assessment
to measure abilities in domains such as reading, writing,
and mathematics.
 The main approaches in applying tests in these domains
have been Classical Test Theory and the more recent Item
Response Theory and Rasch measurement models.
 These latter approaches permit joint scaling of persons
and assessment items, which provides a basis for mapping
of developmental continua by allowing descriptions of the
skills displayed at various points along a continuum.
 Such approaches provide powerful information regarding
the nature of developmental growth within various
domains.
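The joint scaling of persons and items described above can be illustrated with the Rasch model's response function, which places person ability and item difficulty on the same logit scale. This is a minimal sketch: the function name, the item labels, and all numeric values are hypothetical and chosen only for illustration.

```python
import math


def rasch_probability(ability: float, difficulty: float) -> float:
    """Probability of a correct response under the Rasch model.

    Both arguments are on the same logit scale, so only their
    difference matters.
    """
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


# Hypothetical mathematics items with assumed difficulties (logits).
items = {
    "add two-digit numbers": -1.0,
    "solve a linear equation": 0.5,
    "factor a quadratic": 2.0,
}

# Hypothetical persons with assumed abilities (logits).
persons = {"student_a": -0.5, "student_b": 1.5}

for person, ability in persons.items():
    for item, difficulty in items.items():
        p = rasch_probability(ability, difficulty)
        print(f"{person:10s} {item:25s} P(correct) = {p:.2f}")
```

Because persons and items share one scale, a location on the continuum can be described by the items a person at that location is likely to answer correctly.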
 Another major focus in psychometrics has been
on personality testing.
 There have been a range of theoretical approaches to
conceptualizing and measuring personality.
 Some of the better known instruments include
the Minnesota Multiphasic Personality Inventory,
the Five-Factor Model (or "Big 5"), and tools such
as the Personality and Preference Inventory and the
Myers-Briggs Type Indicator.
 Attitudes have also been studied extensively using
psychometric approaches.
 A common method in the measurement of attitudes is
the use of the Likert scale.
 An alternative method involves the application of
unfolding measurement models, the most general being
the Hyperbolic Cosine Model (Andrich & Luo, 1993).
Key concepts
 Key concepts in classical test theory
are reliability and validity.
 A reliable measure is one that measures a construct
consistently across time, individuals, and situations.
 A valid measure is one that measures what it is intended
to measure.
 Reliability is necessary, but not sufficient, for validity.
 Both reliability and validity can be assessed statistically.
 Consistency over repeated measures of the same test can
be assessed with the Pearson correlation coefficient, and
is often called test-retest reliability.
 Similarly, the equivalence of different versions of the
same measure can be indexed by a Pearson correlation,
and is called equivalent forms reliability or a similar term.
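Test-retest and equivalent-forms reliability, as described above, both reduce to a Pearson correlation between two sets of scores from the same people. The sketch below computes it from first principles; the score lists are made-up illustration data, not results from any real test.

```python
import math


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two lists of equal length with at least 2 scores")
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    if ss_x == 0 or ss_y == 0:
        raise ValueError("correlation is undefined when a list has no variance")
    return cross / math.sqrt(ss_x * ss_y)


# Hypothetical scores of six people on two administrations of one test.
first_administration = [12, 15, 9, 20, 17, 11]
second_administration = [13, 14, 10, 19, 18, 10]

test_retest_reliability = pearson_r(first_administration, second_administration)
print(f"test-retest reliability: {test_retest_reliability:.3f}")
```

For equivalent-forms reliability, the second list would hold scores on the alternate form instead of a repeated administration.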
 Internal consistency, which addresses the homogeneity
of a single test form, may be assessed by correlating
performance on two halves of a test, which is
termed split-half reliability; the value of this Pearson
product-moment correlation coefficient for two
half-tests is adjusted with the Spearman–Brown prediction
formula to correspond to the correlation between two
full-length tests.
 Perhaps the most commonly used index of reliability
is Cronbach's α, which under certain assumptions is
equivalent to the mean of all possible split-half
coefficients.
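The two internal-consistency indices above can be sketched as follows. The odd/even split, the function names, and the response matrix are assumptions made for illustration; other ways of halving a test are equally valid and generally give somewhat different split-half values.

```python
import math
from statistics import variance


def pearson_r(x, y):
    """Pearson product-moment correlation between two equal-length lists."""
    mean_x = sum(x) / len(x)
    mean_y = sum(y) / len(y)
    cross = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    ss_x = sum((a - mean_x) ** 2 for a in x)
    ss_y = sum((b - mean_y) ** 2 for b in y)
    return cross / math.sqrt(ss_x * ss_y)


def spearman_brown(r, length_factor=2.0):
    """Predicted reliability when test length is multiplied by length_factor."""
    return length_factor * r / (1.0 + (length_factor - 1.0) * r)


def split_half_reliability(scores):
    """Odd/even split-half reliability, stepped up to full test length."""
    odd_half = [sum(row[0::2]) for row in scores]
    even_half = [sum(row[1::2]) for row in scores]
    return spearman_brown(pearson_r(odd_half, even_half))


def cronbach_alpha(scores):
    """Cronbach's alpha from a persons-by-items score matrix."""
    k = len(scores[0])
    item_variances = [variance(column) for column in zip(*scores)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_variances) / total_variance)


# Hypothetical responses: 6 persons (rows) by 4 items (columns).
responses = [
    [4, 5, 4, 5],
    [2, 3, 2, 2],
    [3, 3, 4, 3],
    [5, 4, 5, 5],
    [1, 2, 1, 2],
    [3, 4, 3, 3],
]

print(f"split-half (Spearman-Brown): {split_half_reliability(responses):.3f}")
print(f"Cronbach's alpha:            {cronbach_alpha(responses):.3f}")
```

With `length_factor=2.0` the Spearman–Brown step reduces to 2r / (1 + r), the usual correction from a half-test correlation to a full-length test.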
 Other approaches include the intra-class correlation,
which is the ratio of variance of measurements of a given
target to the variance of all targets.
 There are a number of different forms of validity.
 Criterion-related validity can be assessed by correlating a
measure with a criterion measure known to be valid.
 When the criterion measure is collected at the same
time as the measure being validated, the goal is to
establish concurrent validity; when the criterion is
collected later, the goal is to establish predictive validity.
 A measure has construct validity if it is related to
measures of other constructs as required by theory.
 Content validity is a demonstration that the items
of a test are drawn from the domain being measured.
 In a personnel selection example, test content is based
on a defined statement or set of statements of
knowledge, skill, ability, or other characteristics obtained
from a job analysis.
 Item response theory models the relationship
between latent traits and responses to test items.
 Among other advantages, IRT provides a basis for
obtaining an estimate of the location of a test-taker on a
given latent trait as well as the standard error of
measurement of that location.
 For example, a university student's knowledge of history
can be deduced from his or her score on a university test
and then be compared reliably with a high school
student's knowledge deduced from a less difficult test.
 Scores derived by classical test theory do not have this
characteristic, and actual ability (rather than ability
relative to other test-takers) must be assessed by
comparing scores to those of a "norm group"
randomly selected from the population.
 In fact, all measures derived from classical test theory are
dependent on the sample tested, while, in principle,
those derived from item response theory are not.
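The university versus high school comparison above can be sketched with the Rasch model, one of the simplest item response models. The sketch assumes the item difficulties are already known from a prior calibration; every number below is hypothetical. Ability is estimated by maximum likelihood with Newton-Raphson steps, and the standard error is taken from the test information at the estimate.

```python
import math


def rasch_probability(ability, difficulty):
    """Probability of a correct response under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


def estimate_ability(responses, difficulties, max_iterations=50, tolerance=1e-10):
    """Maximum-likelihood ability estimate and its standard error.

    responses    -- list of 0/1 item scores
    difficulties -- item difficulties in logits (assumed known)
    """
    raw_score = sum(responses)
    if raw_score == 0 or raw_score == len(responses):
        raise ValueError("no finite estimate for all-wrong or all-right patterns")
    theta = 0.0
    for _ in range(max_iterations):
        probs = [rasch_probability(theta, b) for b in difficulties]
        gradient = sum(x - p for x, p in zip(responses, probs))
        information = sum(p * (1.0 - p) for p in probs)
        # Newton-Raphson step, capped to keep early iterations stable.
        step = max(-1.0, min(1.0, gradient / information))
        theta += step
        if abs(step) < tolerance:
            break
    probs = [rasch_probability(theta, b) for b in difficulties]
    information = sum(p * (1.0 - p) for p in probs)
    return theta, 1.0 / math.sqrt(information)


# Hypothetical calibrated difficulties: the university test is harder.
university_items = [0.5, 1.0, 1.5, 2.0, 2.5]
high_school_items = [-2.0, -1.5, -1.0, -0.5, 0.0]

# Both students answer 3 of 5 items correctly on their own test.
pattern = [1, 1, 1, 0, 0]

university_theta, university_se = estimate_ability(pattern, university_items)
high_school_theta, high_school_se = estimate_ability(pattern, high_school_items)

print(f"university student:  ability = {university_theta:.2f} (SE {university_se:.2f})")
print(f"high school student: ability = {high_school_theta:.2f} (SE {high_school_se:.2f})")
```

The two raw scores are identical, yet the estimates land at different points on the common scale because the model accounts for item difficulty. That is the sense in which scores from tests of different difficulty become comparable.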
Standards of quality
 The considerations of validity and reliability typically are
viewed as essential elements for determining
the quality of any test.
 However, professional and practitioner associations
frequently have placed these concerns within broader
contexts when developing standards and making overall
judgments about the quality of any test as a whole within
a given context.
 A consideration of concern in many applied research
settings is whether or not the metric of a given
psychological inventory is meaningful or arbitrary.
END

Psychometrics ppt
