SlideShare a Scribd company logo
SAMPLING
SAMPLING
METHODS
METHODS
1
LEARNING OBJECTIVES
LEARNING OBJECTIVES
ď‚— Learn the reasons for sampling
ď‚— Develop an understanding about
different sampling methods
ď‚— Distinguish between probability & non
probability sampling
ď‚— Discuss the relative advantages &
disadvantages of each sampling
methods
2
SAMPLING
SAMPLING
 A sample is “a smaller (but hopefully
representative) collection of units from a population
used to determine truths about that population”
ď‚— Why sample?
â—¦ Resources (time, money) and workload
â—¦ Gives results with known accuracy that can be
calculated mathematically
ď‚— The sampling frame is the list from which the
potential respondents are drawn
◦ Registrar’s office
â—¦ Class rosters
3
SAMPLING……
SAMPLING……
ď‚— What is your population of interest?
ď‚– To whom do you want to generalize your results?
ď‚– All doctors
ď‚– School children
ď‚– Indians
ď‚– Women aged 15-45 years
ď‚– Other
ď‚— Can you sample the entire population?
4
SAMPLING
SAMPLING…….
…….
ď‚— 3 factors that influence sample representative-ness
ď‚– Sampling procedure
ď‚– Sample size
ď‚– Participation (response)
ď‚— When might you sample the entire population?
ď‚– When your population is very small
ď‚– When you have extensive resources
 When you don’t expect a very high response
5
6
SAMPLING BREAKDOWN
SAMPLING
SAMPLING…….
…….
TARGET POPULATION
STUDY POPULATION
SAMPLE
Types of Samples
Types of Samples
ď‚— Probability (Random) Samples
â—¦ Simple random sample
â—¦ Systematic random sample
â—¦ Stratified random sample
â—¦ Multistage sample
â—¦ Multiphase sample
â—¦ Cluster sample
ď‚— Non-Probability Samples
â—¦ Convenience sample
â—¦ Purposive sample
â—¦ Quota
8
Process
Process
ď‚— The sampling process comprises several stages:
â—¦ Defining the population of concern
â—¦ Specifying a sampling frame, a set of items or
events possible to measure
â—¦ Specifying a sampling method for selecting items or
events from the frame
â—¦ Determining the sample size
â—¦ Implementing the sampling plan
â—¦ Sampling and data collecting
â—¦ Reviewing the sampling process
9
Population definition
Population definition
ď‚— A population can be defined as including all people
or items with the characteristic one wishes to
understand.
ď‚— Because there is very rarely enough time or money
to gather information from everyone or everything
in a population, the goal becomes finding a
representative sample (or subset) of that
population.
10
Population definition…….
Population definition…….
ď‚— Note also that the population from which the
sample is drawn may not be the same as the
population about which we actually want
information. Often there is large but not complete
overlap between these two groups due to frame
issues etc .
ď‚— Sometimes they may be entirely separate - for
instance, we might study rats in order to get a
better understanding of human health, or we might
study records from people born in 2008 in order to
make predictions about people born in 2009.
11
SAMPLING FRAME
SAMPLING FRAME
ď‚— In the most straightforward case, such as the sentencing
of a batch of material from production (acceptance
sampling by lots), it is possible to identify and measure
every single item in the population and to include any one
of them in our sample. However, in the more general
case this is not possible. There is no way to identify all
rats in the set of all rats. Where voting is not
compulsory, there is no way to identify which people will
actually vote at a forthcoming election (in advance of the
election)
ď‚— As a remedy, we seek a sampling frame which has the
property that we can identify every single element and
include any in our sample .
ď‚— The sampling frame must be representative of the
population
12
PROBABILITY SAMPLING
PROBABILITY SAMPLING
ď‚— A probability sampling scheme is one in which
every unit in the population has a chance
(greater than zero) of being selected in the
sample, and this probability can be accurately
determined.
ď‚— . When every element in the population does
have the same probability of selection, this is
known as an 'equal probability of selection'
(EPS) design. Such designs are also referred
to as 'self-weighting' because all sampled
units are given the same weight.
13
PROBABILITY SAMPLING…….
PROBABILITY SAMPLING…….
ď‚— Probability sampling includes:
ď‚— Simple Random Sampling,
ď‚— Systematic Sampling,
ď‚— Stratified Random Sampling,
ď‚— Cluster Sampling
ď‚— Multistage Sampling.
ď‚— Multiphase sampling
14
NON PROBABILITY SAMPLING
NON PROBABILITY SAMPLING
ď‚— Any sampling method where some elements of population
have no chance of selection (these are sometimes
referred to as 'out of coverage'/'undercovered'), or
where the probability of selection can't be accurately
determined. It involves the selection of elements based
on assumptions regarding the population of interest,
which forms the criteria for selection. Hence, because
the selection of elements is nonrandom, nonprobability
sampling not allows the estimation of sampling errors..
ď‚— Example: We visit every household in a given street, and
interview the first person to answer the door. In any
household with more than one occupant, this is a
nonprobability sample, because some people are more
likely to answer the door (e.g. an unemployed person who
spends most of their time at home is more likely to
answer than an employed housemate who might be at
work when the interviewer calls) and it's not practical to
calculate these probabilities.
15
NONPROBABILITY SAMPLING…….
NONPROBABILITY SAMPLING…….
• Nonprobability Sampling includes:
Accidental Sampling, Quota Sampling and
Purposive Sampling. In addition, nonresponse
effects may turn any probability design into a
nonprobability design if the characteristics of
nonresponse are not well understood, since
nonresponse effectively modifies each
element's probability of being sampled.
16
SIMPLE RANDOM SAMPLING
SIMPLE RANDOM SAMPLING
• Applicable when population is small,
homogeneous & readily available
• All subsets of the frame are given an equal
probability. Each element of the frame thus
has an equal probability of selection.
• It provides for greatest number of possible
samples. This is done by assigning a number to
each unit in the sampling frame.
• A table of random number or lottery system
is used to determine which units are to be
selected.
17
SIMPLE RANDOM SAMPLING……..
SIMPLE RANDOM SAMPLING……..
ď‚— Estimates are easy to calculate.
ď‚— Disadvantages
ď‚— If sampling frame large, this method impracticable.
ď‚— Minority subgroups of interest in population may not be
present in sample in sufficient numbers for study.
18
REPLACEMENT OF SELECTED UNITS
REPLACEMENT OF SELECTED UNITS
ď‚— Sampling schemes may be without replacement ('WOR' - no
element can be selected more than once in the same sample) or
with replacement ('WR' - an element may appear multiple times in
the one sample).
ď‚— For example, if we catch fish, measure them, and immediately
return them to the water before continuing with the sample, this
is a WR design, because we might end up catching and measuring
the same fish more than once. However, if we do not return the
fish to the water (e.g. if we eat the fish), this becomes a WOR
design.
19
SYSTEMATIC SAMPLING
SYSTEMATIC SAMPLING
ď‚— Systematic sampling relies on arranging the target
population according to some ordering scheme and
then selecting elements at regular intervals through
that ordered list.
ď‚— Systematic sampling involves a random start and then
proceeds with the selection of every kth element from
then onwards. In this case, k=(population size/sample
size).
ď‚— It is important that the starting point is not
automatically the first in the list, but is instead
randomly chosen from within the first to the kth
element in the list.
ď‚— A simple example would be to select every 10th name
from the telephone directory (an 'every 10th' sample,
also referred to as 'sampling with a skip of 10').
20
SYSTEMATIC SAMPLING……
SYSTEMATIC SAMPLING……
As described above, systematic sampling is an EPS method, because all
elements have the same probability of selection (in the example
given, one in ten). It is not 'simple random sampling' because
different subsets of the same size have different selection
probabilities - e.g. the set {4,14,24,...,994} has a one-in-ten
probability of selection, but the set {4,13,24,34,...} has zero
probability of selection.
21
SYSTEMATIC SAMPLING……
SYSTEMATIC SAMPLING……
ď‚— ADVANTAGES:
ď‚— Sample easy to select
ď‚— Suitable sampling frame can be identified easily
ď‚— Sample evenly spread over entire reference population
ď‚— DISADVANTAGES:
ď‚— Sample may be biased if hidden periodicity in population
coincides with that of selection.
ď‚— Difficult to assess precision of estimate from one survey.
22
STRATIFIED SAMPLING
STRATIFIED SAMPLING
Where population embraces a number of distinct
categories, the frame can be organized into
separate "strata." Each stratum is then sampled
as an independent sub-population, out of which
individual elements can be randomly selected.
ď‚— Every unit in a stratum has same chance of being
selected.
ď‚— Using same sampling fraction for all strata
ensures proportionate representation in the
sample.
ď‚— Adequate representation of minority subgroups of
interest can be ensured by stratification & varying
sampling fraction between strata as required.
23
STRATIFIED SAMPLING……
STRATIFIED SAMPLING……
ď‚— Finally, since each stratum is treated as an
independent population, different sampling
approaches can be applied to different strata.
ď‚— Drawbacks to using stratified sampling.
ď‚— First, sampling frame of entire population has
to be prepared separately for each stratum
ď‚— Second, when examining multiple criteria,
stratifying variables may be related to some,
but not to others, further complicating the
design, and potentially reducing the utility of
the strata.
ď‚— Finally, in some cases (such as designs with a
large number of strata, or those with a
specified minimum sample size per group),
stratified sampling can potentially require a
larger sample than would other methods
24
STRATIFIED SAMPLING…….
STRATIFIED SAMPLING…….
25
Draw a sample from each stratum
POSTSTRATIFICATION
POSTSTRATIFICATION
ď‚— Stratification is sometimes introduced after the sampling
phase in a process called "poststratification“.
ď‚— This approach is typically implemented due to a lack of prior
knowledge of an appropriate stratifying variable or when
the experimenter lacks the necessary information to create
a stratifying variable during the sampling phase. Although
the method is susceptible to the pitfalls of post hoc
approaches, it can provide several benefits in the right
situation. Implementation usually follows a simple random
sample. In addition to allowing for stratification on an
ancillary variable, poststratification can be used to
implement weighting, which can improve the precision of a
sample's estimates.
26
OVERSAMPLING
OVERSAMPLING
ď‚— Choice-based sampling is one of the stratified
sampling strategies. In this, data are stratified
on the target and a sample is taken from each
strata so that the rare target class will be more
represented in the sample. The model is then
built on this biased sample. The effects of the
input variables on the target are often
estimated with more precision with the choice-
based sample even when a smaller overall sample
size is taken, compared to a random sample. The
results usually must be adjusted to correct for
the oversampling.
27
CLUSTER SAMPLING…….
CLUSTER SAMPLING…….
ď‚— Advantages :
ď‚— Cuts down on the cost of preparing a sampling
frame.
ď‚— This can reduce travel and other administrative
costs.
ď‚— Disadvantages: sampling error is higher for a
simple random sample of same size.
ď‚— Often used to evaluate vaccination coverage in
EPI
28
CLUSTER SAMPLING
CLUSTER SAMPLING
ď‚— Cluster sampling is an example of 'two-stage sampling' .
ď‚— First stage a sample of areas is chosen;
ď‚— Second stage a sample of respondents within those areas
is selected.
ď‚— Population divided into clusters of homogeneous units,
usually based on geographical contiguity.
ď‚— Sampling units are groups rather than individuals.
ď‚— A sample of such clusters is then selected.
ď‚— All units from the selected clusters are studied.
29
CLUSTER SAMPLING…….
CLUSTER SAMPLING…….
• Identification of clusters
– List all cities, towns, villages & wards of cities with their
population falling in target area under study.
– Calculate cumulative population & divide by 30, this gives sampling
interval.
– Select a random no. less than or equal to sampling interval having
same no. of digits. This forms 1st
cluster.
– Random no.+ sampling interval = population of 2nd
cluster.
– Second cluster + sampling interval = 4th
cluster.
– Last or 30th
cluster = 29th
cluster + sampling interval
30
CLUSTER SAMPLING…….
CLUSTER SAMPLING…….
Two types of cluster sampling
methods.
One-stage sampling. All of the
elements within selected clusters
are included in the sample.
Two-stage sampling. A subset of
elements within selected clusters
are randomly selected for inclusion
in the sample.
31
Difference Between Strata and Clusters
Difference Between Strata and Clusters
ď‚— Although strata and clusters are both non-
overlapping subsets of the population, they
differ in several ways.
ď‚— All strata are represented in the sample; but
only a subset of clusters are in the sample.
ď‚— With stratified sampling, the best survey
results occur when elements within strata are
internally homogeneous. However, with cluster
sampling, the best results occur when elements
within clusters are internally heterogeneous
32
MULTISTAGE SAMPLING
MULTISTAGE SAMPLING
ď‚— Complex form of cluster sampling in which two or more levels of
units are embedded one in the other.
ď‚— First stage, random number of districts chosen in all
states.
ď‚— Followed by random number of talukas, villages.
ď‚— Then third stage units will be houses.
ď‚— All ultimate units (houses, for instance) selected at last step
are surveyed.
33
MULTISTAGE SAMPLING……..
MULTISTAGE SAMPLING……..
ď‚— This technique, is essentially the process of taking
random samples of preceding random samples.
ď‚— Not as effective as true random sampling, but probably
solves more of the problems inherent to random
sampling.
ď‚— An effective strategy because it banks on multiple
randomizations. As such, extremely useful.
ď‚— Multistage sampling used frequently when a complete
list of all members of the population not exists and is
inappropriate.
ď‚— Moreover, by avoiding the use of all sample units in all
selected clusters, multistage sampling avoids the large,
and perhaps unnecessary, costs associated with
traditional cluster sampling.
34
MULTI PHASE SAMPLING
MULTI PHASE SAMPLING
ď‚— Part of the information collected from whole sample & part from
subsample.
 In Tb survey MT in all cases – Phase I
 X –Ray chest in MT +ve cases – Phase II
 Sputum examination in X – Ray +ve cases - Phase III
ď‚— Survey by such procedure is less costly, less laborious & more
purposeful
35
MATCHED RANDOM SAMPLING
MATCHED RANDOM SAMPLING
A method of assigning participants to groups in which
pairs of participants are first matched on some
characteristic and then individually assigned randomly to
groups.
ď‚— The Procedure for Matched random sampling can be
briefed with the following contexts,
ď‚— Two samples in which the members are clearly paired, or
are matched explicitly by the researcher. For example,
IQ measurements or pairs of identical twins.
ď‚— Those samples in which the same attribute, or variable,
is measured twice on each subject, under different
circumstances. Commonly called repeated measures.
ď‚— Examples include the times of a group of athletes for
1500m before and after a week of special training; the
milk yields of cows before and after being fed a
particular diet.
36
QUOTA SAMPLING
QUOTA SAMPLING
ď‚— The population is first segmented into mutually
exclusive sub-groups, just as in stratified sampling.
ď‚— Then judgment used to select subjects or units from
each segment based on a specified proportion.
ď‚— For example, an interviewer may be told to sample 200
females and 300 males between the age of 45 and 60.
ď‚— It is this second step which makes the technique one of
non-probability sampling.
ď‚— In quota sampling the selection of the sample is non-
random.
ď‚— For example interviewers might be tempted to interview
those who look most helpful. The problem is that these
samples may be biased because not everyone gets a
chance of selection. This random element is its greatest
weakness and quota versus probability has been a matter
of controversy for many years
37
CONVENIENCE SAMPLING
CONVENIENCE SAMPLING
ď‚— Sometimes known as grab or opportunity sampling or accidental
or haphazard sampling.
ď‚— A type of nonprobability sampling which involves the sample being
drawn from that part of the population which is close to hand.
That is, readily available and convenient.
ď‚— The researcher using such a sample cannot scientifically make
generalizations about the total population from this sample
because it would not be representative enough.
ď‚— For example, if the interviewer was to conduct a survey at a
shopping center early in the morning on a given day, the people
that he/she could interview would be limited to those given there
at that given time, which would not represent the views of other
members of society in such an area, if the survey was to be
conducted at different times of day and several times per week.
ď‚— This type of sampling is most useful for pilot testing.
ď‚— In social science research, snowball sampling is a similar
technique, where existing study subjects are used to recruit more
subjects into the sample.
38
39
CONVENIENCE SAMPLING…….
CONVENIENCE SAMPLING…….
â—¦ Use results that are easy to get
39
Judgmental sampling or
Judgmental sampling or
Purposive sampling
Purposive sampling
ď‚— - The researcher chooses the
sample based on who they think
would be appropriate for the study.
This is used primarily when there is
a limited number of people that
have expertise in the area being
researched
40
PANEL SAMPLING
PANEL SAMPLING
ď‚— Method of first selecting a group of participants through a
random sampling method and then asking that group for the same
information again several times over a period of time.
ď‚— Therefore, each participant is given same survey or interview at
two or more time points; each period of data collection called a
"wave".
ď‚— This sampling methodology often chosen for large scale or
nation-wide studies in order to gauge changes in the population
with regard to any number of variables from chronic illness to job
stress to weekly food expenditures.
ď‚— Panel sampling can also be used to inform researchers about
within-person health changes due to age or help explain changes
in continuous dependent variables such as spousal interaction.
ď‚— There have been several proposed methods of analyzing panel
sample data, including growth curves.
41
What sampling method u recommend?
What sampling method u recommend?
ď‚— Determining proportion of undernourished five
year olds in a village.
ď‚— Investigating nutritional status of preschool
children.
ď‚— Selecting maternity records for the study of
previous abortions or duration of postnatal
stay.
ď‚— In estimation of immunization coverage in a
province, data on seven children aged 12-23
months in 30 clusters are used to determine
proportion of fully immunized children in the
province.
ď‚— Give reasons why cluster sampling is used in
this survey.
42
Probability proportional to size sampling
Probability proportional to size sampling
ď‚— In some cases the sample designer has access to an "auxiliary
variable" or "size measure", believed to be correlated to the
variable of interest, for each element in the population. This
data can be used to improve accuracy in sample design. One
option is to use the auxiliary variable as a basis for
stratification, as discussed above.
ď‚— Another option is probability-proportional-to-size ('PPS')
sampling, in which the selection probability for each element
is set to be proportional to its size measure, up to a
maximum of 1. In a simple PPS design, these selection
probabilities can then be used as the basis for Poisson
sampling. However, this has the drawbacks of variable sample
size, and different portions of the population may still be
over- or under-represented due to chance variation in
selections. To address this problem, PPS may be combined
with a systematic approach.
43
Contd.
Contd.
ď‚— Example: Suppose we have six schools with populations of 150, 180, 200, 220,
260, and 490 students respectively (total 1500 students), and we want to use
student population as the basis for a PPS sample of size three. To do this, we
could allocate the first school numbers 1 to 150, the second school 151 to
330 (= 150 + 180), the third school 331 to 530, and so on to the last school
(1011 to 1500). We then generate a random start between 1 and 500 (equal
to 1500/3) and count through the school populations by multiples of 500. If
our random start was 137, we would select the schools which have been
allocated numbers 137, 637, and 1137, i.e. the first, fourth, and sixth schools.
ď‚— The PPS approach can improve accuracy for a given sample size by
concentrating sample on large elements that have the greatest impact on
population estimates. PPS sampling is commonly used for surveys of
businesses, where element size varies greatly and auxiliary information is often
available - for instance, a survey attempting to measure the number of guest-
nights spent in hotels might use each hotel's number of rooms as an auxiliary
variable. In some cases, an older measurement of the variable of interest can
be used as an auxiliary variable when attempting to produce more current
estimates.
44
Event sampling
Event sampling
ď‚— Event Sampling Methodology (ESM) is a new form of
sampling method that allows researchers to study ongoing
experiences and events that vary across and within days in its
naturally-occurring environment. Because of the frequent
sampling of events inherent in ESM, it enables researchers to
measure the typology of activity and detect the temporal and
dynamic fluctuations of work experiences. Popularity of ESM
as a new form of research design increased over the recent
years because it addresses the shortcomings of cross-
sectional research, where once unable to, researchers can
now detect intra-individual variances across time. In ESM,
participants are asked to record their experiences and
perceptions in a paper or electronic diary.
 There are three types of ESM:# Signal contingent – random
beeping notifies participants to record data. The advantage of
this type of ESM is minimization of recall bias.
 Event contingent – records data when certain events occur
45
Contd.
Contd.
 Event contingent – records data when certain events occur
 Interval contingent – records data according to the passing of
a certain period of time
ď‚— ESM has several disadvantages. One of the disadvantages of
ESM is it can sometimes be perceived as invasive and
intrusive by participants. ESM also leads to possible self-
selection bias. It may be that only certain types of individuals
are willing to participate in this type of study creating a non-
random sample. Another concern is related to participant
cooperation. Participants may not be actually fill out their
diaries at the specified times. Furthermore, ESM may
substantively change the phenomenon being studied.
Reactivity or priming effects may occur, such that repeated
measurement may cause changes in the participants'
experiences. This method of sampling data is also highly
vulnerable to common method variance.[6]
46
contd.
contd.
ď‚— Further, it is important to think about whether or not
an appropriate dependent variable is being used in an
ESM design. For example, it might be logical to use ESM
in order to answer research questions which involve
dependent variables with a great deal of variation
throughout the day. Thus, variables such as change in
mood, change in stress level, or the immediate impact
of particular events may be best studied using ESM
methodology. However, it is not likely that utilizing ESM
will yield meaningful predictions when measuring
someone performing a repetitive task throughout the
day or when dependent variables are long-term in
nature (coronary heart problems).
47

More Related Content

PPT
File
PPT
sampling methods...
PPT
Sampling
PPT
sampling in marketing research and prep.
PPT
Sampling Methods and Techniques in Research.ppt
PPT
Phyton class by Pavan - Study notes incl
PPT
435454541511878181818156681818484911.ppt
PPT
43911.ppt
File
sampling methods...
Sampling
sampling in marketing research and prep.
Sampling Methods and Techniques in Research.ppt
Phyton class by Pavan - Study notes incl
435454541511878181818156681818484911.ppt
43911.ppt

Similar to Sampling Methods, Types of Sampling Methods (20)

PPT
Soil Sampling Soil Sampling Soil Sampling Soil Sampling
PPT
43911.ppt
PPT
43911.ppt
PPT
43911.ppt
PPT
43911.ppt
PPT
43911.ppt
PPT
It is one of the best resrch sampling.ppt
PPT
Sampling method.ppt
PPTX
sampling%20method.pptx
PPT
SAMPLING Methods 12 December 2024 presentation
PPT
Research Method
PPT
Methods of Sampling. (Business Statistics)
PPT
Methods of Sampling used in dentistry. ppt
PPT
Sampling.ppt
PPTX
Sampling Design.pptx
PPT
43911.ppt
PPTX
Basic Statistics Sampling Techniques 2.pptx
PPT
Sampling method in research
PPT
Sample method
PPT
Sampling Methods
Soil Sampling Soil Sampling Soil Sampling Soil Sampling
43911.ppt
43911.ppt
43911.ppt
43911.ppt
43911.ppt
It is one of the best resrch sampling.ppt
Sampling method.ppt
sampling%20method.pptx
SAMPLING Methods 12 December 2024 presentation
Research Method
Methods of Sampling. (Business Statistics)
Methods of Sampling used in dentistry. ppt
Sampling.ppt
Sampling Design.pptx
43911.ppt
Basic Statistics Sampling Techniques 2.pptx
Sampling method in research
Sample method
Sampling Methods
Ad

Recently uploaded (20)

PDF
VCE English Exam - Section C Student Revision Booklet
PPTX
Pharma ospi slides which help in ospi learning
PPTX
human mycosis Human fungal infections are called human mycosis..pptx
PDF
FourierSeries-QuestionsWithAnswers(Part-A).pdf
PPTX
Cell Structure & Organelles in detailed.
PPTX
Presentation on HIE in infants and its manifestations
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PDF
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
01-Introduction-to-Information-Management.pdf
PDF
RMMM.pdf make it easy to upload and study
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
PDF
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
Microbial diseases, their pathogenesis and prophylaxis
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
PPTX
Final Presentation General Medicine 03-08-2024.pptx
PPTX
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
 
VCE English Exam - Section C Student Revision Booklet
Pharma ospi slides which help in ospi learning
human mycosis Human fungal infections are called human mycosis..pptx
FourierSeries-QuestionsWithAnswers(Part-A).pdf
Cell Structure & Organelles in detailed.
Presentation on HIE in infants and its manifestations
STATICS OF THE RIGID BODIES Hibbelers.pdf
The Lost Whites of Pakistan by Jahanzaib Mughal.pdf
Anesthesia in Laparoscopic Surgery in India
01-Introduction-to-Information-Management.pdf
RMMM.pdf make it easy to upload and study
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
1st Inaugural Professorial Lecture held on 19th February 2020 (Governance and...
GENETICS IN BIOLOGY IN SECONDARY LEVEL FORM 3
Final Presentation General Medicine 03-08-2024.pptx
Microbial diseases, their pathogenesis and prophylaxis
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
OBE - B.A.(HON'S) IN INTERIOR ARCHITECTURE -Ar.MOHIUDDIN.pdf
Final Presentation General Medicine 03-08-2024.pptx
PPT- ENG7_QUARTER1_LESSON1_WEEK1. IMAGERY -DESCRIPTIONS pptx.pptx
 
Ad

Sampling Methods, Types of Sampling Methods

  • 2. LEARNING OBJECTIVES LEARNING OBJECTIVES ď‚— Learn the reasons for sampling ď‚— Develop an understanding about different sampling methods ď‚— Distinguish between probability & non probability sampling ď‚— Discuss the relative advantages & disadvantages of each sampling methods 2
  • 3. SAMPLING SAMPLING ď‚— A sample is “a smaller (but hopefully representative) collection of units from a population used to determine truths about that population” ď‚— Why sample? â—¦ Resources (time, money) and workload â—¦ Gives results with known accuracy that can be calculated mathematically ď‚— The sampling frame is the list from which the potential respondents are drawn â—¦ Registrar’s office â—¦ Class rosters 3
  • 4. SAMPLING…… SAMPLING…… ď‚— What is your population of interest? ď‚– To whom do you want to generalize your results? ď‚– All doctors ď‚– School children ď‚– Indians ď‚– Women aged 15-45 years ď‚– Other ď‚— Can you sample the entire population? 4
  • 5. SAMPLING SAMPLING……. ……. ď‚— 3 factors that influence sample representative-ness ď‚– Sampling procedure ď‚– Sample size ď‚– Participation (response) ď‚— When might you sample the entire population? ď‚– When your population is very small ď‚– When you have extensive resources ď‚– When you don’t expect a very high response 5
  • 8. Types of Samples Types of Samples ď‚— Probability (Random) Samples â—¦ Simple random sample â—¦ Systematic random sample â—¦ Stratified random sample â—¦ Multistage sample â—¦ Multiphase sample â—¦ Cluster sample ď‚— Non-Probability Samples â—¦ Convenience sample â—¦ Purposive sample â—¦ Quota 8
  • 9. Process Process ď‚— The sampling process comprises several stages: â—¦ Defining the population of concern â—¦ Specifying a sampling frame, a set of items or events possible to measure â—¦ Specifying a sampling method for selecting items or events from the frame â—¦ Determining the sample size â—¦ Implementing the sampling plan â—¦ Sampling and data collecting â—¦ Reviewing the sampling process 9
  • 10. Population definition Population definition ď‚— A population can be defined as including all people or items with the characteristic one wishes to understand. ď‚— Because there is very rarely enough time or money to gather information from everyone or everything in a population, the goal becomes finding a representative sample (or subset) of that population. 10
  • 11. Population definition……. Population definition……. ď‚— Note also that the population from which the sample is drawn may not be the same as the population about which we actually want information. Often there is large but not complete overlap between these two groups due to frame issues etc . ď‚— Sometimes they may be entirely separate - for instance, we might study rats in order to get a better understanding of human health, or we might study records from people born in 2008 in order to make predictions about people born in 2009. 11
  • 12. SAMPLING FRAME SAMPLING FRAME ď‚— In the most straightforward case, such as the sentencing of a batch of material from production (acceptance sampling by lots), it is possible to identify and measure every single item in the population and to include any one of them in our sample. However, in the more general case this is not possible. There is no way to identify all rats in the set of all rats. Where voting is not compulsory, there is no way to identify which people will actually vote at a forthcoming election (in advance of the election) ď‚— As a remedy, we seek a sampling frame which has the property that we can identify every single element and include any in our sample . ď‚— The sampling frame must be representative of the population 12
  • 13. PROBABILITY SAMPLING PROBABILITY SAMPLING ď‚— A probability sampling scheme is one in which every unit in the population has a chance (greater than zero) of being selected in the sample, and this probability can be accurately determined. ď‚— . When every element in the population does have the same probability of selection, this is known as an 'equal probability of selection' (EPS) design. Such designs are also referred to as 'self-weighting' because all sampled units are given the same weight. 13
  • 14. PROBABILITY SAMPLING……. PROBABILITY SAMPLING……. ď‚— Probability sampling includes: ď‚— Simple Random Sampling, ď‚— Systematic Sampling, ď‚— Stratified Random Sampling, ď‚— Cluster Sampling ď‚— Multistage Sampling. ď‚— Multiphase sampling 14
  • 15. NON PROBABILITY SAMPLING NON PROBABILITY SAMPLING ď‚— Any sampling method where some elements of population have no chance of selection (these are sometimes referred to as 'out of coverage'/'undercovered'), or where the probability of selection can't be accurately determined. It involves the selection of elements based on assumptions regarding the population of interest, which forms the criteria for selection. Hence, because the selection of elements is nonrandom, nonprobability sampling not allows the estimation of sampling errors.. ď‚— Example: We visit every household in a given street, and interview the first person to answer the door. In any household with more than one occupant, this is a nonprobability sample, because some people are more likely to answer the door (e.g. an unemployed person who spends most of their time at home is more likely to answer than an employed housemate who might be at work when the interviewer calls) and it's not practical to calculate these probabilities. 15
  • 16. NONPROBABILITY SAMPLING……. NONPROBABILITY SAMPLING……. • Nonprobability Sampling includes: Accidental Sampling, Quota Sampling and Purposive Sampling. In addition, nonresponse effects may turn any probability design into a nonprobability design if the characteristics of nonresponse are not well understood, since nonresponse effectively modifies each element's probability of being sampled. 16
  • 17. SIMPLE RANDOM SAMPLING SIMPLE RANDOM SAMPLING • Applicable when population is small, homogeneous & readily available • All subsets of the frame are given an equal probability. Each element of the frame thus has an equal probability of selection. • It provides for greatest number of possible samples. This is done by assigning a number to each unit in the sampling frame. • A table of random number or lottery system is used to determine which units are to be selected. 17
  • 18. SIMPLE RANDOM SAMPLING…….. SIMPLE RANDOM SAMPLING…….. ď‚— Estimates are easy to calculate. ď‚— Disadvantages ď‚— If sampling frame large, this method impracticable. ď‚— Minority subgroups of interest in population may not be present in sample in sufficient numbers for study. 18
  • 19. REPLACEMENT OF SELECTED UNITS REPLACEMENT OF SELECTED UNITS ď‚— Sampling schemes may be without replacement ('WOR' - no element can be selected more than once in the same sample) or with replacement ('WR' - an element may appear multiple times in the one sample). ď‚— For example, if we catch fish, measure them, and immediately return them to the water before continuing with the sample, this is a WR design, because we might end up catching and measuring the same fish more than once. However, if we do not return the fish to the water (e.g. if we eat the fish), this becomes a WOR design. 19
  • 20. SYSTEMATIC SAMPLING SYSTEMATIC SAMPLING ď‚— Systematic sampling relies on arranging the target population according to some ordering scheme and then selecting elements at regular intervals through that ordered list. ď‚— Systematic sampling involves a random start and then proceeds with the selection of every kth element from then onwards. In this case, k=(population size/sample size). ď‚— It is important that the starting point is not automatically the first in the list, but is instead randomly chosen from within the first to the kth element in the list. ď‚— A simple example would be to select every 10th name from the telephone directory (an 'every 10th' sample, also referred to as 'sampling with a skip of 10'). 20
  • 21. SYSTEMATIC SAMPLING…… SYSTEMATIC SAMPLING…… As described above, systematic sampling is an EPS method, because all elements have the same probability of selection (in the example given, one in ten). It is not 'simple random sampling' because different subsets of the same size have different selection probabilities - e.g. the set {4,14,24,...,994} has a one-in-ten probability of selection, but the set {4,13,24,34,...} has zero probability of selection. 21
  • 22. SYSTEMATIC SAMPLING…… SYSTEMATIC SAMPLING…… ď‚— ADVANTAGES: ď‚— Sample easy to select ď‚— Suitable sampling frame can be identified easily ď‚— Sample evenly spread over entire reference population ď‚— DISADVANTAGES: ď‚— Sample may be biased if hidden periodicity in population coincides with that of selection. ď‚— Difficult to assess precision of estimate from one survey. 22
  • 23. STRATIFIED SAMPLING STRATIFIED SAMPLING Where population embraces a number of distinct categories, the frame can be organized into separate "strata." Each stratum is then sampled as an independent sub-population, out of which individual elements can be randomly selected. ď‚— Every unit in a stratum has same chance of being selected. ď‚— Using same sampling fraction for all strata ensures proportionate representation in the sample. ď‚— Adequate representation of minority subgroups of interest can be ensured by stratification & varying sampling fraction between strata as required. 23
  • 24. STRATIFIED SAMPLING…… STRATIFIED SAMPLING…… ď‚— Finally, since each stratum is treated as an independent population, different sampling approaches can be applied to different strata. ď‚— Drawbacks to using stratified sampling. ď‚— First, sampling frame of entire population has to be prepared separately for each stratum ď‚— Second, when examining multiple criteria, stratifying variables may be related to some, but not to others, further complicating the design, and potentially reducing the utility of the strata. ď‚— Finally, in some cases (such as designs with a large number of strata, or those with a specified minimum sample size per group), stratified sampling can potentially require a larger sample than would other methods 24
  • 26. POSTSTRATIFICATION POSTSTRATIFICATION ď‚— Stratification is sometimes introduced after the sampling phase in a process called "poststratification“. ď‚— This approach is typically implemented due to a lack of prior knowledge of an appropriate stratifying variable or when the experimenter lacks the necessary information to create a stratifying variable during the sampling phase. Although the method is susceptible to the pitfalls of post hoc approaches, it can provide several benefits in the right situation. Implementation usually follows a simple random sample. In addition to allowing for stratification on an ancillary variable, poststratification can be used to implement weighting, which can improve the precision of a sample's estimates. 26
  • 27. OVERSAMPLING OVERSAMPLING ď‚— Choice-based sampling is one of the stratified sampling strategies. In this, data are stratified on the target and a sample is taken from each strata so that the rare target class will be more represented in the sample. The model is then built on this biased sample. The effects of the input variables on the target are often estimated with more precision with the choice- based sample even when a smaller overall sample size is taken, compared to a random sample. The results usually must be adjusted to correct for the oversampling. 27
  • 28. CLUSTER SAMPLING……. CLUSTER SAMPLING……. ď‚— Advantages : ď‚— Cuts down on the cost of preparing a sampling frame. ď‚— This can reduce travel and other administrative costs. ď‚— Disadvantages: sampling error is higher for a simple random sample of same size. ď‚— Often used to evaluate vaccination coverage in EPI 28
  • 29. CLUSTER SAMPLING CLUSTER SAMPLING ď‚— Cluster sampling is an example of 'two-stage sampling' . ď‚— First stage a sample of areas is chosen; ď‚— Second stage a sample of respondents within those areas is selected. ď‚— Population divided into clusters of homogeneous units, usually based on geographical contiguity. ď‚— Sampling units are groups rather than individuals. ď‚— A sample of such clusters is then selected. ď‚— All units from the selected clusters are studied. 29
  • 30. CLUSTER SAMPLING……. CLUSTER SAMPLING……. • Identification of clusters – List all cities, towns, villages & wards of cities with their population falling in target area under study. – Calculate cumulative population & divide by 30, this gives sampling interval. – Select a random no. less than or equal to sampling interval having same no. of digits. This forms 1st cluster. – Random no.+ sampling interval = population of 2nd cluster. – Second cluster + sampling interval = 4th cluster. – Last or 30th cluster = 29th cluster + sampling interval 30
  • 31. CLUSTER SAMPLING……. CLUSTER SAMPLING……. Two types of cluster sampling methods. One-stage sampling. All of the elements within selected clusters are included in the sample. Two-stage sampling. A subset of elements within selected clusters are randomly selected for inclusion in the sample. 31
  • 32. Difference Between Strata and Clusters Difference Between Strata and Clusters ď‚— Although strata and clusters are both non- overlapping subsets of the population, they differ in several ways. ď‚— All strata are represented in the sample; but only a subset of clusters are in the sample. ď‚— With stratified sampling, the best survey results occur when elements within strata are internally homogeneous. However, with cluster sampling, the best results occur when elements within clusters are internally heterogeneous 32
  • 33. MULTISTAGE SAMPLING MULTISTAGE SAMPLING ď‚— Complex form of cluster sampling in which two or more levels of units are embedded one in the other. ď‚— First stage, random number of districts chosen in all states. ď‚— Followed by random number of talukas, villages. ď‚— Then third stage units will be houses. ď‚— All ultimate units (houses, for instance) selected at last step are surveyed. 33
  • 34. MULTISTAGE SAMPLING…….. MULTISTAGE SAMPLING…….. ď‚— This technique, is essentially the process of taking random samples of preceding random samples. ď‚— Not as effective as true random sampling, but probably solves more of the problems inherent to random sampling. ď‚— An effective strategy because it banks on multiple randomizations. As such, extremely useful. ď‚— Multistage sampling used frequently when a complete list of all members of the population not exists and is inappropriate. ď‚— Moreover, by avoiding the use of all sample units in all selected clusters, multistage sampling avoids the large, and perhaps unnecessary, costs associated with traditional cluster sampling. 34
  • 35. MULTI PHASE SAMPLING MULTI PHASE SAMPLING ď‚— Part of the information collected from whole sample & part from subsample. ď‚— In Tb survey MT in all cases – Phase I ď‚— X –Ray chest in MT +ve cases – Phase II ď‚— Sputum examination in X – Ray +ve cases - Phase III ď‚— Survey by such procedure is less costly, less laborious & more purposeful 35
  • 36. MATCHED RANDOM SAMPLING MATCHED RANDOM SAMPLING A method of assigning participants to groups in which pairs of participants are first matched on some characteristic and then individually assigned randomly to groups. ď‚— The Procedure for Matched random sampling can be briefed with the following contexts, ď‚— Two samples in which the members are clearly paired, or are matched explicitly by the researcher. For example, IQ measurements or pairs of identical twins. ď‚— Those samples in which the same attribute, or variable, is measured twice on each subject, under different circumstances. Commonly called repeated measures. ď‚— Examples include the times of a group of athletes for 1500m before and after a week of special training; the milk yields of cows before and after being fed a particular diet. 36
  • 37. QUOTA SAMPLING QUOTA SAMPLING ď‚— The population is first segmented into mutually exclusive sub-groups, just as in stratified sampling. ď‚— Then judgment used to select subjects or units from each segment based on a specified proportion. ď‚— For example, an interviewer may be told to sample 200 females and 300 males between the age of 45 and 60. ď‚— It is this second step which makes the technique one of non-probability sampling. ď‚— In quota sampling the selection of the sample is non- random. ď‚— For example interviewers might be tempted to interview those who look most helpful. The problem is that these samples may be biased because not everyone gets a chance of selection. This random element is its greatest weakness and quota versus probability has been a matter of controversy for many years 37
  • 38. CONVENIENCE SAMPLING CONVENIENCE SAMPLING ď‚— Sometimes known as grab or opportunity sampling or accidental or haphazard sampling. ď‚— A type of nonprobability sampling which involves the sample being drawn from that part of the population which is close to hand. That is, readily available and convenient. ď‚— The researcher using such a sample cannot scientifically make generalizations about the total population from this sample because it would not be representative enough. ď‚— For example, if the interviewer was to conduct a survey at a shopping center early in the morning on a given day, the people that he/she could interview would be limited to those given there at that given time, which would not represent the views of other members of society in such an area, if the survey was to be conducted at different times of day and several times per week. ď‚— This type of sampling is most useful for pilot testing. ď‚— In social science research, snowball sampling is a similar technique, where existing study subjects are used to recruit more subjects into the sample. 38
  • 40. Judgmental sampling or Judgmental sampling or Purposive sampling Purposive sampling ď‚— - The researcher chooses the sample based on who they think would be appropriate for the study. This is used primarily when there is a limited number of people that have expertise in the area being researched 40
  • 41. PANEL SAMPLING PANEL SAMPLING ď‚— Method of first selecting a group of participants through a random sampling method and then asking that group for the same information again several times over a period of time. ď‚— Therefore, each participant is given same survey or interview at two or more time points; each period of data collection called a "wave". ď‚— This sampling methodology often chosen for large scale or nation-wide studies in order to gauge changes in the population with regard to any number of variables from chronic illness to job stress to weekly food expenditures. ď‚— Panel sampling can also be used to inform researchers about within-person health changes due to age or help explain changes in continuous dependent variables such as spousal interaction. ď‚— There have been several proposed methods of analyzing panel sample data, including growth curves. 41
  • 42. What sampling method u recommend? What sampling method u recommend? ď‚— Determining proportion of undernourished five year olds in a village. ď‚— Investigating nutritional status of preschool children. ď‚— Selecting maternity records for the study of previous abortions or duration of postnatal stay. ď‚— In estimation of immunization coverage in a province, data on seven children aged 12-23 months in 30 clusters are used to determine proportion of fully immunized children in the province. ď‚— Give reasons why cluster sampling is used in this survey. 42
  • 43. Probability proportional to size sampling Probability proportional to size sampling ď‚— In some cases the sample designer has access to an "auxiliary variable" or "size measure", believed to be correlated to the variable of interest, for each element in the population. This data can be used to improve accuracy in sample design. One option is to use the auxiliary variable as a basis for stratification, as discussed above. ď‚— Another option is probability-proportional-to-size ('PPS') sampling, in which the selection probability for each element is set to be proportional to its size measure, up to a maximum of 1. In a simple PPS design, these selection probabilities can then be used as the basis for Poisson sampling. However, this has the drawbacks of variable sample size, and different portions of the population may still be over- or under-represented due to chance variation in selections. To address this problem, PPS may be combined with a systematic approach. 43
  • 44. Contd. Contd. ď‚— Example: Suppose we have six schools with populations of 150, 180, 200, 220, 260, and 490 students respectively (total 1500 students), and we want to use student population as the basis for a PPS sample of size three. To do this, we could allocate the first school numbers 1 to 150, the second school 151 to 330 (= 150 + 180), the third school 331 to 530, and so on to the last school (1011 to 1500). We then generate a random start between 1 and 500 (equal to 1500/3) and count through the school populations by multiples of 500. If our random start was 137, we would select the schools which have been allocated numbers 137, 637, and 1137, i.e. the first, fourth, and sixth schools. ď‚— The PPS approach can improve accuracy for a given sample size by concentrating sample on large elements that have the greatest impact on population estimates. PPS sampling is commonly used for surveys of businesses, where element size varies greatly and auxiliary information is often available - for instance, a survey attempting to measure the number of guest- nights spent in hotels might use each hotel's number of rooms as an auxiliary variable. In some cases, an older measurement of the variable of interest can be used as an auxiliary variable when attempting to produce more current estimates. 44
  • 45. Event sampling Event sampling ď‚— Event Sampling Methodology (ESM) is a new form of sampling method that allows researchers to study ongoing experiences and events that vary across and within days in its naturally-occurring environment. Because of the frequent sampling of events inherent in ESM, it enables researchers to measure the typology of activity and detect the temporal and dynamic fluctuations of work experiences. Popularity of ESM as a new form of research design increased over the recent years because it addresses the shortcomings of cross- sectional research, where once unable to, researchers can now detect intra-individual variances across time. In ESM, participants are asked to record their experiences and perceptions in a paper or electronic diary. ď‚— There are three types of ESM:# Signal contingent – random beeping notifies participants to record data. The advantage of this type of ESM is minimization of recall bias. ď‚— Event contingent – records data when certain events occur 45
  • 46. Contd. Contd. ď‚— Event contingent – records data when certain events occur ď‚— Interval contingent – records data according to the passing of a certain period of time ď‚— ESM has several disadvantages. One of the disadvantages of ESM is it can sometimes be perceived as invasive and intrusive by participants. ESM also leads to possible self- selection bias. It may be that only certain types of individuals are willing to participate in this type of study creating a non- random sample. Another concern is related to participant cooperation. Participants may not be actually fill out their diaries at the specified times. Furthermore, ESM may substantively change the phenomenon being studied. Reactivity or priming effects may occur, such that repeated measurement may cause changes in the participants' experiences. This method of sampling data is also highly vulnerable to common method variance.[6] 46
  • 47. contd. contd. ď‚— Further, it is important to think about whether or not an appropriate dependent variable is being used in an ESM design. For example, it might be logical to use ESM in order to answer research questions which involve dependent variables with a great deal of variation throughout the day. Thus, variables such as change in mood, change in stress level, or the immediate impact of particular events may be best studied using ESM methodology. However, it is not likely that utilizing ESM will yield meaningful predictions when measuring someone performing a repetitive task throughout the day or when dependent variables are long-term in nature (coronary heart problems). 47

Editor's Notes

  • #3: Sampling frame errors: university versus personal email addresses; changing class rosters; are all students in your population of interest represented?
  • #4: How do we determine our population of interest? Administrators can tell us We notice anecdotally or through qualitative research that a particular subgroup of students is experiencing higher risk We decide to do everyone and go from there 3 factors that influence sample representativeness Sampling procedure Sample size Participation (response) When might you sample the entire population? When your population is very small When you have extensive resources When you don’t expect a very high response
  • #6: Picture of sampling breakdown
  • #8: Two general approaches to sampling are used in social science research. With probability sampling, all elements (e.g., persons, households) in the population have some opportunity of being included in the sample, and the mathematical probability that any one of them will be selected can be calculated. With nonprobability sampling, in contrast, population elements are selected on the basis of their availability (e.g., because they volunteered) or because of the researcher's personal judgment that they are representative. The consequence is that an unknown portion of the population is excluded (e.g., those who did not volunteer). One of the most common types of nonprobability sample is called a convenience sample – not because such samples are necessarily easy to recruit, but because the researcher uses whatever individuals are available rather than selecting from the entire population. Because some members of the population have no chance of being sampled, the extent to which a convenience sample – regardless of its size – actually represents the entire population cannot be known