SlideShare a Scribd company logo
1
Research Methods in Health
Chapter 6. Sampling Design
Young Moon Chae, Ph.D.
Graduate School of Public Health
Yonsei University, Korea
ymchae@yuhs.ac
2
Table of Contents
• Introduction
• Population and sampling
• Probability sampling and non-probability sampling
• Types of probability sampling
• Determination of sample size
3
Introduction to Sampling Design
• Selection of some part of an aggregate or totality on the basis of which a
judgment or inference (statistical) about the aggregate or totality is made
• Items so selected constitute a sample
• The selection process or technique is sample design
• Survey conducted on the basis of sample is sample survey
• A complete enumeration of all the items in the ‘Population’ is called a
census inquiry
• Census inquiry requires enormous time, money & energy; slightest bias
may get magnified when number of observations are increased
• It is possible to obtain sufficiently accurate results by studying only a part (a
miniature cross section ) of the population
4
Basic Concepts
• UNIVERSE: Totality of items or units in any field of enquiry
• POPULATION
- Total of items about which information is desired aggregate of elementary units (finite or infinite, N)
possess at least one common characteristics -real or hypothetical
• ELEMENTARY UNITS
- Units possessing the relevant characteristics i.e., attributes that are the object of study (operational
definition)
• SAMPLING FRAME:
- A list of all the units of population
- Perfect frames seldom exists; elementary units or cluster of such units or the group form the basis for
frame for finite population
- Frame is either constructed by the researcher or some existing list of population is used
- Sampling frame should be a good representative of the population and as far as possible free from (i)
Incompleteness (ii) Inaccurateness (iii)Inadequateness (iv) Out-of-date (complete, no missing element, no
ineligibles, no element should appear more than once & up to date)
• SAMPLING DESIGN: (Target population →Survey population)
- A definite plan for obtaining a sample from the sampling frame
- Refers to technique or procedure adopted by the researcher
5
Characteristics of a Good Sampling Design
• Truly representative (Representativeness is not the property of the sample but
of the procedure by which the sample is obtained)
• Having small sampling error
• Economically viable
• Systematic bias is controlled (in a better way)
• Results can be applied to the universe in general with a reasonable level of
confidence ⇒reliability
• Optimum size (adequately large)
• Similar to population / universe
• Should have all the characteristics that are present in the population / universe
6
Population and Sampling
• Sampling is the process of selection of a number of units
from a defined study population.
• The process of sampling involves:
1. Identification of study population
2. Determination of sampling population
3. Definition of the sampling unit
4. Choice of sampling method
5. Estimation of the sample size
7
Identification of study population
• The study or target population is the one upon which the results of the study
will be generalized.
• It is crucial that the study population is clearly defined, since it is the most
important determinant of the sampling population
8
Determination of sampling population
• The sampling population is the one from which the sample is drawn.
• The definition of the sampling population by the investigator is governed by
two factors:
- Feasibility: reachable sampling population
- External validity: the ability to generalize from the study results to the target
population.
9
Aims and Advantages of Sampling
• By examining a sample it saves money. Obviously the assessment of a
sample of 100 from a population of 1000 will reduce costs considerably not
only because the sample consists of fewer people, but also because
sampling saves labour - only a smaller number of staff are needed for both
the fieldwork and the processing of data.
• Sampling also saves time. An estimate of the characteristics of the
population can be gathered in a short time. As a result of reduced costs and
the time saved through sampling, more attention can be given to each
subject hence, increasing the accuracy of the analysis.
• Sampling allows us to deduce characteristics of a population while
observing only some of its members (the sample). In order to achieve this
aim it is necessary to avoid selection bias.
• Sampling allows for more accurate measurements
− Inspection fatigue is reduced (non-sampling error)
− Sampling error can be studied, controlled & probability statement can be made about
magnitude
10
Bias in the Sampling
• Bias in the selection of a sample can arise if:
•The sampling is done by a non-random method, which generally means the
selection is influenced by human choice.
• The sampling frame (census, phone book) which serves as the basis for
selection does not cover the population adequately, completely or
accurately.
• Groups of the population are not represented because you can't find them
or they refuse to participate.
11
Choice of sampling method
• Non probability sampling
- In non-probability sampling, there is no assurance that each individual has a
known chance of being selected
• Probability sampling
- Probability sampling means that each individual in the population has a known
probability or chance of selection into the sample
12
Non-probability Sampling:
• Types of non-probability sampling:
- Convenience sampling
- Quota sampling
üSelection based on some basic parameters like age, sex, income, etc. of population so as
to make it representative
üField workers are assigned quotas of number of units satisfying the required
characteristics for collecting data
üLooks similar to stratified sampling, but differs in the fact that the discretion of field worker
is not found in ‘Stratified sampling’ which makes random sample from each cell
üDifficult to obtain an accurate & up to -date proportion of respondents assigned to each
cell
• Not recommended in medical research
- It is by far the most biases sampling procedure as it is not random (not everyone
in the population has an equal chance of being selected to participate in the study).
13
Probability Sampling
• Assigns equal probability to each unit of the population i.e., every element
has equal (non-zero) chance of being selected (equi-probability)
• Random does not mean haphazard
• Errors of estimation or significance of results obtained can be measured
• Best technique for representative sample
• Ensures the law of statistical regularity (i.e.. on an average,sample chosen
will have the same composition and structure as the universe / population)
• Even each combination (possible ) of sample will have equal probability of
being picked
• All choices are independent of one another
14
Types of Probability Sampling
• Simple random sampling
• Systematic random sampling
• Stratified random sampling
• Multi-stage random sampling
• Cluster sampling
• Multi-phase sampling
15
Simple Random Sampling
• In this method, all subject or elements have an equal probability of being
selected. There are two major ways of conducting a random sample.
• SRS is the basic selection process & all other procedures are viewed as
modifications of SRS
• The first is to consult a random number table, and the second is to have the
computer select a random sample.
e.g., Throwing of a fair dice. Probability of getting one is same for each of 20
sample throws and the 20 throws are all independent
16
16
Sampling Design
17
Advantage
• Decreases external validity of the results
• Allows for a smaller sample size
• Eliminates the influence of confounding factors
(cont.)
18
Systematic Random Sampling
• A systematic sample is conducted by randomly selecting a first case on a
list of the population and then proceeding every Nth case until your sample
is selected. This is particularly useful if your list of the population is long.
• For example, if your list was the phone book, it would be easiest to start at
perhaps the 17th person, and then select every 50th person from that point
on.
19
(cont.)
• Advantage of systematic sampling
− Easier to use & less costlier for large population
− Sample is spread more evenly over the entire population
− Elements can be ordered in a manner found in the universe
− Can be used even without list of units in the population
• Disadvantage
− If there is hidden periodicity or intrinsic cyclical relationship between
elements of the population, it is dangerous
e.g.:10 sample days selected from 70 consecutive days for number of
issues in a library
20
Stratified Sampling
• In a stratified sample, we sample either proportionately or equally to
represent various strata or subpopulations.
• In the proportional allocation, the same sampling ratio may be used in all
strata. That is, the number of individuals chosen in each stratum I
proportional to the size of the stratum
• If a uniform sampling fraction was used, the sample is ‘self-weighting’ and
can for some purposes be treated as if it were a simple random or
systematic sample
• For example if our strata were cities in a country we would make sure and
sample from each of the cities. If our strata were gender, we would sample
both men and women.
21
(cont.)
Steps for stratified sampling
• Divide population into several sub-population or strata which are more
homogeneous than population based on factors or characteristics more closely
related to the purpose of study, past experience, personal judgment and
consideration of relationship between the characteristics of population and
characteristics to be estimated. Conduct a pilot study if necessary and examine the
variances within and among strata.
• Make simple random sampling from each strata
- If stratification is implicit in ordering the list, systematic random sampling will serve the
purpose
- e.g., Students of school by the standard in which studying
• Decide on the size of the sample for each strata based on: (I) Size(ii) Variability &
(iii) Sampling cost of strata
- A uniform sampling fraction: Sample size is proportional to the size, variability & cost ( of
sampling ) of strata
- A variable sampling fraction : may be appropriate when there is special interest in some sub
groups or strata.( Weighting adjustments may be necessary while analyzing data)
22
22
23
Advantage
• There is less sampling variation than with simple random or systematic
sampling
• If the strata are more uniform than the total population with respect to other
attributes, it reduces sampling variation with respect to other properties also
• The greater the differences between the strata and the less the differences
within the strata. The greater is the gain due to stratification
• It is possible to compare the differences in mean between the strata by
using ANOVA
(cont.)
24
Multi-stage Sampling
• This method is used in large-scale surveys. A sample of first—stage
sampling units is chosen, each of the selected units is divided into second-
stage units, samples of second-stage units are selected, and so on
• Different methods (simple random, stratified, systematic or cluster sampling)
may be used at any stage
• The first-stage units may be province, villages, or other aggregations. This
method ha the same advantages as cluster sampling – less travel by
interviewers, no need for a sampling frame showing all individuals in the
population, etc.
25
(cont.)
Country
Provinces
Cities
Districts
Households
Person
Population
Sample
Subsample
26
Cluster Sampling
• In cluster sampling we take a random sample of strata and then survey
every member of the group.
• That is, the sampling units are clusters, and the sampling frame is a list of
these clusters.
• For example, if our strata were individuals schools in a city, we would
randomly select a number of schools and then test all of the students within
those schools.
27
27
28
(cont.)
• Advantage
- Less traveling by interviewers, fewer school teachers to negotiate with
- No need for a sampling frame showing all individuals in the population
- Better field supervision
• Disadvantage
- If the clusters contain similar persons (high intra-class correlation), it is difficult to
estimate the precision with which generalizations may be made to the parent
population
- A large number of small clusters is preferable to a small number of large clusters
29
Determination of Sample Size
• Nature of universe / population
- Heterogeneous or homogenous
- Dispersion factor (or variability)
• Number of variables to be studied
• Number of groups & sub-groups proposed
- Special interest in subgroups
• Nature of study (qualitative or quantitative)
- Intensive & continuous or general survey
• Sampling design or type of sample
30
(cont.)
• Intended depth of analysis
• Precision and reliability (acceptable confidence level) required
• Level of non-response (item & unit) expected
• Available finance and other resources (trained investigators)
• Size of population
• Nature of units in the population
• Size of questionnaire
NOTE: Size is more important than the proportion of the population represented by sample
31
Sample Size when Estimating a Percentage
or Proportion
Confidence interval for population proportion, ^p
^P = ^P ±e for infinite population
^P = P ±e √(N-n) / N-1) for finite population
where, P = Sample proportion, q = 1-PN = Size of population, n = Size of sample
e = Acceptable error = Z = √pq/ n
Z = Standard variate= ^P-P / √pq/n (Available from table, for given confidence level)
n = Z2
pq / e2
for infinite population
n = Z2
pqN / (N-1) e2
+ Z2
pq for infinite population
32
Example of Estimating Sample Size
33
References
Abramson JH. 1984. Survey methods in community medicine: An
introduction to epidemiological and evaluative studies. Churchill Livingston.
pp. 70-80.
Booth WC, et. al. The craft of research.2 ed. Chicago: The University of
Chicago Press, 2003.6
Fleiss JL 1981. Statistical methods for rates and proportions. Wiley.
Lavrakas P. 1987. Telephone survey methods: sampling, selection, and
supervision. Sage Publications.
Lemeshow S. et al. 1990. Adequacy of sample size in health studies. Wiley.
Levy PS., Lemeshow, S. 1991. Sampling of populations: methods and
applications. Wiley.
Moser CA, Kalton G. 1986. Survey methods in social investigation. Gower
Publishing Company.
Shuster JJ. 1989. CRC handbook of sample size guidelines for clinical trials.
CRC Press.

More Related Content

PPT
Sampling methods in medical research
PPT
graphic representations in statistics
PPTX
Sampling techniques
PPTX
Sampling designs
PPTX
sampling techniques used in research
PPTX
Probability sampling
PPTX
CLUSTER SAMPLING PPT
Sampling methods in medical research
graphic representations in statistics
Sampling techniques
Sampling designs
sampling techniques used in research
Probability sampling
CLUSTER SAMPLING PPT

What's hot (20)

PPT
Sampling Design
PPTX
Stratified sampling
PPTX
Sampling techniques
PPTX
Non-Probability Sampling
PPTX
STEPS IN SAMPLING PROCESS 5 stepmodel.pptx
PPTX
Sample Size Determination
PPTX
Sampling techniques
PPT
Stratified Random Sampling
PPT
Non sampling error
PPTX
Population vs sample
PPTX
Probability Sampling
PPT
Sample Surveys
PPTX
Non- Probability Sampling & Its Methods
PPSX
Population and Sample
PPTX
DOCX
Assignment sampling techniques
PPTX
PPT
Non Probability Sampling
PPTX
Characteristics of a good sample design & types of sample design
Sampling Design
Stratified sampling
Sampling techniques
Non-Probability Sampling
STEPS IN SAMPLING PROCESS 5 stepmodel.pptx
Sample Size Determination
Sampling techniques
Stratified Random Sampling
Non sampling error
Population vs sample
Probability Sampling
Sample Surveys
Non- Probability Sampling & Its Methods
Population and Sample
Assignment sampling techniques
Non Probability Sampling
Characteristics of a good sample design & types of sample design
Ad

Viewers also liked (20)

PPTX
RESEARCH METHOD - SAMPLING
PPTX
Research Methods: Sampling
PPTX
Sampling Methods in Qualitative and Quantitative Research
PPTX
7 dan 8
PPTX
5. sampling design
PDF
Role of evolutionary psychology in consumer behaviour
PPT
Mba2216 week 09 10 sampling
PPTX
Entrepreneurship and the Entrepreneur
PDF
Research method ch01 introduction
PPTX
No-probilty
PPTX
Research on sampling
PPTX
Non probability sampling methods
PPTX
Types of sampling design
PDF
PDU 202 Qualitative Research Method: Sampling
PPTX
Sampling design
PPT
Topic 10 sample designs & procedures
PPTX
Sampling - Probability Vs Non-Probability
PDF
online consumer psychology
PPT
Sampling design
PPTX
Sampling design
RESEARCH METHOD - SAMPLING
Research Methods: Sampling
Sampling Methods in Qualitative and Quantitative Research
7 dan 8
5. sampling design
Role of evolutionary psychology in consumer behaviour
Mba2216 week 09 10 sampling
Entrepreneurship and the Entrepreneur
Research method ch01 introduction
No-probilty
Research on sampling
Non probability sampling methods
Types of sampling design
PDU 202 Qualitative Research Method: Sampling
Sampling design
Topic 10 sample designs & procedures
Sampling - Probability Vs Non-Probability
online consumer psychology
Sampling design
Sampling design
Ad

Similar to Research method ch06 sampling (20)

PPT
SAMPLING DESIGNthe process of selecting a subset.ppt
PPTX
Sampling biostatistics.pptx
PPTX
2RM2 PPT.pptx
PPT
How to do sampling?
PPTX
Sampling
PPTX
Sample and sampling techniques
PDF
Methods.pdf
PPTX
Chapter 11 - Understanding Sampling.pptx
PPTX
Chapter 11 - Understanding Sampling.pptx
PPTX
Chapter 11 - Understanding Sampling.pptx
PPTX
Sampling
PPTX
4. Sampling.pptx
PPTX
Chapter 6 Selecting a Sample
PPTX
Maneesh (economics)
PPT
13342046.ppt
PDF
Sampling PPT By RG.pdf
PPTX
chapter8-sampling-IoxO.pptx
PPTX
chapter8-sampling-IoxO.pptx
DOC
Sampling Design
PPT
sampling methods definition and types and difference between them
SAMPLING DESIGNthe process of selecting a subset.ppt
Sampling biostatistics.pptx
2RM2 PPT.pptx
How to do sampling?
Sampling
Sample and sampling techniques
Methods.pdf
Chapter 11 - Understanding Sampling.pptx
Chapter 11 - Understanding Sampling.pptx
Chapter 11 - Understanding Sampling.pptx
Sampling
4. Sampling.pptx
Chapter 6 Selecting a Sample
Maneesh (economics)
13342046.ppt
Sampling PPT By RG.pdf
chapter8-sampling-IoxO.pptx
chapter8-sampling-IoxO.pptx
Sampling Design
sampling methods definition and types and difference between them

More from naranbatn (20)

DOCX
эрүүл мэндийн шинжлэх ухаан
DOCX
Instructions to authors
DOCX
Төгсөлтийн сургалтыг зохицуулах журам
DOCX
хичээлийн хуваарь 2011-2012 1-р улирал
DOC
д.амарсайхан захирал
DOC
ц.лхагвасүрэн захирал
DOC
Self eval report english final for printing, 28.09.2011last for printing
PDF
Бүрдүүлэх материал
DOCX
мэргэжлийн индекс
PPTX
Health sciences university of mongolia
DOCX
Магистрын ганцаарчилсан сургалтын төлөвлөгөө
PDF
Germany summer school-2010
PDF
Uni sannio courses
PDF
Germany summer school-2010
PDF
Germany international semester
PDF
Uni sannio courses_faculty_economics
PDF
Ull ph dproposals
PDF
Ull ph dproposals
PDF
Su pd ph-dproposals
PDF
Sannio ph dproposals_v4
эрүүл мэндийн шинжлэх ухаан
Instructions to authors
Төгсөлтийн сургалтыг зохицуулах журам
хичээлийн хуваарь 2011-2012 1-р улирал
д.амарсайхан захирал
ц.лхагвасүрэн захирал
Self eval report english final for printing, 28.09.2011last for printing
Бүрдүүлэх материал
мэргэжлийн индекс
Health sciences university of mongolia
Магистрын ганцаарчилсан сургалтын төлөвлөгөө
Germany summer school-2010
Uni sannio courses
Germany summer school-2010
Germany international semester
Uni sannio courses_faculty_economics
Ull ph dproposals
Ull ph dproposals
Su pd ph-dproposals
Sannio ph dproposals_v4

Recently uploaded (20)

PPT
Teaching material agriculture food technology
PDF
Chapter 3 Spatial Domain Image Processing.pdf
PDF
cuic standard and advanced reporting.pdf
PPTX
Spectroscopy.pptx food analysis technology
DOCX
The AUB Centre for AI in Media Proposal.docx
PDF
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
PPTX
Cloud computing and distributed systems.
PDF
Agricultural_Statistics_at_a_Glance_2022_0.pdf
PDF
Encapsulation_ Review paper, used for researhc scholars
PPTX
Understanding_Digital_Forensics_Presentation.pptx
PDF
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
PDF
Per capita expenditure prediction using model stacking based on satellite ima...
PDF
Encapsulation theory and applications.pdf
PDF
Electronic commerce courselecture one. Pdf
PDF
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
PDF
The Rise and Fall of 3GPP – Time for a Sabbatical?
PDF
Approach and Philosophy of On baking technology
PDF
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
PDF
Review of recent advances in non-invasive hemoglobin estimation
PDF
Diabetes mellitus diagnosis method based random forest with bat algorithm
Teaching material agriculture food technology
Chapter 3 Spatial Domain Image Processing.pdf
cuic standard and advanced reporting.pdf
Spectroscopy.pptx food analysis technology
The AUB Centre for AI in Media Proposal.docx
Architecting across the Boundaries of two Complex Domains - Healthcare & Tech...
Cloud computing and distributed systems.
Agricultural_Statistics_at_a_Glance_2022_0.pdf
Encapsulation_ Review paper, used for researhc scholars
Understanding_Digital_Forensics_Presentation.pptx
Build a system with the filesystem maintained by OSTree @ COSCUP 2025
Per capita expenditure prediction using model stacking based on satellite ima...
Encapsulation theory and applications.pdf
Electronic commerce courselecture one. Pdf
7 ChatGPT Prompts to Help You Define Your Ideal Customer Profile.pdf
The Rise and Fall of 3GPP – Time for a Sabbatical?
Approach and Philosophy of On baking technology
Peak of Data & AI Encore- AI for Metadata and Smarter Workflows
Review of recent advances in non-invasive hemoglobin estimation
Diabetes mellitus diagnosis method based random forest with bat algorithm

Research method ch06 sampling

  • 1. 1 Research Methods in Health Chapter 6. Sampling Design Young Moon Chae, Ph.D. Graduate School of Public Health Yonsei University, Korea ymchae@yuhs.ac
  • 2. 2 Table of Contents • Introduction • Population and sampling • Probability sampling and non-probability sampling • Types of probability sampling • Determination of sample size
  • 3. 3 Introduction to Sampling Design • Selection of some part of an aggregate or totality on the basis of which a judgment or inference (statistical) about the aggregate or totality is made • Items so selected constitute a sample • The selection process or technique is sample design • Survey conducted on the basis of sample is sample survey • A complete enumeration of all the items in the ‘Population’ is called a census inquiry • Census inquiry requires enormous time, money & energy; slightest bias may get magnified when number of observations are increased • It is possible to obtain sufficiently accurate results by studying only a part (a miniature cross section ) of the population
  • 4. 4 Basic Concepts • UNIVERSE: Totality of items or units in any field of enquiry • POPULATION - Total of items about which information is desired aggregate of elementary units (finite or infinite, N) possess at least one common characteristics -real or hypothetical • ELEMENTARY UNITS - Units possessing the relevant characteristics i.e., attributes that are the object of study (operational definition) • SAMPLING FRAME: - A list of all the units of population - Perfect frames seldom exists; elementary units or cluster of such units or the group form the basis for frame for finite population - Frame is either constructed by the researcher or some existing list of population is used - Sampling frame should be a good representative of the population and as far as possible free from (i) Incompleteness (ii) Inaccurateness (iii)Inadequateness (iv) Out-of-date (complete, no missing element, no ineligibles, no element should appear more than once & up to date) • SAMPLING DESIGN: (Target population →Survey population) - A definite plan for obtaining a sample from the sampling frame - Refers to technique or procedure adopted by the researcher
  • 5. 5 Characteristics of a Good Sampling Design • Truly representative (Representativeness is not the property of the sample but of the procedure by which the sample is obtained) • Having small sampling error • Economically viable • Systematic bias is controlled (in a better way) • Results can be applied to the universe in general with a reasonable level of confidence ⇒reliability • Optimum size (adequately large) • Similar to population / universe • Should have all the characteristics that are present in the population / universe
  • 6. 6 Population and Sampling • Sampling is the process of selection of a number of units from a defined study population. • The process of sampling involves: 1. Identification of study population 2. Determination of sampling population 3. Definition of the sampling unit 4. Choice of sampling method 5. Estimation of the sample size
  • 7. 7 Identification of study population • The study or target population is the one upon which the results of the study will be generalized. • It is crucial that the study population is clearly defined, since it is the most important determinant of the sampling population
  • 8. 8 Determination of sampling population • The sampling population is the one from which the sample is drawn. • The definition of the sampling population by the investigator is governed by two factors: - Feasibility: reachable sampling population - External validity: the ability to generalize from the study results to the target population.
  • 9. 9 Aims and Advantages of Sampling • By examining a sample it saves money. Obviously the assessment of a sample of 100 from a population of 1000 will reduce costs considerably not only because the sample consists of fewer people, but also because sampling saves labour - only a smaller number of staff are needed for both the fieldwork and the processing of data. • Sampling also saves time. An estimate of the characteristics of the population can be gathered in a short time. As a result of reduced costs and the time saved through sampling, more attention can be given to each subject hence, increasing the accuracy of the analysis. • Sampling allows us to deduce characteristics of a population while observing only some of its members (the sample). In order to achieve this aim it is necessary to avoid selection bias. • Sampling allows for more accurate measurements − Inspection fatigue is reduced (non-sampling error) − Sampling error can be studied, controlled & probability statement can be made about magnitude
  • 10. 10 Bias in the Sampling • Bias in the selection of a sample can arise if: •The sampling is done by a non-random method, which generally means the selection is influenced by human choice. • The sampling frame (census, phone book) which serves as the basis for selection does not cover the population adequately, completely or accurately. • Groups of the population are not represented because you can't find them or they refuse to participate.
  • 11. 11 Choice of sampling method • Non probability sampling - In non-probability sampling, there is no assurance that each individual has a known chance of being selected • Probability sampling - Probability sampling means that each individual in the population has a known probability or chance of selection into the sample
  • 12. 12 Non-probability Sampling: • Types of non-probability sampling: - Convenience sampling - Quota sampling üSelection based on some basic parameters like age, sex, income, etc. of population so as to make it representative üField workers are assigned quotas of number of units satisfying the required characteristics for collecting data üLooks similar to stratified sampling, but differs in the fact that the discretion of field worker is not found in ‘Stratified sampling’ which makes random sample from each cell üDifficult to obtain an accurate & up to -date proportion of respondents assigned to each cell • Not recommended in medical research - It is by far the most biases sampling procedure as it is not random (not everyone in the population has an equal chance of being selected to participate in the study).
  • 13. 13 Probability Sampling • Assigns equal probability to each unit of the population i.e., every element has equal (non-zero) chance of being selected (equi-probability) • Random does not mean haphazard • Errors of estimation or significance of results obtained can be measured • Best technique for representative sample • Ensures the law of statistical regularity (i.e.. on an average,sample chosen will have the same composition and structure as the universe / population) • Even each combination (possible ) of sample will have equal probability of being picked • All choices are independent of one another
  • 14. 14 Types of Probability Sampling • Simple random sampling • Systematic random sampling • Stratified random sampling • Multi-stage random sampling • Cluster sampling • Multi-phase sampling
  • 15. 15 Simple Random Sampling • In this method, all subject or elements have an equal probability of being selected. There are two major ways of conducting a random sample. • SRS is the basic selection process & all other procedures are viewed as modifications of SRS • The first is to consult a random number table, and the second is to have the computer select a random sample. e.g., Throwing of a fair dice. Probability of getting one is same for each of 20 sample throws and the 20 throws are all independent
  • 17. 17 Advantage • Decreases external validity of the results • Allows for a smaller sample size • Eliminates the influence of confounding factors (cont.)
  • 18. 18 Systematic Random Sampling • A systematic sample is conducted by randomly selecting a first case on a list of the population and then proceeding every Nth case until your sample is selected. This is particularly useful if your list of the population is long. • For example, if your list was the phone book, it would be easiest to start at perhaps the 17th person, and then select every 50th person from that point on.
  • 19. 19 (cont.) • Advantage of systematic sampling − Easier to use & less costlier for large population − Sample is spread more evenly over the entire population − Elements can be ordered in a manner found in the universe − Can be used even without list of units in the population • Disadvantage − If there is hidden periodicity or intrinsic cyclical relationship between elements of the population, it is dangerous e.g.:10 sample days selected from 70 consecutive days for number of issues in a library
  • 20. 20 Stratified Sampling • In a stratified sample, we sample either proportionately or equally to represent various strata or subpopulations. • In the proportional allocation, the same sampling ratio may be used in all strata. That is, the number of individuals chosen in each stratum I proportional to the size of the stratum • If a uniform sampling fraction was used, the sample is ‘self-weighting’ and can for some purposes be treated as if it were a simple random or systematic sample • For example if our strata were cities in a country we would make sure and sample from each of the cities. If our strata were gender, we would sample both men and women.
  • 21. 21 (cont.) Steps for stratified sampling • Divide population into several sub-population or strata which are more homogeneous than population based on factors or characteristics more closely related to the purpose of study, past experience, personal judgment and consideration of relationship between the characteristics of population and characteristics to be estimated. Conduct a pilot study if necessary and examine the variances within and among strata. • Make simple random sampling from each strata - If stratification is implicit in ordering the list, systematic random sampling will serve the purpose - e.g., Students of school by the standard in which studying • Decide on the size of the sample for each strata based on: (I) Size(ii) Variability & (iii) Sampling cost of strata - A uniform sampling fraction: Sample size is proportional to the size, variability & cost ( of sampling ) of strata - A variable sampling fraction : may be appropriate when there is special interest in some sub groups or strata.( Weighting adjustments may be necessary while analyzing data)
  • 22. 22 22
  • 23. 23 Advantage • There is less sampling variation than with simple random or systematic sampling • If the strata are more uniform than the total population with respect to other attributes, it reduces sampling variation with respect to other properties also • The greater the differences between the strata and the less the differences within the strata. The greater is the gain due to stratification • It is possible to compare the differences in mean between the strata by using ANOVA (cont.)
  • 24. 24 Multi-stage Sampling • This method is used in large-scale surveys. A sample of first—stage sampling units is chosen, each of the selected units is divided into second- stage units, samples of second-stage units are selected, and so on • Different methods (simple random, stratified, systematic or cluster sampling) may be used at any stage • The first-stage units may be province, villages, or other aggregations. This method ha the same advantages as cluster sampling – less travel by interviewers, no need for a sampling frame showing all individuals in the population, etc.
  • 26. 26 Cluster Sampling • In cluster sampling we take a random sample of strata and then survey every member of the group. • That is, the sampling units are clusters, and the sampling frame is a list of these clusters. • For example, if our strata were individuals schools in a city, we would randomly select a number of schools and then test all of the students within those schools.
  • 27. 27 27
  • 28. 28 (cont.) • Advantage - Less traveling by interviewers, fewer school teachers to negotiate with - No need for a sampling frame showing all individuals in the population - Better field supervision • Disadvantage - If the clusters contain similar persons (high intra-class correlation), it is difficult to estimate the precision with which generalizations may be made to the parent population - A large number of small clusters is preferable to a small number of large clusters
  • 29. 29 Determination of Sample Size • Nature of universe / population - Heterogeneous or homogenous - Dispersion factor (or variability) • Number of variables to be studied • Number of groups & sub-groups proposed - Special interest in subgroups • Nature of study (qualitative or quantitative) - Intensive & continuous or general survey • Sampling design or type of sample
  • 30. 30 (cont.) • Intended depth of analysis • Precision and reliability (acceptable confidence level) required • Level of non-response (item & unit) expected • Available finance and other resources (trained investigators) • Size of population • Nature of units in the population • Size of questionnaire NOTE: Size is more important than the proportion of the population represented by sample
  • 31. 31 Sample Size when Estimating a Percentage or Proportion Confidence interval for population proportion, ^p ^P = ^P ±e for infinite population ^P = P ±e √(N-n) / N-1) for finite population where, P = Sample proportion, q = 1-PN = Size of population, n = Size of sample e = Acceptable error = Z = √pq/ n Z = Standard variate= ^P-P / √pq/n (Available from table, for given confidence level) n = Z2 pq / e2 for infinite population n = Z2 pqN / (N-1) e2 + Z2 pq for infinite population
  • 33. 33 References Abramson JH. 1984. Survey methods in community medicine: An introduction to epidemiological and evaluative studies. Churchill Livingston. pp. 70-80. Booth WC, et. al. The craft of research.2 ed. Chicago: The University of Chicago Press, 2003.6 Fleiss JL 1981. Statistical methods for rates and proportions. Wiley. Lavrakas P. 1987. Telephone survey methods: sampling, selection, and supervision. Sage Publications. Lemeshow S. et al. 1990. Adequacy of sample size in health studies. Wiley. Levy PS., Lemeshow, S. 1991. Sampling of populations: methods and applications. Wiley. Moser CA, Kalton G. 1986. Survey methods in social investigation. Gower Publishing Company. Shuster JJ. 1989. CRC handbook of sample size guidelines for clinical trials. CRC Press.