SlideShare a Scribd company logo
Stat-3203: Sampling Technique-II
(Chapter-1: PPS sampling)
Md. Menhazul Abedin
Lecturer
Statistics Discipline
Khulna University, Khulna-9208
Email: menhaz70@gmail.com
Main book
Outline…
Background study of PPS sampling or Why
PPS sampling (re45ppview of SRS, Stratified,
Systematic etc)
What is PPS sampling
Sampling unit selection procedure
Estimators (ordered & unordered)
Simple Random Sampling (SRS)
Population
Sample
30
16
1. Homogeneous
2. Equal probability
3. Simple in concept
Simple Random Sampling (SRS)
Procedure of selecting a random sample
Lottery method
Use of random number table
Remind basic concept of estimating mean,
total, variance and their properties.
Different schemes of using random number
table.
Stratified sampling
Strata-1
N1
Strata-2
N2
Strata-3
N2
Strata-4
N2
n1
n3n2 n4
N1+N2+
N3+N4=
N
n1+n2+n
3+n4=n
Stratified sampling
• Heterogeneous
• Do SRS in each stratum
• Calculate mean, total, variance or measuring
statistics for each strata and combine them.
• Study the allocation rules
Equal allocation
Proportional allocation
Neyman allocation
Optimum allocation
• Gain in precision
Systematic Sampling
Systematic Sampling
• Sample selection procedure
Linear systemic sampling
Circular systematic
• Estimate total, mean and variance
• Study their properties
• Gain in precision
Unequal size sample unit
Draw a
sample
of four
garden
Unequal size sample unit
• Architecture= 350 student
• CSE = 400
• URP= 700
• ECE= 300
• Mathematics= 250
• Physics= 130
• Chemistry= 80
• Statistics= 50
Select
three
discipline
How???
SRS??
Stratified??
Systematic??
Ans: No
Probability Proportional to Size(PPS)
• How to draw a sample?
Probability proportional to size(PPS)
• Procedures of selecting a sample with
replacement
Cumulative total method
Lahiri’s method
• Procedures of selecting a sample without
replacement
General selection procedure
Sen-midzuno method
Narain’s scheme of sample selection
Cumulative total method (PPSwr)…
• Sampling procedure
S.N. of
holdings
Size
(Xi)
Cumulative
size
Numbers
associated
1 50 50 1-50
2 30 80 51-80
3 45 125 81-125
4 25 150 126-150
5 40 190 151-190
6 26 216 191-216
7 44 260 217-260
8 35 295 261-295
9 28 323 296-323
10 27 350 324-350
1. Random number less
than equal max
cumulative size (350).
2. Let it 272, it lies
between 261-295. 8th
holding is selected.
3. 346, 165 and 044
random number thus
10th , 5th and 1st holding
selected.
4. 8th , 10th , 5th and 1st
unit makes sample
Cumulative total method (PPSwr)…
• Drawback : This procedure involves writing
down the successive cumulative totals. This is
time consuming and tedious if the number of
units in the population is large.
Lahir’s Method (PPSwr)
N = Number of units; M=Maximum size units
= Size of k th unit
1 - N
l1 - M
k
kX
Select a
random
number
Accept k th unit if (k, l < )
Reject k th unit if (k, l > )
kX
kX
Lahir’s Method(1951)
• Referring to the random number table, the pair
is (10, 13). Hence the 10 th unit is selected in
the sample.
• Similarly, choosing other pairs, we can have
(4, 26), (5, 35), (7,26). (4, 26) rejected. Why???
• Another pair (8, 16) .
• Sample is 10, 5, 7 and 8 th unit
Lahir’s Method(1951)
• Advantage:
– It does not require writing down all cumulative
totals for each unit.
– Sizes of all the units need not be known before
hand. We need only some number greater than the
maximum size and the sizes of those units which
are selected by the choice of the first set of random
numbers 1 to N for drawing sample under this
scheme.
Lahir’s Method(1951)
• Disadvantage:
– It results in the wastage of time and efforts if units
get rejected. The probability of rejection 1 −
𝑋
𝑀
.
• The expected numbers of draws required to draw one
unit
𝑀
𝑋
.
• This number is large if 𝑀 is much larger than 𝑋
Journey: Sample to Population
• Total, Mean, Variance
Sample
Sample
mean
Sample
variance
Sample
total
Population
Population
mean
Population
variance
Population
total
• Sample total/mean with unbiased/biased
estimator pop total/mean having population
variance.
• Sample variance unbiased/biased estimator of
pop variance
Journey: Sample to Population
𝐸 𝑡 /𝑚 = 𝑇 /𝑀 with variance 𝑉.
and 𝐸 𝑣 = 𝑉
𝐸 𝑡 /𝑚 = 𝑇 /𝑀 with variance 𝑣
Estimators Sample Population
Total 𝑡 𝑇
Mean 𝑚 𝑀
Variance 𝑣 𝑉
Expectation
Sample-1 Sample-2 Sample-3 Sample-k
𝑠1 𝑠2 𝑠3 𝑠 𝑘
𝐸𝑥𝑝𝑒𝑐𝑡𝑎𝑡𝑖𝑜𝑛 𝐸[𝑠] =
1
𝑘
𝑖=1
𝑘
𝑠𝑖
𝑠𝑖 be any
statistic like
mean
variance,
standard
deviation
… … …
… … …
𝑃𝑜𝑝 𝑛
𝑠𝑖𝑧𝑒 = 𝑁
Sample size=n
Sample
𝑁
𝑛
= 𝑘
IID random variables
𝑥1 𝑥2 𝑥 𝑛𝑥3
𝐷𝑖𝑠𝑡 𝑛 𝐷𝑖𝑠𝑡 𝑛 𝐷𝑖𝑠𝑡 𝑛 𝐷𝑖𝑠𝑡 𝑛
Look like twin but not.
They comes from different
mother.
IID
random
variables
… … …
Defining random variale…
• 𝑦𝑖 = Value of the characreristics under study
(𝑦𝑖 ?? ambiguity?? Next slide)
• 𝑁 = Population size
• 𝑝𝑖 = 𝑋𝑖/𝑋
• 𝑧𝑖 =
𝑦 𝑖
𝑝 𝑖
; 𝑖 = 1, 2, 3, … , 𝑛 IID
random variable....... Why ????
• 𝑝𝑖 =
1
𝑁
Simple Random Sampling
Example 5.3
• Selected sample (cummulative/Lahiri’s method)
Area under
Crop
5.2 5.9 3.9 4.2 4.7 4.8 4.9 6.8 4.7 5.7
Yield of crop 28 29 30 22 24 25 28 37 26 32
Area under
Crop
5.2 5.2 4.9 4.0 1.3 7.4 7.4 4.8 6.2 6.2
Yield of crop 25 38 31 16 6 61 61 29 47 47
Size (X)
Value of the
characteristic under
study (Y)
(N=100, n=20)
Estimators…
Theorem 5.3.1: In pps sampling, wr, an unbiased
estimator of the population total Y is given by
𝑌𝑝𝑝𝑠 =
1
𝑛
1
𝑛
(𝑦𝑖/𝑝𝑖)
With its sampling variance
𝑉( 𝑌𝑝𝑝𝑠) =
1
𝑛 1
𝑁
𝑝𝑖 (
𝑦 𝑖
𝑝 𝑖
− 𝑌)2
*** find unbiased estimator of mean....
***See corollary
Estimators…
• Theorem: In pps sampling, wr, an unbiased
estimator of 𝑉( 𝑌𝑝𝑝𝑠)is given by
• 𝑣( 𝑌𝑝𝑝𝑠) =
1
𝑛(𝑛−1) 1
𝑛
(
𝑦 𝑖
𝑝 𝑖
− 𝑌𝑝𝑝𝑠)2
=
1
𝑛(𝑛−1)
[ 1
𝑛
(
𝑦 𝑖
𝑝 𝑖
)2
−𝑛 𝑌𝑝𝑝𝑠
2
]
Gain due to pps sampling...
• Study gain due to PPS sampling with
replacement
Example 5.3
• 𝑦𝑝𝑝𝑠 =
1
𝑛𝑁 1
𝑛
(𝑦𝑖/𝑝𝑖) =
𝑋
𝑛𝑁 1
𝑛
(𝑦𝑖/𝑥𝑖) =
484.5
20 ∗ 100
∗ 120.5930 = 29.11
• 𝑣( 𝑦𝑝𝑝𝑠) =
1
𝑛 𝑛−1 𝑁2 [ 1
𝑛 𝑦 𝑖
𝑝 𝑖
2
− 𝑛 𝑌𝑝𝑝𝑠
2
] =
1
20∗19∗100∗100
171249828.1 − 20 ∗ 155785427.3
= 4.06957916 ≅ 4
• Stadard error= 𝑣( 𝑦𝑝𝑝𝑠 = 4 = 2
PPS Sampling Without
Replacement
PPS Sampling WoR
• It is difficult to draw a PPS sample without
replacement. Over 50 methods have been
proposed but none is perfect.
Techniques…
• General selection procedure
• Sen-Midzuno sample selection
• Narain’s scheme of sampe selection
• Systematic PPS method (Madow (1949)
• Durbin (1967) method
Our interest
PPS sampling WoR
• General selection procedure 𝑝𝑖 = 𝑋𝑖/𝑋
Select a pair of random numbers 𝑖, 𝑗 𝑠. 𝑡. ( 𝑖 ≤
Orchard 1 2 3 4 5 6 7 8
Trees 50 30 25 40 26 44 20 35
Orchard 1 2 3 4 Blank 5 6 7
Trees 50 30 25 40 44 20 35
PPS sampling without replaceent…
• The first order incluson probability for unit 𝑖 is
the probability that 𝑖 is included in a sample of
size n and is given by
𝜋𝑖 = 𝑠∋𝑖 𝑝(𝑠) .
• The second order inclusion probability for unit 𝑖
and 𝑗 is defined as the probability that the two
units 𝑖 and 𝑗 are included in a sample of size n
𝜋𝑖𝑗 = 𝑠∋𝑖,𝑗 𝑝(𝑠) .
Example
• A={1,2,3}
• 𝑠1 ={1,2}, 𝑠2 ={1,3}, 𝑠3 ={2,3}
• 𝑝(𝑠1) =
1
3
𝑝(𝑠2) =
1
3
𝑝(𝑠3) =
1
3
• 𝜋1 =
1
3
+
1
3
=
2
3
𝜋2 =
1
3
+
1
3
=
2
3
• 𝜋3 =
1
3
+
1
3
=
2
3
• 𝜋1 + 𝜋2 + 𝜋3 =
2
3
+
2
3
+
2
3
= 2
Property: Inclusion probability
• 𝑖=1
𝑁
𝜋𝑖 = 𝑛
• 𝑗=1
𝑁
𝜋𝑖𝑗 = (𝑛 − 1) 𝜋𝑖
• 𝑖=1
𝑁
𝑗=1 𝑖≠𝑗
𝑁
𝜋𝑖𝑗 = 𝑛 − 1 𝑛
Sen-Midzuno…
• First unit from N sized population
Without replacement
• (n-1)unit from remaining (N-1)
Simple random sampling
𝜋𝑖 = 𝑝𝑖 + 1 − 𝑝𝑖
𝑛−1
𝑁−1
=
𝑁−𝑛
𝑁−1
𝑝𝑖 +
𝑛−1
𝑁−1
,
1 ≤ 𝑖 ≤ 𝑁
𝜋𝑖𝑗 = 𝑝𝑖
𝑛−1
𝑁−1
+ 𝑝𝑗
𝑛−1
𝑁−1
+ (1 − 𝑝𝑖 − 𝑝𝑖)
𝑛−1
𝑁−1
𝑛−2
𝑁−2
1 ≤ 𝑖 ≠ 𝑗 ≤ 𝑁
Sen-Midzuno…
• Higher order inclusion probabilities
• 𝜋𝑖𝑗…𝑞 =
1
𝑁−1
𝑛−1
(𝑝𝑖 + 𝑝𝑗 + ⋯ + 𝑝 𝑞)
Ordered and unordered estimator
• Ordred estimator: Incorporate sampling unit’s
order. Need only conditional probability not
inclusion probability.
• Unordered estimator: Free from order concept
of sampling unit’s ordes. Incorporate inclusion
probability.
Ordered and unordered estimator
• Das-Raj’s ordered estimator→ No need
inclusion probability
• Horvitz-Thompson estimator
(H-T estimator)
• Murthy’s estimator
Unordered
estimator need
inclusion
probability
Das-Raj ordered estimator(n=2)
• 𝑦1 → 𝑝1 and 𝑦2 → 𝑝2
[Initial probabilities]
• Define two random variable
– 𝑧1 =
𝑦1
𝑝1
– 𝑧2 = 𝑦1 + 𝑦2(1 − 𝑝1)/𝑝2
• Des-Raj’s total
– 𝑌𝐷 =(𝑧1 + 𝑧2)/2 =
𝑦1 1+𝑝1
𝑝1
+
𝑦2 1−𝑝1
𝑝2
Des-Raj ordered estimator(n=2)
• Theorem 5.8.1 In PPS sampling, wor, the estimator 𝑌𝐷
isan unbiased estimator andits sampling variance is
given by
𝑉 𝑌𝐷
= 1 −
1
2
𝑖
𝑁
𝑝𝑖
2
1
2
𝑖
𝑁
𝑦𝑖
𝑝𝑖
− 𝑌
2
𝑝𝑖
−
1
4
𝑖
𝑁
𝑦𝑖
𝑝𝑖
− 𝑌
2
𝑝𝑖
2
Also find the unbiased estimator of variance.
Unordered Estimator (H-T Estimator)
• Inclusion probability calculation
• Define unbiased estimator of total
• Its variance
Theorem 5.9.1
Any ambiguity
Thanks

More Related Content

PDF
Contingency table
PPT
analysis plan.ppt
PPTX
probability proportional to size.pptx.By Rc
PPTX
Sample determinants and size
PDF
Basic Biostatistics and Data managment
PDF
Sampling methods and sample size
PDF
Statistical Estimation and Testing Lecture Notes.pdf
PPT
Chi-square, Yates, Fisher & McNemar
Contingency table
analysis plan.ppt
probability proportional to size.pptx.By Rc
Sample determinants and size
Basic Biostatistics and Data managment
Sampling methods and sample size
Statistical Estimation and Testing Lecture Notes.pdf
Chi-square, Yates, Fisher & McNemar

What's hot (20)

PPTX
Stat 3203 -cluster and multi-stage sampling
PPTX
Stat 3203 -multphase sampling
PDF
Normality tests
PPT
SURVIVAL ANALYSIS.ppt
PPTX
sampling simple random sampling
PPT
Measure of Dispersion
PPT
Regression analysis ppt
PPTX
Systematic ranom sampling for slide share
PDF
Introduction to Generalized Linear Models
PPTX
Point and Interval Estimation
PPTX
Multinomial Logistic Regression Analysis
PPTX
Basics of Educational Statistics (Inferential statistics)
PPTX
Statistical inference
PPTX
Introduction to Statistics - Basic concepts
PPTX
Testing of hypotheses
PPT
T test statistics
PPTX
Lesson 2 stationary_time_series
PDF
Kaplan meier survival curves and the log-rank test
PPTX
Lecture 6. univariate and bivariate analysis
PDF
Categorical data analysis
Stat 3203 -cluster and multi-stage sampling
Stat 3203 -multphase sampling
Normality tests
SURVIVAL ANALYSIS.ppt
sampling simple random sampling
Measure of Dispersion
Regression analysis ppt
Systematic ranom sampling for slide share
Introduction to Generalized Linear Models
Point and Interval Estimation
Multinomial Logistic Regression Analysis
Basics of Educational Statistics (Inferential statistics)
Statistical inference
Introduction to Statistics - Basic concepts
Testing of hypotheses
T test statistics
Lesson 2 stationary_time_series
Kaplan meier survival curves and the log-rank test
Lecture 6. univariate and bivariate analysis
Categorical data analysis
Ad

Similar to Stat 3203 -pps sampling (20)

PDF
Sarjinder singh
PPTX
Sampling techniques NEEDED FOR STUDY DESIGNS
PPTX
Sampling Technique - Anish
PPTX
sampling technique
PPTX
Sampling and Sampling Distributions
PPTX
Sampling distribution concepts
DOCX
HW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docx
PPT
SAMPLING methods d p singh .ppt
PDF
Inferential Statistics
PPTX
Sampling_Distribution_stat_of_Mean_New.pptx
PDF
Sampling Theory Part 1
PPTX
Chapter three and - Economics-introduction
PPTX
Basic of Statistical Inference Part-I
PPTX
Lesson-3-Sources-of-Data-and-Sampling-Procedures-1.pptx
PDF
Chapter One-converted business research .pdf
PPTX
Sampling and Central Limit Theorem_18_01_23 new.pptx
PPTX
5 Introduction to elementary sampling theory.pptx
PPTX
Sampling, Census
PDF
Cluster Sampling
PDF
Sampling.pdf research methodology in sampling
Sarjinder singh
Sampling techniques NEEDED FOR STUDY DESIGNS
Sampling Technique - Anish
sampling technique
Sampling and Sampling Distributions
Sampling distribution concepts
HW1_STAT206.pdfStatistical Inference II J. Lee Assignment.docx
SAMPLING methods d p singh .ppt
Inferential Statistics
Sampling_Distribution_stat_of_Mean_New.pptx
Sampling Theory Part 1
Chapter three and - Economics-introduction
Basic of Statistical Inference Part-I
Lesson-3-Sources-of-Data-and-Sampling-Procedures-1.pptx
Chapter One-converted business research .pdf
Sampling and Central Limit Theorem_18_01_23 new.pptx
5 Introduction to elementary sampling theory.pptx
Sampling, Census
Cluster Sampling
Sampling.pdf research methodology in sampling
Ad

More from Khulna University (9)

PPTX
Stat 2153 Introduction to Queiueng Theory
PPTX
Stat 2153 Stochastic Process and Markov chain
PPTX
Stat 3203 -sampling errors and non-sampling errors
PPTX
Ds 2251 -_hypothesis test
PPTX
Stat 1163 -statistics in environmental science
PPTX
Stat 1163 -correlation and regression
PPTX
Introduction to matlab
PPTX
Different kind of distance and Statistical Distance
PPTX
Regression and Classification: An Artificial Neural Network Approach
Stat 2153 Introduction to Queiueng Theory
Stat 2153 Stochastic Process and Markov chain
Stat 3203 -sampling errors and non-sampling errors
Ds 2251 -_hypothesis test
Stat 1163 -statistics in environmental science
Stat 1163 -correlation and regression
Introduction to matlab
Different kind of distance and Statistical Distance
Regression and Classification: An Artificial Neural Network Approach

Recently uploaded (20)

PPTX
Pharma ospi slides which help in ospi learning
PDF
Business Ethics Teaching Materials for college
PPTX
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
PDF
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
PPTX
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
PDF
RMMM.pdf make it easy to upload and study
PDF
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
PDF
Basic Mud Logging Guide for educational purpose
PDF
STATICS OF THE RIGID BODIES Hibbelers.pdf
PPTX
Cell Types and Its function , kingdom of life
PDF
O7-L3 Supply Chain Operations - ICLT Program
PDF
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
PDF
2.FourierTransform-ShortQuestionswithAnswers.pdf
PDF
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
PPTX
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
PDF
Insiders guide to clinical Medicine.pdf
PPTX
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
PDF
Anesthesia in Laparoscopic Surgery in India
PDF
Module 4: Burden of Disease Tutorial Slides S2 2025
PDF
Pre independence Education in Inndia.pdf
Pharma ospi slides which help in ospi learning
Business Ethics Teaching Materials for college
BOWEL ELIMINATION FACTORS AFFECTING AND TYPES
grade 11-chemistry_fetena_net_5883.pdf teacher guide for all student
school management -TNTEU- B.Ed., Semester II Unit 1.pptx
RMMM.pdf make it easy to upload and study
Mark Klimek Lecture Notes_240423 revision books _173037.pdf
Basic Mud Logging Guide for educational purpose
STATICS OF THE RIGID BODIES Hibbelers.pdf
Cell Types and Its function , kingdom of life
O7-L3 Supply Chain Operations - ICLT Program
3rd Neelam Sanjeevareddy Memorial Lecture.pdf
2.FourierTransform-ShortQuestionswithAnswers.pdf
BÀI TẬP BỔ TRỢ 4 KỸ NĂNG TIẾNG ANH 9 GLOBAL SUCCESS - CẢ NĂM - BÁM SÁT FORM Đ...
IMMUNITY IMMUNITY refers to protection against infection, and the immune syst...
Insiders guide to clinical Medicine.pdf
Introduction to Child Health Nursing – Unit I | Child Health Nursing I | B.Sc...
Anesthesia in Laparoscopic Surgery in India
Module 4: Burden of Disease Tutorial Slides S2 2025
Pre independence Education in Inndia.pdf

Stat 3203 -pps sampling

  • 1. Stat-3203: Sampling Technique-II (Chapter-1: PPS sampling) Md. Menhazul Abedin Lecturer Statistics Discipline Khulna University, Khulna-9208 Email: menhaz70@gmail.com
  • 3. Outline… Background study of PPS sampling or Why PPS sampling (re45ppview of SRS, Stratified, Systematic etc) What is PPS sampling Sampling unit selection procedure Estimators (ordered & unordered)
  • 4. Simple Random Sampling (SRS) Population Sample 30 16 1. Homogeneous 2. Equal probability 3. Simple in concept
  • 5. Simple Random Sampling (SRS) Procedure of selecting a random sample Lottery method Use of random number table Remind basic concept of estimating mean, total, variance and their properties. Different schemes of using random number table.
  • 7. Stratified sampling • Heterogeneous • Do SRS in each stratum • Calculate mean, total, variance or measuring statistics for each strata and combine them. • Study the allocation rules Equal allocation Proportional allocation Neyman allocation Optimum allocation • Gain in precision
  • 9. Systematic Sampling • Sample selection procedure Linear systemic sampling Circular systematic • Estimate total, mean and variance • Study their properties • Gain in precision
  • 10. Unequal size sample unit Draw a sample of four garden
  • 11. Unequal size sample unit • Architecture= 350 student • CSE = 400 • URP= 700 • ECE= 300 • Mathematics= 250 • Physics= 130 • Chemistry= 80 • Statistics= 50 Select three discipline How??? SRS?? Stratified?? Systematic?? Ans: No
  • 12. Probability Proportional to Size(PPS) • How to draw a sample?
  • 13. Probability proportional to size(PPS) • Procedures of selecting a sample with replacement Cumulative total method Lahiri’s method • Procedures of selecting a sample without replacement General selection procedure Sen-midzuno method Narain’s scheme of sample selection
  • 14. Cumulative total method (PPSwr)… • Sampling procedure S.N. of holdings Size (Xi) Cumulative size Numbers associated 1 50 50 1-50 2 30 80 51-80 3 45 125 81-125 4 25 150 126-150 5 40 190 151-190 6 26 216 191-216 7 44 260 217-260 8 35 295 261-295 9 28 323 296-323 10 27 350 324-350 1. Random number less than equal max cumulative size (350). 2. Let it 272, it lies between 261-295. 8th holding is selected. 3. 346, 165 and 044 random number thus 10th , 5th and 1st holding selected. 4. 8th , 10th , 5th and 1st unit makes sample
  • 15. Cumulative total method (PPSwr)… • Drawback : This procedure involves writing down the successive cumulative totals. This is time consuming and tedious if the number of units in the population is large.
  • 16. Lahir’s Method (PPSwr) N = Number of units; M=Maximum size units = Size of k th unit 1 - N l1 - M k kX Select a random number Accept k th unit if (k, l < ) Reject k th unit if (k, l > ) kX kX
  • 17. Lahir’s Method(1951) • Referring to the random number table, the pair is (10, 13). Hence the 10 th unit is selected in the sample. • Similarly, choosing other pairs, we can have (4, 26), (5, 35), (7,26). (4, 26) rejected. Why??? • Another pair (8, 16) . • Sample is 10, 5, 7 and 8 th unit
  • 18. Lahir’s Method(1951) • Advantage: – It does not require writing down all cumulative totals for each unit. – Sizes of all the units need not be known before hand. We need only some number greater than the maximum size and the sizes of those units which are selected by the choice of the first set of random numbers 1 to N for drawing sample under this scheme.
  • 19. Lahir’s Method(1951) • Disadvantage: – It results in the wastage of time and efforts if units get rejected. The probability of rejection 1 − 𝑋 𝑀 . • The expected numbers of draws required to draw one unit 𝑀 𝑋 . • This number is large if 𝑀 is much larger than 𝑋
  • 20. Journey: Sample to Population • Total, Mean, Variance Sample Sample mean Sample variance Sample total Population Population mean Population variance Population total • Sample total/mean with unbiased/biased estimator pop total/mean having population variance. • Sample variance unbiased/biased estimator of pop variance
  • 21. Journey: Sample to Population 𝐸 𝑡 /𝑚 = 𝑇 /𝑀 with variance 𝑉. and 𝐸 𝑣 = 𝑉 𝐸 𝑡 /𝑚 = 𝑇 /𝑀 with variance 𝑣 Estimators Sample Population Total 𝑡 𝑇 Mean 𝑚 𝑀 Variance 𝑣 𝑉
  • 22. Expectation Sample-1 Sample-2 Sample-3 Sample-k 𝑠1 𝑠2 𝑠3 𝑠 𝑘 𝐸𝑥𝑝𝑒𝑐𝑡𝑎𝑡𝑖𝑜𝑛 𝐸[𝑠] = 1 𝑘 𝑖=1 𝑘 𝑠𝑖 𝑠𝑖 be any statistic like mean variance, standard deviation … … … … … … 𝑃𝑜𝑝 𝑛 𝑠𝑖𝑧𝑒 = 𝑁 Sample size=n Sample 𝑁 𝑛 = 𝑘
  • 23. IID random variables 𝑥1 𝑥2 𝑥 𝑛𝑥3 𝐷𝑖𝑠𝑡 𝑛 𝐷𝑖𝑠𝑡 𝑛 𝐷𝑖𝑠𝑡 𝑛 𝐷𝑖𝑠𝑡 𝑛 Look like twin but not. They comes from different mother. IID random variables … … …
  • 24. Defining random variale… • 𝑦𝑖 = Value of the characreristics under study (𝑦𝑖 ?? ambiguity?? Next slide) • 𝑁 = Population size • 𝑝𝑖 = 𝑋𝑖/𝑋 • 𝑧𝑖 = 𝑦 𝑖 𝑝 𝑖 ; 𝑖 = 1, 2, 3, … , 𝑛 IID random variable....... Why ???? • 𝑝𝑖 = 1 𝑁 Simple Random Sampling
  • 25. Example 5.3 • Selected sample (cummulative/Lahiri’s method) Area under Crop 5.2 5.9 3.9 4.2 4.7 4.8 4.9 6.8 4.7 5.7 Yield of crop 28 29 30 22 24 25 28 37 26 32 Area under Crop 5.2 5.2 4.9 4.0 1.3 7.4 7.4 4.8 6.2 6.2 Yield of crop 25 38 31 16 6 61 61 29 47 47 Size (X) Value of the characteristic under study (Y) (N=100, n=20)
  • 26. Estimators… Theorem 5.3.1: In pps sampling, wr, an unbiased estimator of the population total Y is given by 𝑌𝑝𝑝𝑠 = 1 𝑛 1 𝑛 (𝑦𝑖/𝑝𝑖) With its sampling variance 𝑉( 𝑌𝑝𝑝𝑠) = 1 𝑛 1 𝑁 𝑝𝑖 ( 𝑦 𝑖 𝑝 𝑖 − 𝑌)2 *** find unbiased estimator of mean.... ***See corollary
  • 27. Estimators… • Theorem: In pps sampling, wr, an unbiased estimator of 𝑉( 𝑌𝑝𝑝𝑠)is given by • 𝑣( 𝑌𝑝𝑝𝑠) = 1 𝑛(𝑛−1) 1 𝑛 ( 𝑦 𝑖 𝑝 𝑖 − 𝑌𝑝𝑝𝑠)2 = 1 𝑛(𝑛−1) [ 1 𝑛 ( 𝑦 𝑖 𝑝 𝑖 )2 −𝑛 𝑌𝑝𝑝𝑠 2 ]
  • 28. Gain due to pps sampling... • Study gain due to PPS sampling with replacement
  • 29. Example 5.3 • 𝑦𝑝𝑝𝑠 = 1 𝑛𝑁 1 𝑛 (𝑦𝑖/𝑝𝑖) = 𝑋 𝑛𝑁 1 𝑛 (𝑦𝑖/𝑥𝑖) = 484.5 20 ∗ 100 ∗ 120.5930 = 29.11 • 𝑣( 𝑦𝑝𝑝𝑠) = 1 𝑛 𝑛−1 𝑁2 [ 1 𝑛 𝑦 𝑖 𝑝 𝑖 2 − 𝑛 𝑌𝑝𝑝𝑠 2 ] = 1 20∗19∗100∗100 171249828.1 − 20 ∗ 155785427.3 = 4.06957916 ≅ 4 • Stadard error= 𝑣( 𝑦𝑝𝑝𝑠 = 4 = 2
  • 31. PPS Sampling WoR • It is difficult to draw a PPS sample without replacement. Over 50 methods have been proposed but none is perfect.
  • 32. Techniques… • General selection procedure • Sen-Midzuno sample selection • Narain’s scheme of sampe selection • Systematic PPS method (Madow (1949) • Durbin (1967) method Our interest
  • 33. PPS sampling WoR • General selection procedure 𝑝𝑖 = 𝑋𝑖/𝑋 Select a pair of random numbers 𝑖, 𝑗 𝑠. 𝑡. ( 𝑖 ≤ Orchard 1 2 3 4 5 6 7 8 Trees 50 30 25 40 26 44 20 35 Orchard 1 2 3 4 Blank 5 6 7 Trees 50 30 25 40 44 20 35
  • 34. PPS sampling without replaceent… • The first order incluson probability for unit 𝑖 is the probability that 𝑖 is included in a sample of size n and is given by 𝜋𝑖 = 𝑠∋𝑖 𝑝(𝑠) . • The second order inclusion probability for unit 𝑖 and 𝑗 is defined as the probability that the two units 𝑖 and 𝑗 are included in a sample of size n 𝜋𝑖𝑗 = 𝑠∋𝑖,𝑗 𝑝(𝑠) .
  • 35. Example • A={1,2,3} • 𝑠1 ={1,2}, 𝑠2 ={1,3}, 𝑠3 ={2,3} • 𝑝(𝑠1) = 1 3 𝑝(𝑠2) = 1 3 𝑝(𝑠3) = 1 3 • 𝜋1 = 1 3 + 1 3 = 2 3 𝜋2 = 1 3 + 1 3 = 2 3 • 𝜋3 = 1 3 + 1 3 = 2 3 • 𝜋1 + 𝜋2 + 𝜋3 = 2 3 + 2 3 + 2 3 = 2
  • 36. Property: Inclusion probability • 𝑖=1 𝑁 𝜋𝑖 = 𝑛 • 𝑗=1 𝑁 𝜋𝑖𝑗 = (𝑛 − 1) 𝜋𝑖 • 𝑖=1 𝑁 𝑗=1 𝑖≠𝑗 𝑁 𝜋𝑖𝑗 = 𝑛 − 1 𝑛
  • 37. Sen-Midzuno… • First unit from N sized population Without replacement • (n-1)unit from remaining (N-1) Simple random sampling 𝜋𝑖 = 𝑝𝑖 + 1 − 𝑝𝑖 𝑛−1 𝑁−1 = 𝑁−𝑛 𝑁−1 𝑝𝑖 + 𝑛−1 𝑁−1 , 1 ≤ 𝑖 ≤ 𝑁 𝜋𝑖𝑗 = 𝑝𝑖 𝑛−1 𝑁−1 + 𝑝𝑗 𝑛−1 𝑁−1 + (1 − 𝑝𝑖 − 𝑝𝑖) 𝑛−1 𝑁−1 𝑛−2 𝑁−2 1 ≤ 𝑖 ≠ 𝑗 ≤ 𝑁
  • 38. Sen-Midzuno… • Higher order inclusion probabilities • 𝜋𝑖𝑗…𝑞 = 1 𝑁−1 𝑛−1 (𝑝𝑖 + 𝑝𝑗 + ⋯ + 𝑝 𝑞)
  • 39. Ordered and unordered estimator • Ordred estimator: Incorporate sampling unit’s order. Need only conditional probability not inclusion probability. • Unordered estimator: Free from order concept of sampling unit’s ordes. Incorporate inclusion probability.
  • 40. Ordered and unordered estimator • Das-Raj’s ordered estimator→ No need inclusion probability • Horvitz-Thompson estimator (H-T estimator) • Murthy’s estimator Unordered estimator need inclusion probability
  • 41. Das-Raj ordered estimator(n=2) • 𝑦1 → 𝑝1 and 𝑦2 → 𝑝2 [Initial probabilities] • Define two random variable – 𝑧1 = 𝑦1 𝑝1 – 𝑧2 = 𝑦1 + 𝑦2(1 − 𝑝1)/𝑝2 • Des-Raj’s total – 𝑌𝐷 =(𝑧1 + 𝑧2)/2 = 𝑦1 1+𝑝1 𝑝1 + 𝑦2 1−𝑝1 𝑝2
  • 42. Des-Raj ordered estimator(n=2) • Theorem 5.8.1 In PPS sampling, wor, the estimator 𝑌𝐷 isan unbiased estimator andits sampling variance is given by 𝑉 𝑌𝐷 = 1 − 1 2 𝑖 𝑁 𝑝𝑖 2 1 2 𝑖 𝑁 𝑦𝑖 𝑝𝑖 − 𝑌 2 𝑝𝑖 − 1 4 𝑖 𝑁 𝑦𝑖 𝑝𝑖 − 𝑌 2 𝑝𝑖 2 Also find the unbiased estimator of variance.
  • 43. Unordered Estimator (H-T Estimator) • Inclusion probability calculation • Define unbiased estimator of total • Its variance Theorem 5.9.1