Create a quadratic form's matrix to represent a variance estimator for PPSWOR designs, based on commonly-used approximations
For designs that use unequal probability sampling without replacement (i.e., PPSWOR), variance estimation tends to be more accurate when using an approximation estimator that uses only the first-order inclusion probabilities (i.e., the reciprocals of the basic sampling weights) and ignores the joint inclusion probabilities. This function returns the matrix of the quadratic form used to represent such variance estimators.
probs: A vector of first-order inclusion probabilities
method: A string specifying the approximation method to use. See the "Details" section below. Options include:
"Deville-1"
"Deville-2"
"Beaumont-Emond"
Returns
A symmetric matrix whose dimension matches the length of probs.
Deville's Estimators
The "Deville-1" and "Deville-2" approximations have been shown to be effective for designs that use a fixed sample size with a high-entropy sampling method. This includes most PPSWOR sampling methods, but unequal-probability systematic sampling is an important exception.
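Deville's variance estimators generally take the following form:
$$
\hat{v}(\hat{Y}) = \sum_{i=1}^{n} c_i \left( \breve{y}_i - \frac{1}{\sum_{k=1}^{n} c_k} \sum_{k=1}^{n} c_k \breve{y}_k \right)^2
$$
where $\breve{y}_i = y_i / \pi_i$ is the weighted value of the variable of interest, and $c_i$ are constants that depend on the approximation method used.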
The matrix of the quadratic form, denoted Σ, has its ij-th entry defined as follows:
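$$
\sigma_{ij} =
\begin{cases}
c_i \left( 1 - \dfrac{c_i}{\sum_{k=1}^{n} c_k} \right) & \text{when } i = j \\
-\dfrac{c_i c_j}{\sum_{k=1}^{n} c_k} & \text{when } i \neq j
\end{cases}
$$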
When $\pi_i = 1$ for every unit, then $\sigma_{ij} = 0$ for all $i, j$. If there is only one sampling unit, then $\sigma_{11} = 0$; that is, the unit is treated as if it was sampled with certainty.
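As a rough illustration of the entries described above, the following plain-Python sketch builds the matrix from a vector of constants and compares the resulting quadratic form with a c-weighted sum of squared deviations. The function names and the example numbers are hypothetical and are not taken from the package; the matrix entries assumed are c_i (1 - c_i / sum(c)) on the diagonal and -c_i c_j / sum(c) off the diagonal.

```python
def deville_quadratic_form_matrix(c):
    """Hypothetical helper: sigma_ii = c_i * (1 - c_i / sum(c)),
    sigma_ij = -c_i * c_j / sum(c) for i != j."""
    n = len(c)
    total = sum(c)
    # If every constant is zero (e.g., all units sampled with certainty),
    # every entry is zero, so return a zero matrix and avoid dividing by zero.
    if total == 0:
        return [[0.0] * n for _ in range(n)]
    sigma = [[-c[i] * c[j] / total for j in range(n)] for i in range(n)]
    for i in range(n):
        sigma[i][i] = c[i] * (1.0 - c[i] / total)
    return sigma


def quadratic_form(sigma, y_breve):
    """Evaluate y_breve' * sigma * y_breve with explicit loops."""
    n = len(y_breve)
    return sum(y_breve[i] * sigma[i][j] * y_breve[j]
               for i in range(n) for j in range(n))


def weighted_sum_of_squares(c, y_breve):
    """Sum of c_i * (y_breve_i - c-weighted mean of y_breve)^2."""
    total = sum(c)
    center = sum(ck * yk for ck, yk in zip(c, y_breve)) / total
    return sum(ci * (yi - center) ** 2 for ci, yi in zip(c, y_breve))


# Illustrative inputs only (not from any real survey).
c = [0.9, 0.6, 0.75, 0.3]
y_breve = [10.0, 12.5, 8.0, 20.0]

sigma = deville_quadratic_form_matrix(c)
v_matrix = quadratic_form(sigma, y_breve)
v_sum = weighted_sum_of_squares(c, y_breve)
```

In this sketch `v_matrix` and `v_sum` agree up to floating-point rounding, and each row of `sigma` sums to zero, which is why a single sampling unit always yields a zero entry.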
The constants $c_i$ are defined for each approximation method as follows, with the names taken directly from Matei and Tillé (2005).
"Deville-1" :
$$
c_i = (1 - \pi_i) \frac{n}{n - 1}
$$
"Deville-2" :
$$
c_i = (1 - \pi_i) \left[ 1 - \sum_{k=1}^{n} \left( \frac{1 - \pi_k}{\sum_{l=1}^{n} (1 - \pi_l)} \right)^2 \right]^{-1}
$$
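The two sets of constants can be sketched in a few lines of plain Python. The helper name `deville_constants` is hypothetical, and the handling of degenerate inputs (fewer than two units, or all probabilities equal to 1) is an assumption made for this sketch rather than documented package behavior. The formulas assumed are c_i = (1 - pi_i) n / (n - 1) for "Deville-1" and c_i = (1 - pi_i) / (1 - sum_k ((1 - pi_k) / sum_l (1 - pi_l))^2) for "Deville-2".

```python
def deville_constants(probs, method="Deville-1"):
    """Hypothetical helper returning the constants c_i for a vector of
    first-order inclusion probabilities."""
    n = len(probs)
    one_minus = [1.0 - p for p in probs]
    total = sum(one_minus)
    # Assumption for this sketch: with a single unit, or when every unit is
    # sampled with certainty, all constants are set to zero.
    if n < 2 or total == 0:
        return [0.0] * n
    if method == "Deville-1":
        return [q * n / (n - 1) for q in one_minus]
    if method == "Deville-2":
        sum_sq = sum((q / total) ** 2 for q in one_minus)
        return [q / (1.0 - sum_sq) for q in one_minus]
    raise ValueError("method must be 'Deville-1' or 'Deville-2'")


# Illustrative probabilities only.
probs = [0.2, 0.5, 0.8]
c1 = deville_constants(probs, "Deville-1")
c2 = deville_constants(probs, "Deville-2")

# With equal probabilities the two sets of constants coincide.
equal_probs = [0.25] * 4
c1_equal = deville_constants(equal_probs, "Deville-1")
c2_equal = deville_constants(equal_probs, "Deville-2")
```

With equal inclusion probabilities the squared shares sum to 1/n, so the "Deville-2" factor reduces to n / (n - 1) and both methods give the same constants.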
Both of the approximations "Deville-1" and "Deville-2" were shown in the simulation studies of Matei and Tillé (2005) to perform much better in terms of MSE compared to the strictly-unbiased Horvitz-Thompson and Yates-Grundy variance estimators. In the case of simple random sampling without replacement (SRSWOR), these estimators are identical to the usual Horvitz-Thompson variance estimator.
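The SRSWOR claim can be checked numerically with a small sketch. It assumes a sample of size n from a population of size N (so every inclusion probability is n/N), uses the "Deville-1" constants c_i = (1 - pi_i) n / (n - 1), and compares the result with the familiar SRSWOR formula N^2 (1 - n/N) s^2 / n. The function names and data are hypothetical.

```python
import statistics


def deville1_variance_srswor(y, N):
    """'Deville-1' estimate of the variance of the estimated total when all
    inclusion probabilities equal n / N (hypothetical helper)."""
    n = len(y)
    pi = n / N
    c = (1.0 - pi) * n / (n - 1)          # same constant for every unit
    y_breve = [yi / pi for yi in y]       # weighted values y_i / pi_i
    center = sum(y_breve) / n             # equal c_i, so a plain mean
    return sum(c * (yb - center) ** 2 for yb in y_breve)


def srswor_variance(y, N):
    """Usual SRSWOR variance estimator for the estimated total."""
    n = len(y)
    return N ** 2 * (1.0 - n / N) * statistics.variance(y) / n


# Illustrative data only: n = 4 sampled values from a population of N = 10.
y = [3.0, 5.0, 2.0, 8.0]
N = 10
v_deville = deville1_variance_srswor(y, N)
v_srswor = srswor_variance(y, N)
```

For these illustrative values both quantities evaluate to 105 up to floating-point rounding.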
Beaumont-Emond Estimator
Beaumont and Emond (2022) proposed a variance estimator for unequal probability sampling without replacement. This estimator is simply the Horvitz-Thompson variance estimator with the following approximation for the joint inclusion probabilities.
$$
\pi_{kl} \approx \pi_k \pi_l \frac{n - 1}{(n - 1) + \sqrt{(1 - \pi_k)(1 - \pi_l)}}
$$
In the case of cluster sampling, this approximation should be applied to the clusters rather than the units within clusters.
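A minimal sketch of this approach, assuming the approximation pi_kl ≈ pi_k pi_l (n - 1) / ((n - 1) + sqrt((1 - pi_k)(1 - pi_l))) and the Horvitz-Thompson quadratic form with entries 1 - pi_k pi_l / pi_kl applied to the weighted values y_k / pi_k. The function names are hypothetical and this is not the package's implementation; for a cluster sample, `probs` would hold the cluster-level inclusion probabilities.

```python
import math


def beaumont_emond_joint_probs(probs):
    """Approximate joint inclusion probabilities from first-order ones."""
    n = len(probs)
    joint = [[0.0] * n for _ in range(n)]
    for k in range(n):
        for l in range(n):
            if k == l:
                joint[k][l] = probs[k]    # pi_kk is just pi_k
            else:
                root = math.sqrt((1.0 - probs[k]) * (1.0 - probs[l]))
                joint[k][l] = probs[k] * probs[l] * (n - 1) / ((n - 1) + root)
    return joint


def horvitz_thompson_matrix(probs, joint):
    """Quadratic-form matrix with entries 1 - pi_k * pi_l / pi_kl."""
    n = len(probs)
    return [[1.0 - probs[k] * probs[l] / joint[k][l] for l in range(n)]
            for k in range(n)]


# Illustrative check under SRSWOR with n = 4 and N = 10 (pi = 0.4), where
# the exact joint probability is n (n - 1) / (N (N - 1)) = 12 / 90.
probs_srs = [0.4] * 4
joint_srs = beaumont_emond_joint_probs(probs_srs)
sigma_srs = horvitz_thompson_matrix(probs_srs, joint_srs)

# Illustrative unequal-probability example.
probs = [0.2, 0.5, 0.8]
sigma = horvitz_thompson_matrix(probs, beaumont_emond_joint_probs(probs))
```

Under equal probabilities the square root equals 1 - pi, so the approximation reproduces the exact SRSWOR joint inclusion probability, which is a useful sanity check on the formula.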
References
Matei, Alina, and Yves Tillé. 2005. “Evaluation of Variance Approximations and Estimators in Maximum Entropy Sampling with Unequal Probability and Fixed Sample Size.” Journal of Official Statistics 21(4):543–70.