On reuse of clusters in repeated studies


PDF document icon wp-05-87.pdf — PDF document, 540 kB (553,907 bytes)

Author(s): Kolenikov S, Angeles G

Year: 2005

Abstract:
Suppose data for a survey with multistage design is to be collected in two periods of time. This paper assesses the relative merits of keeping the same clusters in the sample vs. sampling new clusters, under differential statistical (correlation between clusters and overtime) and logistical (costs of survey) scenarios. The design effect of reusing the same clusters from the master sample over time is of the form 1 − Aρπ/n where ρ is intertemporal correlation of the cluster totals, n is the number of clusters, π is the proportion of clusters retained from the previous round, and A > 0 is a fixed constant. As long as the efficiency gains appear to be minor, the value of the designs that reuse the clusters comes from the logistical (cost of the survey) considerations. Empirical demonstrations that use Demographic and Health Survey (DHS) data for Bangladesh, 1996 and 2000, and a Monte Carlo simulation, are provided.