A stratified sample uses some characteristic such as age or gender to divide a population into sub-groups. This may be used to deliberately ensure that the groups are sampled in proportion to the prevalance in the population as a whole. Alternatively, some other criterion may be used to decide the subgroup samples; the statistics for each sub-group can then be weighted if the sub-groups have different proportions to the population proportion.
For example, suppose there are two subgroups with m1 and m2 individuals in the sample and n1 and n2 in the population as a whole. If we have measured means of μ1 and μ2 for the two subgroups our estimate of the overall population mean μ would be:
( μ1×m1/n1 + μ2×m2/n2 ) / (n1 + n2)
( μ1×m1/n1 + μ2×m2/n2 ) / (n1 + n2)
It can be especially important to use stratified sampling when a subgroup has a very low prevalance, for example those with a rare illness, as simple random sample might either not have any members of the minority group at all, or have very small numbers hence making estimates about the subgroup unreliable.
Used in Chap. 14: page 180
Also used in hcistats2e: Chap. 7: page 81
