Sometimes a strict simple random sample is difficult to obtain; therefore, we need to find other ways of randomly selecting units for a sample.
This type of sampling is used to select a sample from a very large population where certain groups and subgroups are available.
Suppose we wish to obtain a sample of people in the United States.
At each stage, we obtained a sampling frame (list of states; list of counties in the selected states; list of neighborhoods in the selected counties; list of people in the selected neighborhoods) and selected a simple random sample from that sampling frame. This method (multistage sampling) is much easier that selecting our sample from a list of all Americans (if such a list could be found).
This method is performed by randomly selecting a starting point in the sampling frame and then selecting every nth unit to be in the sample until the desired sample size has been reached. Usually every 5th or every 10th unit is selected.
This method divides the sampling frame into groups that are of interest.
Note: The major way that stratified samples differ from simple random samples is that stratified samples need not give all units in the population the same chance of being chosen.
A university has 30,000 students of whom 3,000 (10%) are black. Using an SRS of 500 students, we would expect 50 (10%) of the students in the sample to be black. This size has low precision (the opinions of the black students will be underrepresented), so we must increase the number of black students in the sample to increase this precision. So instead of the SRS, we'll randomly select 200 black students and 300 other students by dividing the registrar's list of students into two lists (one of black students and one of other students), and then we'll take an SRS of 200 from the list of black students and an SRS of 300 from the list of other students.


It is true that the blacks seem to be over-represented in our sample, but since the probability of selection for each group is known, we can correct for the over-representation when we analyze our data; when we know the probabilities associated with each stratum, we have a probability sample.
Suppose we sample 200 black students and 300 other students by the method of the preceding example and ask each of the students, "Do you favor the creation of a new degree program in African studies?" In the sample, 162 of the black students and 174 of the other students say that they are interested in the new African studies degree program. We then calculate, for each stratum, the statistic that will estimate the proportion of the students that are interested in the new program.
We estimate
or
or

In all, we estimate that 2,430 + 15,660 = 18,090 of the 30,000 students at the
university are in favor of the new degree program. That is, of all students at the university,
or
As you can see, stratified samples (and likewise, probability samples) give us more information than we can get from just a simple random sample.
(Scott Street's section only)
Please direct all questions regarding STAT-110 to your instructor or to the director of STAT-110, Dr. Todd Ogden at ogden@stat.sc.edu.
Mail comments regarding this presentation to W. Scott Street, IV at street@stat.sc.edu.
© 1996 by W. Scott Street, IV