Sign in to view assessments and invite other educators
Sign in using your existing Kendall Hunt account. If you don’t have one, create an educator account.
To see what might be happening when we regroup data, consider an experiment that takes 12 subjects and divides them into 2 groups at random. The control group contains 6 subjects, and the treatment group contains 6 subjects. To explore what's possible, assume the control group results in these data: 1, 3, 4, 6, 8, and 10. The treatment group results in these data: 2, 5, 7, 9, 11, and 12.
With a smaller data set like this, we can actually consider all of the different arrangements of the data. There are 924 distinct ways to separate the 12 values into 2 groups of 6. The frequency table shows all non-negative differences in means and how often they occur. Due to symmetry, the negative differences should match. Notice that a difference in means of 4.33 occurs 7 times, so a difference of -4.33 also occurs 7 times. The dot plot shows the same information.
| difference in means | 6 | 5.67 | 5.33 | 5 | 4.67 | 4.33 | 4 |
|---|---|---|---|---|---|---|---|
| frequency | 1 | 1 | 2 | 3 | 5 | 7 | 11 |
| difference in means | 3.67 | 3.33 | 3 | 2.67 | 2.33 | 2 |
|---|---|---|---|---|---|---|
| frequency | 13 | 18 | 22 | 28 | 32 | 39 |
| difference in means | 1.67 | 1.33 | 1 | 0.67 | 0.33 | 0 |
|---|---|---|---|---|---|---|
| frequency | 42 | 48 | 51 | 55 | 55 | 58 |
What proportion of possible groupings have a difference at least as great as the difference in means for the original groups? Explain or show your reasoning.
The proportion you calculate represents the probability that, if we assumed the treatment had no impact on the response, we would still see a difference at least as large as what we saw with our original grouping. Based on the proportion you calculated for this situation, which description is most accurate? Explain your reasoning.
Because the proportion is so low, it is unlikely that the difference in means is due to the randomized groupings. This means that the difference in means is most likely caused by the treatment.
Because the proportion is not that low, it is possible that the original difference in means is due to the random groupings. This means that there is not enough evidence to determine that the difference in means is likely caused by the treatment.
If students are confused about how to interpret the frequency table, consider asking:
“How is the information in the frequency table displayed in the dot plot?”
“There is a dot over -6 in the dotplot. How is that information represented in the frequency table?”
The purpose of the discussion is to help students understand that redistributing the data from an experiment into groups at random can help determine whether the original results are significant, that is, whether they provide evidence against the claim that the treatment did not cause any impact on the response.
Select students to share their solutions and reasoning.
Tell students that the use of statistics cannot prove with absolute certainty that the results of an experiment are due to the treatment or groups. An analysis of this type can merely provide strong evidence that the treatment is the reason the original means are different. Or the analysis can tell us that there is not enough evidence to make that conclusion.
Consider asking:
Statistical technology is needed for every student.
Tell students to close their books or devices (or to keep them closed). Arrange students in groups of 2. Use Three Reads to support reading comprehension and sense-making about this problem. Display only the problem stem and distribution, without revealing the questions.
Researchers want to know the effect of captively raising birds on the weight of the birds. The researchers begin with 100 birds divided into 2 groups of 50 each. One group of 50 is raised in captivity and the other 50 are tagged and released into the wild. After 5 years, all 100 birds are collected and weighed.
There are more than different ways to regroup the 100 birds into groups of 50 again, so looking at all the combinations would be too time consuming to reproduce. In this case, we can run simulations to determine how the original difference in means compares to those from regrouping the data.
The original groups have a difference of means of 0.27 gram. Researchers run 1,000 simulations that regroup the data into 2 groups at random and record the differences in means for the groups in each simulation. The histogram shows the differences in means from the simulations.
The researchers determine that the mean of the differences of means from the simulations is 0.0021 gram and the standard deviation for the differences of means from the simulations is 0.112 gram.
What features of the distribution in the histogram let us know that modeling with a normal distribution is reasonable?
Model the simulations using a normal distribution with a mean of 0 and a standard deviation of 0.112. What is the area under this normal curve that is greater than 0.27?
What does this area mean in this situation?
Based on the area under the normal curve, is there evidence that the original difference in means is likely due to where the birds spent the 5 years? Explain your reasoning.
The purpose of this discussion is to show that areas under normal curves can be used to help decide whether the difference in means from an experiment are significant.
Select students to share their solutions and reasoning. To promote discussion, consider asking: "In this problem, a 1.7% chance of the difference in means from the original groups happening even if we assumed the treatment had no impact seems low enough to conclude that the difference in means is likely due to the treatment. What percentage might make you change your opinion on whether the groupings were the cause of the original difference in means?"
Tell students that common cutoffs that indicate statistical results are considered statistically significant are 0.1%, 1%, 5%, or 10%.
Ask students, "Why do you think it would be important to set your expectations for a cutoff before doing the experiment?" (Setting the expectations can help decide how many subjects to include in the study and does not unfairly bias the results. Some experimenters may continue to run new simulations until their results become significant.)