Not all roles available for this page.
Sign in to view assessments and invite other educators
Sign in using your existing Kendall Hunt account. If you don’t have one, create an educator account.
The purpose of this Warm-up is for students to first reason about the mean of a data set without calculating and then to practice calculating the mean. The context will be used in an upcoming activity in this lesson, so this Warm-up familiarizes students with the context for talking about deviation from the mean.
In their predictions, students may think that Elena will have the highest mean, because she has a few very high scores (7, 8, and 9 points). They may also think that Lin and Jada will have very close means because they each have 5 higher scores than one another, and their other scores are the same. Even though each player has the same mean, all of these ideas are reasonable things for students to consider when looking at the data. Record and display their predictions without further questions until they have calculated and compared the mean of their individual data sets.
Arrange students in groups of 3.
Tell each group member to calculate the mean of the data set for one player in the task, share their work in the small group, and complete the remaining questions.
Elena, Jada, and Lin enjoy playing basketball during recess. Lately, they have been practicing free throws. They record the number of baskets they make out of 10 attempts. Here are their data sets for 12 school days.
Elena
2
2
2
2
4
5
5
6
8
9
9
9
Jada
2
4
5
4
6
6
4
7
3
4
8
7
Lin
3
6
6
4
5
5
3
5
4
6
6
7
Ask students to share the mean for each player's data set. Record and display their responses for all to see. After each student shares, ask the class if they agree or disagree and what the mean tells us in this context. If the idea that the means show that all three students make, on average, half of the 10 attempts to get the basketball in the hoop does not arise, make that idea explicit.
If there is time, consider revisiting the predictions and asking how the mean of Elena's data set can be the same as the others when she has more high scores?
In this activity, students learn the term mean absolute deviation (MAD) as a way to quantify variability and calculate it by finding distances between the mean and each data value. Students compare data sets with the same mean but different MADs and interpret the variability in context.
While this process of calculating MAD involves taking the absolute value of the difference between each data point and the mean, this formal language is downplayed here. Instead, the idea of “finding the distance,” which is always positive, is used. This is done for a couple of reasons. One reason is to focus students' attention on the statistical work rather than on terminology or symbolic work. Another reason is that finding these differences may involve operations with signed numbers, which are not expected in this course.
Remind students that earlier they found the distance between each data point and the mean, and found that the sum of those distances on the left and the sum on the right are equal, which allows us to think of the mean as the balancing point, or the center, of the data. Explain that the distance between each point and the mean can be used to tell us something else about a distribution.
Arrange students in groups of 2. Give students 4–5 minutes to complete the first set of questions with their partner, and then 4–5 minutes of quiet time to complete the remaining questions. Follow with a whole-class discussion.
The tables show the number of baskets made by Jada and Lin in several games. Recall that the mean of Jada and Lin’s data is 5.
Record the distance between the number of baskets Jada made in each game and the mean.
| Jada | 2 | 4 | 5 | 4 | 6 | 6 | 4 | 7 | 3 | 4 | 8 | 7 |
| distance from 5 |
Now find the average of the distances in the table. Show your reasoning, and round your answer to the nearest tenth.
This value is the mean absolute deviation (MAD) of Jada’s data. Jada’s MAD: _________
Find the mean absolute deviation of Lin’s data. Round it to the nearest tenth.
| Lin | 3 | 6 | 6 | 4 | 5 | 5 | 3 | 5 | 4 | 6 | 6 | 7 |
| distance from 5 |
Lin’s MAD: _________
Elena’s distribution has a MAD of about 2.5. Compare the MADs and dot plots of the three students’ data. Do you see a relationship between each student’s MAD and the distribution on her dot plot? Explain your reasoning.
Students may recall an earlier lesson about thinking of the mean as a balance point and think that the MAD should always be zero because the left and right distances should be equal. Remind them that distances are always positive, so the average of these distances to the mean can be zero only if all the data points are exactly at the mean.
During discussion, highlight that finding how far away, on average, the data points are from the mean is a way to describe the variability of a distribution. Discuss:
This activity allows students to practice calculating MAD and to build a better understanding of what it tells us. Students compare data sets with the same mean but different MADs and interpret what these differences imply in the context of the situation. During the discussion, they select a student to be on their team based on the comparison.
Expect students to choose different players to be on their team, but be sure they support their preferences with a reasonable explanation (MP3).
Arrange students in groups of 3–4. Before students read the Task Statement, display the two dot plots in the task for all to see. Give students up to 1 minute to study the dot plots and share with their group what they notice and wonder about the plots.
Next, select a few students to share what they notice and what they wonder. It is not necessary to confirm or correct students' observations or answer their questions at this point. If no one mentioned comparing the distributions, ask them to think about how they might do that. Explain to students that they will find more information in the Task Statement to help them compare and interpret the dot plots.
Give students 3–4 minutes of quiet work time to complete the first set of questions, and then 8–10 minutes to complete the second set with their group. Allow at least a few minutes for a whole-class discussion.
Andre and Noah joined Elena, Jada, and Lin in recording their basketball scores. They all record their scores in the same way: the number of baskets made out of 10 attempts. Each person collects 12 data points.
Here are two dot plots that represent the two data sets. The triangle indicates the location of the mean.
| eighth‐grade student |
6 | 5 | 4 | 7 | 6 | 5 | 7 | 8 | 5 | 6 | 5 | 8 |
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| distance from 6 |
Draw a dot plot to represent his data and mark the location of the mean with a triangle.
Select a couple of students to share their responses to the first set of questions about how they matched the dot plots to the players and how they knew.
Then, display a completed table and the MAD for the second set of questions. Give students a moment to check their work. To facilitate discussion, help students connect MAD and the spread of data, and enable them to make a comparison. Consider displaying all three dot plots at the same scale and using a line segment to represent the MAD on each dot plot, as shown here.
Invite a few students to share their observations about how the means and MADs of Noah and the eighth-grade student compare. Discuss:
Students should walk away understanding that, in this context, a higher MAD indicates more variability and less consistency in the number of shots made.
In this activity, students continue to practice interpreting the mean and the MAD and to use them to answer statistical questions. A new context is introduced, but students should continue to consider both the center and variability of the distribution as ways of thinking about what is typical for a set of data and how consistent the data tends to be.
Give students 5–7 minutes of quiet work time. Ask students to consider drawing a triangle and a line segment on each dot plot in the last question to represent the mean and MAD for each data set (as was done in an earlier lesson).
The mean age of swimmers on a 1984 national swim team is 18.2 years and the MAD is 2.2 years. The mean age of the swimmers on the 2016 team is 22.8 years, and the MAD is 3 years.
Here are dot plots showing the ages of the swimmers on the national swim teams in 1984 and in 2016. Use them to make two other comments about how the team has changed over the years.
Display the dot plots for all to see. Invite a student to add the means and MADs to the plots. Then invite several students to share their comparison of the distributions. Here are some discussion questions:
The purpose of the discussion is to understand how a measure of spread can be used to quantify variability for a distribution. Ask students:
We use the mean of a data set as a "measure of center" of its distribution, but two data sets with the same mean could have very different distributions.
This dot plot shows the number of stickers on each page of a 22-page sticker book.
The mean number of stickers is 21. All the pages have within 3 stickers of the mean, and most of them are even closer. These pages are all fairly close in the number of stickers on them.
This dot plot shows the number of stickers on each page of another sticker book that has 30 pages.
In this sticker book, the mean number of stickers on each page is also 21, but some pages have less than half that number of stickers and others have more than one-and-a-half times as many. There is a lot more variability in the number of stickers.
There is a number that we can use to describe how far away, or how spread out, data points generally are from the mean. This measure of spread is called the mean absolute deviation (MAD).
To find the MAD, we find the distance between each data value and the mean, and then calculate the mean of those distances. For instance, the point that represents 18 stickers is 3 units away from the mean of 21 stickers.
We can find the distance between each point and the mean of 21 stickers and then organize the distances into a table, as shown.
The values in the first row of the table are the number of stickers on each page in the first book. Their mean, 21, is the mean number of stickers on a page.
The values in the second row of the table are the distances, or absolute deviation, between the values in the first row and 21. The mean of these distances is the MAD of the number of stickers on a page, about 1.2 stickers.
What can we learn from the averages of these distances once they are calculated?
In the first book, the distances are all between 0 and 3. The MAD is 1.2 stickers, which tells us that the number of stickers are typically within 1.2 of the mean number, 21. We could say that a typical page has between 19.8 and 22.2 stickers.
In the second book, the distances are all between 0 and 13. The MAD is 5.6 stickers, which tells us that the number of stickers are typically within 5.6 of the mean number, 21. We could say that a typical page has between 15.4 and 26.6 stickers.
The MAD is also called a measure of the variability of the distribution. In these examples, it is easy to see that a higher MAD suggests a distribution that is more spread out, showing more variability.
In summary, a measure of center, such as the mean, gives a sense of what is typical for a set of data. A measure of variability, such as the MAD, gives a sense of how consistent the data are.