Not all roles available for this page.
Sign in to view assessments and invite other educators
Sign in using your existing Kendall Hunt account. If you don’t have one, create an educator account.
The purpose of this Warm-up is for students to first reason about the mean of a data set without calculating and then to practice calculating the mean. The context will be used in an upcoming activity in this lesson, so this Warm-up familiarizes students with the context for talking about deviation from the mean.
In their predictions, students may think that Elena will have the highest mean, because she has a few very high scores (7, 8, and 9 points). They may also think that Lin and Jada will have very close means because they each have 5 higher scores than one another, and their other scores are the same. Even though each player has the same mean, all of these ideas are reasonable things for students to consider when looking at the data. Record and display their predictions without further questions until they have calculated and compared the mean of their individual data sets.
Arrange students in groups of 3.
Tell each group member to calculate the mean of the data set for one player in the task, share their work in the small group, and complete the remaining questions.
Elena, Jada, and Lin enjoy playing basketball during recess. Lately, they have been practicing free throws. They record the number of baskets they make out of 10 attempts. Here are their data sets for 12 school days.
Elena
2
2
2
2
4
5
5
6
8
9
9
9
Jada
2
4
5
4
6
6
4
7
3
4
8
7
Lin
3
6
6
4
5
5
3
5
4
6
6
7
Ask students to share the mean for each player's data set. Record and display their responses for all to see. After each student shares, ask the class if they agree or disagree and what the mean tells us in this context. If the idea that the means show that all three students make, on average, half of the 10 attempts to get the basketball in the hoop does not arise, make that idea explicit.
If there is time, consider revisiting the predictions and asking how the mean of Elena's data set can be the same as the others when she has more high scores?
In this activity, students turn their focus to variability while continuing to develop their understanding of what could be considered typical for a group. Students compare distributions with the same mean but different spreads and interpret them in the context of a situation. The context given here (basketball scores) prompts them to connect the mean to the notion of how well a player plays in general, and deviations from the mean to how “consistently” that player plays.
They encounter the idea of calculating the average absolute deviation from the mean as a way to describe variability in data.
Arrange students in groups of 3–4. Give groups 6–7 minutes to answer the questions, and follow with a whole-class discussion.
Here are the dot plots showing the number of baskets that Elena, Jada, and Lin each made over 12 school days.
On each dot plot, mark the location of the mean with a triangle. Then, contrast the dot plot distributions. Write 2–3 sentences to describe the shape and spread of each distribution.
Discuss these questions with your group. Explain your reasoning.
The purpose of the discussion is to highlight that the center of the distribution is not always the only consideration when discussing data. The variability or spread can also influence how we understand the data.
There are many ways to answer the second set of questions. Invite students or groups who have different interpretations of playing well and playing consistently to share their thinking. Allow as many interpretations to be shared as time permits. Discuss:
In this activity, students learn the term mean absolute deviation (MAD) as a way to quantify variability and calculate it by finding distances between the mean and each data value. Students compare data sets with the same mean but different MADs and interpret the variability in context.
While this process of calculating MAD involves taking the absolute value of the difference between each data point and the mean, this formal language is downplayed here. Instead, the idea of “finding the distance,” which is always positive, is used. This is done for a couple of reasons. One reason is to focus students' attention on the statistical work rather than on terminology or symbolic work. Another reason is that finding these differences may involve operations with signed numbers, which are not expected in this course.
Remind students that earlier they found the distance between each data point and the mean, and found that the sum of those distances on the left and the sum on the right are equal, which allows us to think of the mean as the balancing point, or the center, of the data. Explain that the distance between each point and the mean can be used to tell us something else about a distribution.
Arrange students in groups of 2. Give students 4–5 minutes to complete the first set of questions with their partner, and then 4–5 minutes of quiet time to complete the remaining questions. Follow with a whole-class discussion.
The tables show the number of baskets made by Jada and Lin in several games. Recall that the mean of Jada and Lin’s data is 5.
Record the distance between the number of baskets Jada made in each game and the mean.
| Jada | 2 | 4 | 5 | 4 | 6 | 6 | 4 | 7 | 3 | 4 | 8 | 7 |
| distance from 5 |
Now find the average of the distances in the table. Show your reasoning, and round your answer to the nearest tenth.
This value is the mean absolute deviation (MAD) of Jada’s data. Jada’s MAD: _________
Find the mean absolute deviation of Lin’s data. Round it to the nearest tenth.
| Lin | 3 | 6 | 6 | 4 | 5 | 5 | 3 | 5 | 4 | 6 | 6 | 7 |
| distance from 5 |
Lin’s MAD: _________
Elena’s distribution has a MAD of about 2.5. Compare the MADs and dot plots of the three students’ data. Do you see a relationship between each student’s MAD and the distribution on her dot plot? Explain your reasoning.
Students may recall an earlier lesson about thinking of the mean as a balance point and think that the MAD should always be zero because the left and right distances should be equal. Remind them that distances are always positive, so the average of these distances to the mean can be zero only if all the data points are exactly at the mean.
During discussion, highlight that finding how far away, on average, the data points are from the mean is a way to describe the variability of a distribution. Discuss:
Optional
This optional activity uses a game to help students develop the idea of variability. Use this activity if students could benefit from more concrete experiences with the idea of distance from the mean. Students will draw from a standard deck of playing cards and find the sum of the values of the drawn cards. They will determine how far their sum is from “22.” After playing 5 rounds, the player with the least mean distance from 22 wins the game.
Have students work in groups of 2–3. Provide a deck of standard playing cards to each group. Play 5 rounds.
Your teacher will give your group a deck of cards. Shuffle the cards, and put the deck face down on the playing surface.
| player A | round 1 | round 2 | round 3 | round 4 | round 5 |
|---|---|---|---|---|---|
| sum of cards | |||||
| distance from 22 |
Average distance from 22: ____________
| player B | round 1 | round 2 | round 3 | round 4 | round 5 |
|---|---|---|---|---|---|
| sum of cards | |||||
| distance from 22 |
Average distance from 22: ____________
| player C | round 1 | round 2 | round 3 | round 4 | round 5 |
|---|---|---|---|---|---|
| sum of cards | |||||
| distance from 22 |
Average distance from 22: ____________
Whose average distance from 22 is the smallest? Who won the game?
Ask students to think about how average distance from a number can be used to summarize variability, and invite a couple of students to share their thinking.
In the game, we can think of the player with the least average distance from 22 as having cards that are, on the whole, closest to 22 or the “least different” from 22. By the same token, a player with the greatest average distance from 22 can be seen as having cards that are, on the whole, farthest away from 22 or the “most different” from 22. Connect this to the idea that a data set with a large MAD means it has many values that vary from what we could consider a typical member of the group.
The purpose of the discussion is to understand how a measure of spread can be used to quantify variability for a distribution. Ask students:
We use the mean of a data set as a "measure of center" of its distribution, but two data sets with the same mean could have very different distributions.
This dot plot shows the number of stickers on each page of a 22-page sticker book.
The mean number of stickers is 21. All the pages have within 3 stickers of the mean, and most of them are even closer. These pages are all fairly close in the number of stickers on them.
This dot plot shows the number of stickers on each page of another sticker book that has 30 pages.
In this sticker book, the mean number of stickers on each page is also 21, but some pages have less than half that number of stickers and others have more than one-and-a-half times as many. There is a lot more variability in the number of stickers.
There is a number that we can use to describe how far away, or how spread out, data points generally are from the mean. This measure of spread is called the mean absolute deviation (MAD).
To find the MAD, we find the distance between each data value and the mean, and then calculate the mean of those distances. For instance, the point that represents 18 stickers is 3 units away from the mean of 21 stickers.
We can find the distance between each point and the mean of 21 stickers and then organize the distances into a table, as shown.
The values in the first row of the table are the number of stickers on each page in the first book. Their mean, 21, is the mean number of stickers on a page.
The values in the second row of the table are the distances, or absolute deviation, between the values in the first row and 21. The mean of these distances is the MAD of the number of stickers on a page, about 1.2 stickers.
What can we learn from the averages of these distances once they are calculated?
In the first book, the distances are all between 0 and 3. The MAD is 1.2 stickers, which tells us that the number of stickers are typically within 1.2 of the mean number, 21. We could say that a typical page has between 19.8 and 22.2 stickers.
In the second book, the distances are all between 0 and 13. The MAD is 5.6 stickers, which tells us that the number of stickers are typically within 5.6 of the mean number, 21. We could say that a typical page has between 15.4 and 26.6 stickers.
The MAD is also called a measure of the variability of the distribution. In these examples, it is easy to see that a higher MAD suggests a distribution that is more spread out, showing more variability.