Not all roles available for this page.
Sign in to view assessments and invite other educators
Sign in using your existing Kendall Hunt account. If you don’t have one, create an educator account.
The purpose of this Warm-up is to elicit the idea that there are multiple ways to describe variability, which will be useful when students learn about range and interquartile range in a later activity. While students may notice and wonder many things about these dot plots, variability and how to measure it are the important discussion points.
When students articulate what they notice and wonder, they have an opportunity to attend to precision in the language that they use to describe what they see (MP6). They might first propose less formal or imprecise language, and then restate their observation with more precise language in order to communicate more clearly
Arrange students in groups of 2. Display the dot plots for all to see. Ask students to think of at least one thing that they notice and at least one thing they wonder. Give students 1 minute of quiet think time, and then 1 minute to discuss with their partner the things that they notice and wonder about.
Here are dot plots that show the ages of people at two different parties. The mean of each distribution is marked with a triangle.
What do you notice and what do you wonder about the distributions in the two dot plots?
Ask students to share the things they noticed and wondered. Record and display their responses for all to see, without editing or commentary. If possible, record the relevant reasoning on or near the dot plots. Next, ask students, “Is there anything on this list that you are wondering about now?” Encourage students to respectfully disagree, ask for clarification, or point out contradicting information.
If the idea that the MAD does not describe the variability of these two sets well does not come up during the conversation, ask students to discuss that idea.
Two key ideas to uncover here are:
This activity introduces students to the five-number summary and the process of identifying the five numbers. Students learn how to partition the data into four sets: using the median to decompose the data into upper and lower halves, and then finding the middle of each half to further decompose it into quarters. They learn that each value that decomposes the data into four parts is called a quartile, and the three quartiles are the first quartile (Q1), second quartile (Q2, or the median), and third quartile (Q3). Together with the minimum and maximum values of the data set, the quartiles provide a five-number summary that can be used to describe a data set without listing or showing each data value.
Students reason abstractly and quantitatively (MP2) as they identify and interpret the quartiles in the context of the situation given.
To save time, find the median together with the whole class. Then assign half of the class to find the first quartile and the other half to find the third quartile. After 2 minutes of group work time, ask a group from each half to share their results with the class.
During the Activity Synthesis, introduce the terms range and interquartile range (IQR). The range is the difference between the maximum and minimum values in the data, and the IQR is the difference between the third and first quartiles. Demonstrate how to compute the range () and interquartile range () of the data.
Arrange students in groups of 2. Give groups 8–10 minutes to complete the activity. Follow with a whole-class discussion.
Ask students if they have attended different kinds of parties with different age ranges. If a few different possibilities don’t come up, share some examples:
Remind students that they previously summarized variability by finding the MAD, which involves calculating the distance of each data point from the mean and then finding the average of those distances. Explain that we will now explore another way to describe variability and summarize the distribution of data. Instead of measuring how far away data points are from the mean, we will decompose a data set into four equal parts and use the markers that partition the data into quarters to summarize the spread of data.
If necessary, remind students how to find the median, especially when there are an even number of values in the data set.
Here are the ages of the people at one party, listed from least to greatest.
Label the least value in the set “minimum” and the greatest value “maximum.”
The values you have identified make up the five-number summary for the data set. Record them here.
minimum: _____ Q1: _____ Q2: _____ Q3: _____ maximum: _____
The median of this data set is 20. This tells us that half of the people at the party were 20 years old or younger, and the other half were 20 or older. What do each of these other values tell us about the ages of the people at the party?
Ask a student to display the data set that they have decomposed and labeled, or display the diagram for all to see.
Focus the conversation on students' interpretation of the five numbers. As students discuss their solutions, color code or annotate the five-number summary on the data set and diagram. Discuss:
Previously, students learned to identify the median, quartiles, and five-number summary of data sets. They also calculated the range and interquartile range of distributions. In this activity, students rely on those experiences to make sense of box plots. They explore this new representation of data kinesthetically: by creating a human box plot to represent class data on the lengths of student names, which they collected in the “Finding the Middle” activity in an earlier lesson.
Before the lesson, use thin painter’s tape to make a number line on the ground. If the floor is tiled with equal-sized tiles, consider using the tiles for the intervals of the number line. Otherwise, mark off equal intervals on the tape. The number line should cover at least the distance between the least data value (the fewest number of letters in a student's name) and the greatest (the most number of letters).
Provide each student with a copy of the data on the lengths of students’ names from the “Finding the Middle” activity. If any students were absent then, add their names and numbers of letters to the data set.
Give students 4–5 minutes to find the quartiles and write the five-number summary of the data. Then, invite several students to share their findings and come to an agreement on the five numbers. Record and display the summary for all to see.
Explain to students that the five-number summary can be used to make another visual representation of a data set called a “box plot.” Tell students that they will create a human box plot in a way similar to the way they found the median.
Now that the five numbers are identified and each is associated with one or more students, use wide painter's tape to construct a box plot.
This image shows an example of a completed human box plot.
Explain to students that they have made a human box plot. Consider taking a picture of the box plot for reference and discussion later.
Your teacher will give you the data on the lengths of names of students in your class. Write the five-number summary by finding the data set's minimum, Q1, Q2, Q3, and the maximum.
Pause for additional instructions from your teacher.
Tell students that a box plot is a representation that shows the five-number summary of a data set. Discuss:
In this activity, students learn to draw a box plot, and they explore the connections between a dot plot and a box plot of the same data set. Then they compare the representations by commenting on what information can be quickly understood from each, based on the structure of the representations (MP7).
Arrange students in groups of 2. Give students 8–10 minutes to complete the questions, and then follow with a whole-class discussion.
Tell students that they will now draw a box plot to represent another set of data. For their background information, explain that scientists believe people blink their eyes to keep the surface of the eye moist and also to give the brain a brief rest. On average, people blink between 15 and 20 times a minute.
Display the box plot for all to see. Tell students that their box plot will have all of these features, but will not look exactly like this because their data is different from the one used to make this box plot.
Use Collect and Display to create a shared reference that captures students’ developing mathematical language. Collect the language that students use to compare the representations. Display words and phrases such as “median,” “quartiles,” “five-number summary,” “estimate,” and “exact.”
Twenty people participate in a study about blinking. The number of times each person blinked while watching a video for one minute is recorded. The data values are shown here, in order from smallest to largest.
What are the minimum and maximum values?
A box plot can be used to represent the five-number summary graphically. Let’s draw a box plot for the number-of-blinks data. Above the dot plot:
Compare the information that can be quickly understood from each representation.
Display the dot plot and the box plot for all to see.
Direct students' attention to the reference created using Collect and Display. Ask students to share their comparison of the representations. Invite students to borrow language from the display as needed, and update the reference to include additional phrases as they respond. (For example, “The box plot shows the five-number summary easily, but the exact data are lost. The dot plot shows the shape of the distribution better, but there are no calculated values shown.”)
Discuss:
The focus of this activity is on constructing a box plot and understanding its parts, rather than on interpreting it in context. If students seem to have a good grasp of the drawing process and what the parts entail and mean, consider asking them to interpret the plots in the context of the research study. Ask: “Suppose you are the scientist who conducted the research and are writing an article about it. Write 2–3 sentences that summarize your findings, based on your analyses of the dot plot and the box plot.” (Half of the participants blink between 12 and 21 times per minute. A person blinked as few as 3 times and another as many as 51 times, but these values were unusual in this group.)
The purpose of this discussion is for students to clarify their understanding of quartiles, interquartile range, and box plots.
Review with students:
Earlier we learned that the mean is a measure of the center of a distribution and the MAD is a measure of the variability (or spread) that goes with the mean. There is also a measure of spread that goes with the median. It is called the interquartile range (IQR).
Finding the IQR involves splitting a data set into fourths. Each of the three values that splits the data into fourths is called a quartile. For example, here is a data set with 11 values.
| 12 | 19 | 20 | 21 | 22 | 33 | 34 | 35 | 40 | 40 | 49 | 
| Q1 | Q2 | Q3 | 
The difference between the maximum and minimum values of a data set is the range. For this data set, the range is 37 because .
The difference between Q3 and Q1 is the interquartile range (IQR). For this data set, the IQR is 20 because . Because the distance between Q1 and Q3 includes the middle two-fourths of the distribution, the values between those two quartiles are sometimes called the middle half of the data.
The bigger the IQR, the more spread out the middle half of the data values are. The smaller the IQR, the closer together the middle half of the data values are. This is why we can use the IQR as a measure of spread.
A five-number summary can be used to summarize a distribution. It includes the minimum, first quartile, median, third quartile, and maximum of the data set. For the previous example, the five-number summary is 12, 20, 33, 40, and 49. These numbers are marked with diamonds on the dot plot.
A box plot represents the five-number summary of a data set.
It shows the first quartile (Q1) and the third quartile (Q3) as the left and right sides of a rectangle, or a box. The median (Q2) is shown as a vertical segment inside the box. On the left side, a horizontal line segment, sometimes called a whisker, extends from Q1 to the minimum value. On the right, a whisker extends from Q3 to the maximum value.
The rectangle in the middle represents the middle half of the data. Its width is the IQR. The whiskers represent the bottom quarter and the top quarter of the data set.