Sign in to view assessments and invite other educators
Sign in using your existing Kendall Hunt account. If you don’t have one, create an educator account.
This week, your student will work with data and use data to answer statistical questions. Questions such as “Which band is the most popular among students in sixth grade?” or “What is the most common number of siblings among students in sixth grade?” are statistical questions. They can be answered using data, and the data are expected to vary (that is, the students do not all have the same musical preference or the same number of siblings).
Students have used bar graphs and line plots, or dot plots, to display and interpret data. Now they learn to use histograms to make sense of numerical data. The dot plot and histogram here display the distribution of the weights of 30 dogs.
A dot plot shows individual data values as points. In a histogram, the data values are grouped. Each group is represented as a vertical bar. The height of the bar shows how many values are in that group. The tallest bar in this histogram shows that there are 10 dogs that weigh from 20 kilograms up to 25 kilograms (not including 25).
The shape of a histogram can tell us about how the data are distributed. For example, we can see that more than half of the dogs weigh less than 25 kilograms, and that a dog weighing from 25 to 30 kilograms is not typical.
Here is a task to try with your student:
This histogram shows the weights of 143 bears.
About how many bears weigh from 100 to 150 pounds?
About how many bears weigh less than 100 pounds?
Solution:
This week, your student will learn to use the median and interquartile range or IQR to summarize the distribution of data.
The median is the middle value of a data set whose values are listed in order. To find the median, arrange the data in order from least to greatest, and look at the middle of the list.
Suppose nine students reported the numbers of hours of sleep they got on a weeknight.
6
7
7
8
9
9
10
11
12
The middle number is 9, so the median number of hours of sleep is 9 hours. This means that half of the students slept for less than or equal to 9 hours, and the other half slept for greater than or equal to 9 hours.
Suppose eight teachers reported these numbers of hours of sleep on a weeknight.
5
6
6
6
7
7
7
8
This data set has an even number of values, so there are two numbers in the middle—6 and 7. The median is the number exactly in between them: 6.5. In other words, if there are two numbers in the middle of a data set, the median is the average of those two numbers.
When the data set is large, it can be helpful to break it down further. After finding the median, we can focus on the lower half of the data and split those values using the median of just that half. This value is called the first quartile or Q1. It tells us that 25% of the full data set is less than or equal to Q1 and 75% is greater than or equal to Q1. Similarly, the third quartile (Q3) is the median of the upper half of the data set.
A box plot is a way to represent the 3 quartiles of a data set, along with its maximum and minimum. This box plot shows those five numbers for the data on the students’ hours of sleep.
The distance between the first and third quartiles is the interquartile range or the IQR of data. It tells us about the middle half of the data and is represented by the width of the box of the box plot. We can use it to describe how alike or different the data values are. Box plots are especially useful for comparing the distributions of two or more data sets.
The box plots show that the smallest measured beetle is 5 millimeters long, and that half of the beetles are between approximately 7 and 14 millimeters long.
Here is a task to try with your student:
Here are data showing the number of points Jada scored in 10 basketball games.
10
14
6
12
38
12
8
7
10
23
What is her median score?
Solution:
This week, your student will learn to calculate and interpret the mean, or the average, of a data set.
We can think of the mean of a data set as a fair share—what would happen if the numbers in the data set were distributed evenly. Suppose a runner ran 3, 4, 3, 1, and 5 miles over five days. If the total number of miles she ran, 16 miles (\(3+4+3+1+5=16\)), were distributed evenly across five days, the distance run per day, 3.2 miles (\(16 \div 5 = 3.2\)), would be the mean. To calculate the mean, we can add the data values and then divide the sum by how many values there are.
If we think of data points as weights along a number line, the mean can also be interpreted as the balance point of the data. The dots show the travel times, in minutes, of Lin and Andre. The triangles show each mean travel time. Notice that the data points are “balanced” on either side of each triangle.
Your student will also learn to find and interpret the mean absolute deviation, or the MAD, of data. The MAD tells us the distance, on average, of a data point from the mean. When the data points are close to the mean, the distances between them and the mean are small, so the average distance—the MAD—will also be small. When points are more spread out, the MAD will be greater.
We use mean and MAD values to help us summarize data.
Here is a task to try with your student:
Solution:
For Lin’s data, the mean is \(\frac{8 + 11 + 11 + 18 + 22}{5} = \frac{70}{5}\), which equals 14. For Andre’s data, the mean is \(\frac{12 + 12 + 13 + 16 + 17}{5} = \frac{70}{5}\), which also equals 14.
Sample explanations:
Lin’s data has a higher MAD. Sample explanations: