Not all roles available for this page.
Sign in to view assessments and invite other educators
Sign in using your existing Kendall Hunt account. If you don’t have one, create an educator account.
The average is another name for the mean of a data set. To find the average, add all the numbers in the data set. Then divide by how many numbers there are.
The average is 7.5.
A box plot is a way to represent data on a number line with a box and some lines. The data is divided into four sections by 5 values. Those values are the minimum, first quartile, median, third quartile, and maximum.
A set of categorical data has values that are words instead of numbers.
For example, Han asks 5 friends to each name their favorite color. Their answers are “blue,” “blue,” “green,” “blue,” and “orange.”
The center of a data set is a value in the middle. It represents a typical value for the data set.
The center of this data set is between 4.5 and 5 kilograms. So a typical cat in this group weighs between 4.5 and 5 kilograms.
The distribution of a data set tells how many times each value occurs.
This dot plot shows the distribution for the data set 6, 10, 7, 35, 7, 36, 32, 10, 7, 35 kilograms.
A dot plot is a way to represent data with dots. Each dot above a number shows one time the value occurs in the set.
The frequency of a data value is how many times it occurs in the data set.
For example, there are 20 dogs in a park. The table shows the frequency of each color.
| dog color | frequency |
|---|---|
| white | 4 |
| brown | 7 |
| black | 3 |
| multi-color | 6 |
A histogram is a way to represent data on a number line with bars. Data values are grouped by ranges. The height of each bar shows how many data values are in that range. Each bar includes the left-end value but not the right-end value.
The interquartile range is one way to measure how spread out a data set is. To find the IQR, subtract the first quartile (Q1) from the third quartile (Q3).
For example, the IQR of this data set is 20 because \(50-30=20\).
| 22 | 29 | 30 | 31 | 32 | 43 | 44 | 45 | 50 | 50 | 59 |
| Q1 | Q2 | Q3 |
The mean is one way to measure the center of a data set. It can be thought of as a balance point. To find the mean, add all the numbers in the data set. Then divide by how many numbers there are.
The mean is 11. So, the typical travel time is 11 minutes.
The mean absolute deviation (MAD) is one way to measure how spread out a data set is. To find the MAD, find the distance between each data value and the mean. Add all the distances. Then divide by how many distances there are.
The MAD is 2.4. So, these travel times are typically 2.4 minutes away from the mean of 11 minutes.
A measure of center is a value that seems typical for a data distribution.
Mean and median are both measures of center.
The median is one way to measure the center of a data set. It is the middle number when the data set is listed in order of value.
A set of numerical data has values that are numbers.
For example, Han lists the ages of people in his family: 7, 10, 12, 36, 40, 67.
Quartiles are the numbers that divide a data set into four sections. Each section has the same number of data values.
In this data set, the first quartile (Q1) is 30. The second quartile (Q2) is the median, 43. The third quartile (Q3) is 50.
| 22 | 29 | 30 | 31 | 32 | 43 | 44 | 45 | 50 | 50 | 59 |
| Q1 | Q2 | Q3 |
The range is the distance between the smallest and largest values in a data set.
In the data set 3, 5, 6, 8, 11, 12, the range is 9, because \(12-3=9\).
The spread of a set of data tells how far apart the values are.
These dot plots show that the travel times for students in South Africa are more spread out than for students in New Zealand.
A statistical question can be answered by collecting data that has different values. Here are some examples of statistical questions: