Each histogram represents a group of 500 healthy people who had their temperature taken. Three histograms represent examples of data that approximate a normal distribution, and three histograms represent data that do not approximate a normal distribution. What do you think are the elements that define a normal distribution?
Some approximately normal distributions:
Histogram. The vertical axis is labeled from 0 to .24 by .02’s. The horizontal axis, labeled body temperature, degrees Fahrenheit, has bin widths of .5, starting at 95. Beginning at 95 up to, but not including 95.5, height of bar at each interval is 0, 0, .008, .018, .086, .136, .224, .222, .176, .082, .034, .014, 0.
Histogram. The vertical axis is labeled from 0 to .24 by .2’s. The horizontal axis, labeled body temperature, degrees Fahrenheit, has bin widths of .5, starting at 95. Beginning at 95 up to, but not including 95.5, height of bar at each interval is .003, .01, .024, .075, .114, .166, .194, .188, .121, .066, .024, .014, .001, 0.
Histogram. The vertical axis is labeled from 0 to .24 by .02’s. The horizontal axis, labeled body temperature, degrees Fahrenheit, has bin widths of .5, starting at 101. Beginning at 101 up to, but not including 101.5, height of bar at each interval is .008, .018, .086, .136, .224, .222, .176, .082, .034, .014, 0.
Some distributions that are not approximately normal distributions:
Histogram. The vertical axis is labeled from 0 to .24 by .2’s. The horizontal axis, labeled body temperature, degrees Fahrenheit, has bin widths of .5, starting at 95.5. Beginning at 95.5 up to, but not including 96, height of bar at each interval is .196, .126, .092, .068, .016, .01, .02, .064, .092, .122, .194, 0.
Histogram. The vertical axis is labeled from 0 to .24 by .2’s. The horizontal axis, labeled body temperature, degrees Fahrenheit, has bin widths of .5, starting at 95.5. Beginning at 95.5 up to, but not including 96, height of bar at each interval is .018, .034, .05, .062, .102, .16, .228, .242, .006, .004, 0.
Histogram. The vertical axis is labeled from 0 to 120 by 10s. The horizontal axis, labeled body temperature, degrees Fahrenheit, has bin widths of .5, starting at 96. Beginning at 96 up to, but not including 96.5, height of bar at each interval is 45, 51, 52, 48, 52, 53, 49, 48, 55, 47, 0.
5.2
Activity
Playing a Piano
On some pianos, the average distance from one white key to the next is 2.39 centimeters. How many of your classmates could reach two notes that are 9 keys apart (21.5 cm) on a keyboard using only one hand?
Stretch your fingers apart as wide as you can, and measure the farthest distance from your thumb to your smallest finger. Round your measurement to the nearest tenth of a centimeter.
Your teacher will collect the measurements from the class. Draw a dot plot or histogram from the class data.
Describe the distribution you drew using terms such as “symmetric,” “approximately symmetric,” “skewed left,” “skewed right,” “approximately uniform,” “uniform,” “bell-shaped,” or “bimodal.” Estimate the center of your distribution.
How would you use your distribution to determine how many people in the class can reach the two notes 9 keys apart?
5.3
Activity
Relative Frequency Distribution
Manufacturers of butter make sticks of butter that weigh 110 grams on average. A manufacturer suspects the machine that forms the sticks of butter may have a problem, so they weigh each stick of butter the machine produces in an hour.
The weights are grouped into intervals of 0.5 gram and are summarized in a frequency table.
weight (grams)
frequency
relative frequency
107–107.5
5
0.0043
107.5–108
17
108–108.5
52
108.5–109
118
109–109.5
172
109.5–110
232
110–110.5
219
110.5–111
172
111–111.5
95
111.5–112
57
112–112.5
23
112.5–113
8
113–113.5
1
total
1,171
The same data are summarized in this histogram.
Histogram. The vertical axis is labeled from 0 to 240 by 20’s. The horizontal axis, labeled weight, grams, has bin widths of .5, starting at 107. Beginning at 107 up to, but not including 107.5, height of bar at each interval is 5, 17, 52, 118, 172, 232, 219, 172, 95, 57, 23, 8, 1, 0, 0.
Although this information is useful, it might be more helpful to know the proportion of sticks of butter in each weight interval rather than the actual number of sticks in that weight interval.
Complete the table by dividing each frequency value by the total number of sticks of butter in the data set. Round each value to 4 decimal places.
A relative frequency histogram is a histogram in which the height of each bar is the relative frequency. Since the heights of the bars are found by dividing each height by the total number of sticks of butter, the shape of the distribution is the same as a regular histogram, but the labels on the -axis are changed. Label the -axis with the correct values for each mark.
Histogram. The vertical axis labels are blank. The horizontal axis, labeled weight, grams, has bin widths of .5, starting at 106. The bar heights are shaped in a bell curve.
The manufacturer believes they should replace parts of the machine if more than 25% of the sticks of butter are more than 1 gram away from the intended value of 110 grams.
Indicate on the relative frequency histogram the bars that correspond to sticks of butter that are more than 1 gram away from the intended weight of a stick of butter.
Should this machine be replaced? Explain or show your reasoning.
5.4
Activity
The Normal Curve
These curves represent normal distributions with different means and standard deviations. What do you notice?
mean: 10, standard deviation: 1
A bell-shaped distribution with the horizontal axis labeled from 0 to 20 by 2’s. The peak of the bell is centered at 10. The vertical axis labeled from 0 to 0.5 by 0.1’s. The bell curve is skinny, with most of the curve between 8 and 12.
mean: 10, standard deviation: 0.8
A bell-shaped distribution with the horizontal axis labeled from 0 to 20 by 2’s. The peak of the bell is centered at 10. The vertical axis labeled from 0 to 0.5 by 0.1’s. The bell curve is skinny, with most of the curve between 8 and 12.
mean: 12, standard deviation: 1
A bell-shaped distribution with the horizontal axis labeled from 0 to 20 by 2’s. The peak of the bell is centered at 12. The vertical axis labeled from 0 to 0.5 by 0.1’s. The bell curve is skinny, with most of the curve between 10 and 14.
mean: 8, standard deviation: 2
A bell-shaped distribution with the horizontal axis labeled from 0 to 20 by 2’s. The peak of the bell is centered at 8. The vertical axis labeled from 0 to 0.5 by 0.1’s.
mean: 10, standard deviation: 2
A bell-shaped distribution with the horizontal axis labeled from 0 to 20 by 2’s. The peak of the bell is centered at 10. The vertical axis labeled from 0 to 0.5 by 0.1’s.
Student Lesson Summary
A histogram shows the number of items in a data set that fall into specified intervals. A relative frequency histogram shows the proportion of an entire data set that falls into specified intervals.
For example, a study measured the handspan, in centimeters, of 1,000 adults. A handspan is the distance from the thumb to the smallest finger when the fingers are stretched out as much as possible. A histogram shows the number of people whose handspans are in certain intervals. The height of each bar in a histogram represents the frequency for the corresponding interval.
In this example, there are 132 adults in the study whose handspans are at least 20 centimeters but less than 20.5 centimeters.
Histogram from 16.5 to 25.5 by 0 point 5’s. Handspan, centimeters. Beginning at 16.5 up to but not including 17, height of bar at each interval is 1, 7, 13, 16, 52, 59, 87, 132, 159, 132, 104, 98, 69, 32, 23, 10, 4, 2, 0.
A histogram that uses the relative frequencies shows a distribution with the same shape, but the heights of the bars represent the relative frequency for the corresponding intervals.
For example, out of all the adults in the study, 13.2% have handspans that are at least 20 centimeters but less than 20.5 centimeters (since ).
Histogram from 16.5 to 25.5 by 0 point 5’s. Handspan, centimeters. Beginning at 16.5 up to but not including 17, height of bar at each interval is .001, .007, .013, .016, .052, .059, .087, .132, .159, .132, .104, .098, .069, .032, .023, .010, .004, .002, 0.
Similarly to how we can model data in a scatter plot with a line or other curve so that additional information can be estimated or predicted, it can be useful to model an approximately symmetric and bell-shaped distribution with a particular distribution called the normal distribution. A normal distribution is symmetric and bell-shaped, has an area of 1 between the -axis and the curve, and has the -axis as a horizontal asymptote. A normal distribution is determined entirely by the mean and standard deviation.
For the handspan data, the mean is 20.9 cm and the standard deviation is 1.41. The curve that represents the normal distribution with this same mean and standard deviation is shown on top of the relative frequency histogram for the actual data.
Notice that the curve does a fairly good job of modeling the actual data in this situation, although it is not perfect.
Histogram from 16.5 to 25.5 by 0 point 5’s. Handspan, centimeters. Beginning at 16.5 up to but not including 17, height of bar at each interval is .001, .007, .013, .016, .052, .059, .087, .132, .159, .132, .104, .098, .069, .032, .023, .010, .004, .002, 0. A bell-curve line is drawn at the top of each of the bars across the graph.
Glossary
normal distribution
mean = 10. standard deviation = 1
A bell-shaped distribution with the horizontal axis labeled from 0 to 20 by 2’s. The peak of the bell is centered at 10. The vertical axis labeled from 0 to 0.5 by 0.1’s. The bell curve is skinny, with most of the curve between 8 and 12.
mean = 10. standard deviation = 2
A bell-shaped distribution with the horizontal axis labeled from 0 to 20 by 2’s. The peak of the bell is centered at 10. The vertical axis labeled from 0 to 0.5 by 0.1’s.
mean = 8. standard deviation = 2
A bell-shaped distribution with the horizontal axis labeled from 0 to 20 by 2’s. The peak of the bell is centered at 8. The vertical axis labeled from 0 to 0.5 by 0.1’s.
A specific distribution in statistics whose graph is symmetric and bell-shaped, has an area of 1 between the -axis and the graph, and has the -axis as a horizontal asymptote.
relative frequency histogram
Histogram from 16.5 to 25.5 by 0 point 5’s. Handspan, centimeters. Beginning at 16.5 up to but not including 17, height of bar at each interval is .001, .007, .013, .016, .052, .059, .087, .132, .159, .132, .104, .098, .069, .032, .023, .010, .004, .002, 0.
A histogram where the height of each bar is the fraction of the entire data set that falls into the corresponding interval (that is, it is the relative frequency with which the data values fall into that interval).
Have feedback on the curriculum?
Help us improve by sharing suggestions or reporting issues.