Unit 8

Data Sets, Distributions, and Sampling

Unit Narrative

This unit is a brief overview of some key statistical concepts. First, students learn about populations and study variables associated with a population. They begin by classifying questions as either statistical or non-statistical—based on whether variable data is necessary to answer the question. This leads to further investigation into variability and data displays, such as dot plots and histograms. As students visualize data, they begin to describe the distribution of data more precisely as they work with mean and mean absolute deviation (MAD).

After working with those statistics, students begin to recognize that some distributions are not well-suited to description by mean and MAD. Students are introduced to median, range, and interquartile range as additional measures of center and variability that can be used to describe distributions in some situations. That also leads to the box plot as an additional way to visualize data.

Box plots and dot plots for two sets of data: "pug weights in kilograms” and "beagle weights in kilograms".<br>

Next, students examine different ways to collect data from samples within a population to understand why random selection is useful. Then students generate samples and estimate information about the population from sample data.

The unit concludes with an optional section exploring probability. Students are introduced to probability as a way to quantify how likely an event is to happen. They explore the connection between probability and results of repeated experiments, ways to examine the sample space for more complex experiments, and simulating experiments.

Note that the introduction of mean absolute deviation is used as an introductory model for understanding variability. Although standard deviation is more mathematically useful, its calculation and meaning may be difficult for students at this level without an understanding of normal distributions. In later courses, when student understanding of variability and their exposure to additional distributions is expanded, students will learn about standard deviation and evolve their understanding away from mean absolute deviation.

Progression of Disciplinary Language

In this unit, teachers can anticipate students using language for mathematical purposes, such as comparing, interpreting, and justifying. Throughout the unit, students will benefit from routines designed to grow robust disciplinary language, both for their own sense-making and for building shared understanding with peers. Teachers can formatively assess how students are using language in these ways, particularly when students are using language to:

Compare

Questions that produce numerical and categorical data (Lesson 1).
Dot plots and histograms (Lesson 3).
Features and distributions of data sets (Lessons 4 and 5).
Measures of center with samples (Lesson 9).
Sampling methods (Lesson 10).
Methods for writing sample spaces (Lesson 15).

Interpret

Dot plots (Lessons 2 and 5).
Histograms (Lesson 3).
Mean of a data set (Lesson 4).
Five-number summaries and box plots (Lesson 7).
Situations involving populations and samples (Lesson 8).
Situations involving sample spaces and probability (Lesson 16).

Justify

Reasoning for matching data sets to questions (Lesson 1).
Reasoning about mean and median (Lesson 6).
Which samples are or are not representative of a larger population (Lesson 9).
Which samples correspond with different populations (Lesson 11).
Whether situations are surprising and possible (Lesson 14).

In addition, students are expected to represent data using dot plots, histograms, five-number summaries, and box plots, and to represent probabilities using sample spaces. Students also have opportunities to use language to describe features of a data set, describe patterns observed in repeated experiments, and explain how to use a simulation to answer questions about the situation.

The table shows lessons where new terminology is first introduced in this course, including when students are expected to understand the word or phrase receptively and when students are expected to produce the word or phrase in their own speaking or writing. Terms that appear bolded are in the Glossary. Teachers should continue to support students’ use of a new term in the lessons that follow where it was first introduced.

lesson	new terminology
lesson	receptive	productive
Acc6.8.1	numerical data categorical data dot plot statistical question variability distribution frequency
Acc6.8.2	center spread typical	variability
Acc6.8.3	histogram bins	distribution center spread
Acc6.8.4	average mean measure of center fair share balance point
Acc6.8.5	mean absolute deviation (MAD) measure of spread symmetrical	mean typical
Acc6.8.6	median peak cluster unusual value	measure of center
Acc6.8.7	range quartile interquartile range (IQR) box plot whisker five-number summary	median measure of spread minimum maximum
Acc6.8.8	population sample survey	mean absolute deviation (MAD)
Acc6.8.9	representative
Acc6.8.10	random sample
Acc6.8.11	measure of variability	population sample random sample symmetrical
Acc6.8.12		representative measure of variability
Acc6.8.13	event chance experiment outcome probability random sample space	likely unlikely impossible certain
Acc6.8.14		outcome probability
Acc6.8.15	tree (diagram)	sample space
Acc6.8.16		tree (diagram)
Acc6.8.17	simulation	random

Unit | Loading...

Navigation

Gradeband

Course

Unit

Settings

Audience

Assessment Access

Data Sets, Distributions, and Sampling

Unit Narrative

Dot Plots and Histograms

Section Goals

Section Narrative

Dot Plots and Histograms

Representing Data

Using Dot Plots to Answer Statistical Questions

Interpreting Histograms

Measures of Center and Variability

Section Goals

Section Narrative

The Mean

Variability and MAD

The Median

Box Plots and Interquartile Range

Measures of Center and Variability

The Mean

Variability and MAD

The Median

Box Plots and Interquartile Range

Sampling

Section Goals

Section Narrative

Larger Populations

What Makes a Good Sample?

Sampling in a Fair Way

Estimating Population Measures of Center

Sampling

Larger Populations

What Makes a Good Sample?

Sampling in a Fair Way

Estimating Population Measures of Center

Probability

Section Goals

Section Narrative

What Are Probabilities?

Estimating Probabilities through Repeated Experiments

Keeping Track of All Possible Outcomes

Multi-step Experiments

Probability

What Are Probabilities?

Estimating Probabilities through Repeated Experiments

Keeping Track of All Possible Outcomes

Multi-step Experiments

Have feedback on the curriculum?

Unit 8 Resources

About Unit 8

Other Resources

More about Sampling Variability (Optional)

Designing Simulations

Sampling

Section Goals

Section Narrative

Larger Populations

What Makes a Good Sample?

Sampling in a Fair Way

Estimating Population Measures of Center

More about Sampling Variability (Optional)

Probability

Section Goals

Section Narrative

What Are Probabilities?

Estimating Probabilities through Repeated Experiments

Keeping Track of All Possible Outcomes

Multi-step Experiments

Designing Simulations

Measures of Center and Variability

Section Goals

Section Narrative

The Mean

Variability and MAD