In this unit, students use statistical methods to look for associations in bivariate data. The unit begins with students analyzing categorical data arranged in two-way tables. Students use the relative frequencies of the combinations of those categorical variables to check for evidence of any associations in the data.
The unit then transitions to bivariate numerical data, which are visualized using scatter plots and lines of best fit. Students use technology to compute the lines of best fit and observe how well the linear models match the data. Residuals and correlation coefficients are used to quantify the goodness of fit for linear models.
The unit closes with an exploration of the difference between correlation and causal relationships, and it is also an opportunity to apply this learning to areas of interest, like anthropology and sports.
In grade 8, students informally constructed scatter plots and lines of fit, noticed linear patterns, and observed associations in categorical data using two-way tables. In this unit, students build on this previous knowledge by assessing how well a linear model matches the data by using residuals as well as the correlation coefficient for best-fit lines (found using technology).
There are opportunities to practice concepts from a previous unit by interpreting the slope and intercept of a linear model in context as well as using the models to predict one variable given information about the other.
A scatterplot. Horizontal, from 0 to 3, by 0 point 5's, labeled weight in pounds. Vertical, 0 to 2 point 5, by 0 point 25s, labeled price in dollars.
12 dots trending upward and to the right. A line of best fit passes through the y axis at 0 comma 0 point 92, and trends upwards and to the right, passing through three dots.
Create relative frequency tables from information given in a two-way table or about a situation.
Inspect patterns in relative frequency tables and two-way tables to determine if there is a possible association between two variables of interest.
Section Narrative
This section focuses on interpreting categorical data using two-way tables. First, students recall from grade 8 how to interpret the values in a two-way table. Then, they expand on their understanding by creating relative frequency tables and use those to look for an association between the variables.
Comprehend the connection between residuals, variability, and whether or not using a linear model is appropriate.
Interpret the rate of change and vertical intercept for a linear model in the context of a situation.
Section Narrative
In this section, students focus on linear models and how well they fit data. The first lesson motivates the importance of having a linear model by making sense of the slope and intercept in context and showing how to predict additional values from the model.
The next two lessons are about informally assessing how to obtain the linear models. Students progress from selecting between given models to estimating their own models to using technology in order to find a best-fit line. They then use residuals to more quantitatively understand how well a linear model fits data.
Describe the strength and sign of the relationship between variables based on the correlation coefficient.
Investigate the relationship between two variables to analyze whether or not the relationship is causal.
Section Narrative
In this section, students look more deeply at the relationships between two variables. In particular, they use the correlation coefficient to categorize the relationships as strong or weak and as positive or negative. They then use their understanding of situations to suggest whether the relationship is causal or merely a correlation.
In the final section, students have the opportunity to apply their thinking from throughout the unit. As this is a short section followed by an End-of-Unit Assessment, there are no section goals or checkpoint questions.