Not all roles available for this page.
Sign in to view assessments and invite other educators
Sign in using your existing Kendall Hunt account. If you don’t have one, create an educator account.
Coins are usually stamped with the year and location of the mint where they were made. D represents the mint in Denver, Colorado, and a blank or P represents the mint in Philadelphia, Pennsylvania.
Diego has a jar containing 36 coins. Select a sample of 5 coins by rolling your number cube once to represent the row and then rolling again to find the column. For example, rolling a 3 and then a 5 would represent selecting the coin marked “2000 P.” Repeat this process to collect a sample of 5 coins. The samples are drawn with replacement, which means we allow for the opportunity to draw the same coin more than once into our sample.
| coin 1 | coin 2 | coin 3 | coin 4 | coin 5 | sample mean year | sample proportion minted in Denver | |
|---|---|---|---|---|---|---|---|
| sample 1 | |||||||
| sample 2 | |||||||
| sample 3 |
A manufacturer is worried that their product may not be consistently good enough to pass quality control inspections. They are going to take a random sample of 10 of their products and have a quality control expert examine the items to determine if they pass or fail.
Your teacher will give you a bag with paper slips inside. 7 are marked “pass” and 3 are marked “fail.”
| simulated sample | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| number of "pass" slips | |||||
| proportion of sample that passes |
In many cases, it is difficult to collect data from an entire population, so using data from a small subset of the larger group, called a sample, is needed. The trade-off is that the incomplete information from samples can provide only estimates of characteristics for the population.
For example, an ecologist may wonder what proportion of trees in a particular large forest contains bird nests. It would be hard to look at every tree in the forest, so the researcher might take a random sample of 1,000 trees to find the proportion that have nests. Let’s say the ecologist found that a proportion of 0.04 trees in the sample have nests. That is a good number to give as an estimate for the percentage of trees in the forest that have nests, but because the sample was randomly selected, it would probably not be surprising if 0.028 or 0.05 of the trees have nests.
To give a sense of the variability and confidence in estimates, a margin of error is usually given along with the point estimate. A margin of error is the maximum expected difference between a point estimate for a population characteristic and the actual value of the population characteristic. For means and proportions, the distribution of point estimates derived from samples, called the sampling distribution, tends to be approximately normal and centered around the actual population characteristic, so it is reasonable to expect that about 95% of the point estimates are within 2 standard deviations of the actual population characteristic. In this unit, we will use 2 standard deviations of the sampling distribution as the margin of error.
The ecologist could take the data from their sample and use a computer to simulate drawing, with replacement, thousands more samples of the same size from the original sample to create a sampling distribution. If the standard deviation for the sampling distribution is 0.006, then they can use a margin of error of 0.012 along with the estimate of 0.04 for the population proportion and report an estimate of for the proportion of trees in the forest that have nests. This means that any value in the range of 0.028 to 0.052 would not be surprising for the proportion of trees in the forest that have nests.
The maximum expected difference between an estimate for a population characteristic and the actual value of the population characteristic.
A sampling distribution is a distribution of statistics obtained from samples drawn from a specific population. For example, a group of students draw several samples from a population that has 70% of items that Pass. The proportion of papers from each sample that are marked Pass is shown in this dot plot.