Suppose all samples of size [latex]n[/latex] are selected from a population with mean [latex]\mu[/latex] and standard deviation [latex]\sigma[/latex]. For each sample, the sample mean [latex]\overline[/latex] is recorded. The probability distribution of these sample means is called the sampling distribution of the sample means. The central limit theorem describes the properties of the sampling distribution of the sample means.
Suppose all samples of size [latex]n[/latex] are taken from a population with mean [latex]\mu[/latex] and standard deviation [latex]\sigma[/latex]. The collection of sample means forms a probability distribution called the sampling distribution of the sample mean.
Because the central limit theorem states that the sampling distribution of the sample means follows a normal distribution (under the right conditions), the normal distribution can be used to answer probability questions about sample means. The [latex]z[/latex]-score for the sampling distribution of the sample means is
where [latex]\mu[/latex] is the mean of the population the sample is taken from, [latex]\sigma[/latex] is the standard deviation of the population the sample is taken from, and [latex]n[/latex] is the sample size.
Because the distribution the sample means follows a normal distribution (under the right conditions), the norm.dist(x,[latex]\mu[/latex],[latex]\sigma[/latex],logic operator) function can be used to calculated probabilities associated with a sample mean.
In this case, we want to calculate probabilities associated with a sample mean. The sample means follow a normal distribution (under the right conditions), which allows us to use the norm.dist function to calculate probabilities. Because we are working with sample means, we must enter the mean and the standard distribution of the distribution of the sample means into the norm.dist function, and not the mean and standard distribution of the population the samples are taken from. The mean of the sample means equals the mean of the population, so we are entering the value of [latex]\mu[/latex] into the second field of the norm.dist function. But the standard distribution of the sample means equals [latex]\displaystyle>>[/latex], so we must enter this value into third field of the norm.dist function.
We use the norm.dist function in the same way as we learned previously to calculate the probability a sample mean is less than a given value, a sample mean is greater than a given value, or a sample mean is in between two given values.
An alternative approach in Excel is to use the norm.s.dist(z,true) function. In the norm.s.dist function, we enter the z-score for the corresponding value of [latex]\overline[/latex] (using the z-score for sample means given above).
The length of time, in hours, it takes an “over 40” group of people to play one soccer match is normally distributed with a mean of 2 hours and a standard deviation of 0.5 hours. Suppose a sample of size 25 is drawn randomly from the population.
Solution:
Function | norm.dist | Answer |
Field 1 | 1.7 | 0.0013 |
Field 2 | 2 | |
Field 3 | 0.5/sqrt(25) | |
Field 4 | true |
Function | 1-norm.dist | Answer |
Field 1 | 2.2 | 0.0228 |
Field 2 | 2 | |
Field 3 | 0.5/sqrt(25) | |
Field 4 | true |
Function | norm.dist | -norm.dist | Answer |
Field 1 | 2.3 | 1.8 | 0.9759 |
Field 2 | 2 | 2 | |
Field 3 | 0.5/sqrt(25) | 0.5/sqrt(25) | |
Field 4 | true | true |
The length of time taken on the SAT for a group of students has a mean of 2.5 hours and a standard deviation of 0.25 hours. A sample size of 60 is drawn randomly from the population.
Function | norm.dist | -norm.dist | Answer |
Field 1 | 2.8 | 2.4 | 0.9990 |
Field 2 | 2.5 | 2.5 | |
Field 3 | 0.25/sqrt(60) | 0.25/sqrt(60) | |
Field 4 | true | true |
Function | 1-norm.dist | Answer |
Field 1 | 2.6 | 0.0010 |
Field 2 | 2.5 | |
Field 3 | 0.25/sqrt(60) | |
Field 4 | true |
Function | norm.dist | Answer |
Field 1 | 2.45 | 0.0607 |
Field 2 | 2.5 | |
Field 3 | 0.25/sqrt(60) | |
Field 4 | true |
In a recent study reported Oct. 29, 2012 on the Flurry Blog, the mean age of tablet users is 34 years and the standard deviation is 15 years. Suppose a sample of 100 tablet users is taken.
Solution:
Function | 1-norm.dist | Answer |
Field 1 | 30 | 0.9962 |
Field 2 | 34 | |
Field 3 | 15/sqrt(100) | |
Field 4 | true |
In an article on Flurry Blog, a gaming marketing gap for men between the ages of 30 and 40 is identified. You are researching a start-up game targeted at the 35-year-old demographic. Your idea is to develop a strategy game that can be played by men from their late 20s through their late 30s. Based on the article’s data, industry research shows that the average strategy player is 28 years old with a standard deviation of 4.8 years. You take a sample of 100 randomly selected gamers. If your target market is 29- to 35-year-olds, should you continue with your development strategy?
Click to see Solution
You need to determine the probability for men whose mean age is between 29 and 35 years of age wanting to play a strategy game.
Function | norm.dist | -norm.dist | Answer |
Field 1 | 35 | 29 | 0.0186 |
Field 2 | 28 | 28 | |
Field 3 | 4.8/sqrt(100) | 4.8/sqrt(100) | |
Field 4 | true | true |
There is 1.86% chance that the mean age of men who will play your game is between 29 years and 35 years. Because this is a very low probability, you should not continue your development strategy.
The mean number of minutes for app engagement by a tablet user is 8.2 minutes with a standard deviation of 1 minute. Suppose a sample of 60 table users is taken.
Solution:
Function | norm.dist | -norm.dist | Answer |
Field 1 | 8.5 | 8 | 0.9293 |
Field 2 | 8.2 | 8.2 | |
Field 3 | 1/sqrt(60) | 1/sqrt(60) | |
Field 4 | true | true |
Function | norm.dist | Answer |
Field 1 | 8.3 | 0.7807 |
Field 2 | 8.2 | |
Field 3 | 1/sqrt(60) | |
Field 4 | true |
Cans of a cola beverage claim to contain 16 ounces with a standard deviation of 0.143 ounces. The amounts in a sample of 34 cans are measured and the mean is 16.01 ounces. Find the probability that a sample of 34 cans will have an average amount greater than 16.01 ounces. Do the results suggest that cans are filled with an amount greater than 16 ounces?
Click to see Solution
Function | 1-norm.dist | Asnwer |
Field 1 | 16.01 | 0.3417 |
Field 2 | 16 | |
Field 3 | 0.143/sqrt(34) | |
Field 4 | true |
Because there is a 34.17% probability that the average sample volume is greater than 16.01 ounces, we should be skeptical of the company’s claimed volume. That is, based on this sample, it is likely that the average volume of the cans is higher than the claimed 16 ounces.
As consumers, we would be glad if the average was higher than 16 ounces because we are likely receiving more cola in the can that what we paid for. As the manufacturer, we would need to inspect our bottling process to determine if the processes is working within acceptable limits.
The distribution of the sample means follows a normal distribution if one of the following conditions is met:
The mean of the sample means [latex]\mu_<\overline