STA218 Assignment for Quiz 10
Quiz on Thursday April 5th at the beginning of class
For this quiz, you will be allowed a single 8.5 by 11 one-sided formula sheet and a calculator. So write your formula sheet and use it to do the homework. We will supply a copy of the normal table. You might need the normal table for p-values.
Here is a set of standard questions that apply to any statistical test. These are the questions you will see on both the quiz and the final exam. Any reference to the "standard questions" means these:
Some discussion may be useful before we start. The textbook frequently wants you to choose a one-sided test when it is clear to me that a two-sided test is better. Many of the examples and homework problems even provide a model of looking at the results first, and then phrasing the research question (corresponding to the alternative hypothesis) so that it implies a one-tailed test in the direction of the results that were obtained (Like "can we conclude that the engine is really using too much fuel?") If the results had come out in the other direction, they would have phrased the question differently, and adopted a rejection region in the opposite tail. Effectively, this is the same as doing a two-tailed test, but with a significance level that is twice what you are claiming.
This practice was so common in real applications of statistical tests, and so hard to eradicate, that after a brief period of "controversy," scientific journals mostly stopped publishing claims that were backed up with one-sided tests. Basically, our textbook is about 30 years behind the times in this respect, though it is modern in other ways.
There are a few cases, mostly in business or government and not in science, where a one-tailed test is still appropriate. If you can really be sure that there would be absolutely no consequences, and no action of any kind if the results came out opposite to what you are looking for, then go ahead and do a one-tailed test. I gave an example in class. If a factory is emitting poisonous chemicals above a certain (average) standard, then there are serious regulatory consequences. If they are below the standard, nobody cares; it's more or less expected.
The rule for our class is that you will always do two-sided tests unless you are explicitly told that the problem calls for a one-sided test. This means that the answers in the back of the book (as well as those scanned and posted by Lennon) will often be "wrong," at least in terms of the alternative hypotheses, p-values and conclusions. However, the numerical values of Z and t will still be okay.
Please read pages 271-279. You are responsible for: null hypothesis, alternative hypothesis, rejection region, critical value, Type I error, Type II error. The quantity α (introduced in the discussion of Type I error) is called the significance level; it is very important. You will not be asked anything about the acceptance region because some people think you should never accept the null hypothesis. You will not be asked anything about β or about power, because we are running out of time.
Z-test for a single mean: Read 280-285. You're not responsible for power.
t-test for a single mean: Read 286-289. Note that you will only apply this method if you believe the data come from a normal distribution. It is not a tool for small samples in general. Our course does not provide any solutions for the case of small samples from a distribution that is not normal.
Please do the following exercises.
p-values: Please read Pages 293-296 and do Exercises 8.16 and 8.18. You are responsible for the concept of a p-value, and how to calculate it for Z-tests (not t-tests).
Z-test for two means: Please read pages 297-299. The phrase "when sample sizes are large" will mean n1≥30 and n2≥30, because the population variances will never be known; we'll always substitute sample variances in the formula for Z. Also, D0, the difference between population means that is stated in the null hypothesis, will always be zero; you might as well record it that way (with no ) on your formula sheet.
t-test for two means (2-sample t-test, or independent t-test): Please read pages 299-304. There are several things to note here.
The practical implication of all this for you is that you will only apply the t-test when the problem explicitly states that the distribution is normal. As in the case of the one-sample test, this course does not provide tools for small samples from non-normal populations.
Now please do these exercises.
Paired difference tests: When the data consist of differences between matched pairs of data, like before-after or a comparison of wheels on the same automobile, you just apply a one-sample test to the differences, usually testing H0: μ=0. The book presents this as a separate method (called a paired-difference test) and presents separate formulas with d-bar instead of x-bar and so on, but it's just the same old formulas with a slightly more obscure notation. The book presents only t-tests, but of course you could do a one-sample Z-test on differences from a large sample from a possibly non-normal distribution -- of differences. Note that for the paired-difference t-test, it is assumed that the differences are normally distributed. Anyway, please read pages 307-312. Their formula for a confidence interval is nothing new.
Look at exercise 8.43. This really is a dog, no, that's unfair to dogs. The assumption of normality is extremely questionable. Don't do that problem. Instead, do 8.44, answering the standard questions. Assume normality, which they forgot to say, again. Don't bother with the confidence interval.
Z-test for a single proportion: Please read pages 316-318. Note the useful and clear discussion (p. 317, repeated in blue on p. 318) of why you use p0 instead of p-hat in the standard error. And as usual, their x, in p-hat=x/n, is our Σxi, where the data are binary.
Do Exercise 8.52, sticking to our standard questions instead of the book's questions. First test if the sample size is big enough (use p0). Can you conclude that the check verification programme reduced the number of bad checks?
Z-test for difference between proportions: Please read pages 320-323. As usual, D0 will always equal zero, so only the second-to-last formula for Z on page 321 is needed. The formula for "p-hat" reduces to the proportion of ones in the two samples combined. That's right; merge the two samples into one big sample, and calculate the sample proportion. You will not be responsible for checking whether the sample sizes are big enough.
Please do Exercise 8.58, but answer the standard questions. Ignore the questionable suggestion that the modification of the production line "could not possibly increase the fraction defective." What would you do if it did? As usual, we do a t-tailed test.
You are not responsible for any tests or confidence intervals for variances.
Do the assigned problems in preparation for the quiz. They are not to be handed in.