STA441s20 Final Exam



On Thursday April 16th around 3 pm, I posted some practice questions on the multinomial logit model, with answers.

On Friday morning, the link to Quiz 9 is working now.

On Friday afternoon, answer to Q7 of 2018 final is posted.



Summary


Office Hours

Office hours will be in the Quercus course room.

Past Exams

Quiz solutions

Multinomial Logit Practice

Homework

This course is all about the homework. The homework tells you what I want you to be able to do. Lecture material is only useful to the extent that it helps you do the homework. The text may help too. It is less focused on what we are doing this time, but it is more detailed.

To study for the final, I recommend that you

  1. Re-do the non-SAS parts of the homework.
    1. For each assignment, locate the corresponding lecture slides. They are pretty much in chronological order (order of time). If this is a difficult task, you are not familiar enough with the course material.
    2. Look at the lecture slides and the homework problems together. Observe how most of the homework problems are asking you to use some concept or method from the lecture. Of course sometimes I just want you to think about something, but most questions have a lesson.
    3. Re-do the problems, referring to your earlier answers
    4. If you do not get what a problem means or what it is asking you to do, this means you should find out. You are missing something, and it could be on the final exam.
  2. Using SAS, do something reasonable with the final data sets described below. What's reasonable? In my opinion, more or less what you did on the SAS part of the homework. However, there is more than one "right answer." The important thing is to become familiar with the data sets, try some analyses, and understand the results. You will not bring your output to the exam. Questions will be based on my output.

Sample Sample Size Questions

  1. In a double-blind drug trial with an experimental group and a control group, we want to be 90% sure of detecting an effect on blood pressure if the true mean response to the drug is a quarter of a standard deviation above the control group. The plan is to use a 2-tailed t-test with the usual 0.05 significance level. What is the smallest sample size that will get this job done? (Using matpow1.sas, I got n = 675 which I increased to 676 to maintain equal sample sizes.)
  2. In another version of the first question, suppose we want a power of 0.90 if the drug explains 1% of the population variation in blood pressure. What sample size is needed? Note that "remaining variation" in this case is just variation, because we are not controlling for anything. The same formulas apply. (Using popvar.sas, which writes on the log file, I got n = 1,045. There was no assumption of equal sample sizes.)
  3. In yet another version of the 2-sample problem, what sample size is required if we want the difference to be significant provided it explains 1% of the sample variation? That's an R2 of 0.01, which is tiny. (Using sampvar.sas, I get n = 385)
  4. Baby chickens are to be randomly assigned to one of four feed formulas, and then weighed after 6 weeks. Suppose the true (population) treatment means in grams are μ1 = 220, μ2 = 275, μ3 = 250 and μ4 = 330, with a common variance of σ2 = 2,800. Based on an F-test with the usual α = 0.05 significance level, what total sample size is needed to have a power of 0.80? Assume equal sample sizes, so that your answer will be a multiple of 4. (Using matpow2.sas, I get n = 23 increased to n = 24 for equal sample sizes.)
  5. Pigs are routinely given large doses of antibiotics even when they show no signs of illness, to protect their health under unsanitary conditions. Pigs will be randomly assigned to one of three antibiotics. The response variable will be dressed weight (weight of the pig after slaughter and removal of head, intestines and skin). Mother's and father's live adult weight will be used as covariates. Suppose that antibiotic explains 12% of the remaining variation in weight after taking parents' weight into account. What sample size is required for an F-test to detect this with probability 0.80? Equal sample sizes are not required. (Using popvar.sas, I get n=79)
  6. In another version of the pig weight question above, what sample size is required for the F-test for antibiotic to be significant, provided that antibiotic explains at least 12% of the remaining sample variation? (using sampvar.sas, I get n = 52.)

Data Sets

Exam questions worth 32 points out of 100 will be based on my SAS output for at least two and at most four of the following data sets. Try some analyses. Look up any terminology that is unfamiliar, or you can ask in office hours (but why wait?). Understand what the variables are, because I will not be answering questions about the data sets during the exam. What I will do with the data is very predictable.

Further Comments











This document is licensed under a Creative Commons Attribution-ShareAlike 3.0 (or later) Unported License. The basketball data are protected by the Creative Commons license too.