STA442/1008 Final Exam Information
You will write your answers in examination books. Please avoid the temptation to write answers on the examination paper, especially when filling out tables. Copy the table into your exam book, and then fill it out. If you write on the examination paper rather than the book, your answer will probably get lost, and you will get a zero for the question.
The exam will be closed book and closed notes. You should bring a calculator.
There are nine questions. Most of the questions have more than one part. The questions are not equally difficult, and not equally time-consuming. The questions on assignments and quizzes are a good indication of what to expect.
The last several questions (worth 40 out of 100 marks on the exam) are based on SAS output that will be provided to you. Again, the type of questions will be familiar from the assignments and quizzes. More information about the SAS part is given below.
Not all parts of the course are equally represented. Here are some details.
If you do nothing else, at least familiarize yourself with the studies and variables. My SAS variable names are given, and I suggest you use them. The final exam does not include a full description of the studies and variables. During the exam, Christine and I will answer questions about the data, but only if the answers are brief.
You will notice below that unlike the SAS assignments during the term, you are not always being asked specific sample questions about the data sets. This time, it is your job to ask the relevant questions and choose the statistical techniques that will help you answer them. The questions on computer assignments during the term should be your guide. For some of the data sets, more than one statistical technique is applicible, and you should not hesitate to do more than one kind of analysis.
Of course you may discuss this with other people, but this is not the time to let yourself be convinced too easily by your friends. I promise you that in several cases, there is more than one set of questions you could ask about the data, and (correspndingly) more than one natural and reasonable analysis. If you avoid tunnel vision by doing it your way first and then comparing answers, it's more likely that one of you come up with what you'll see on the exam. In a group setting, if four people come up with six analyses, the whole group will benefit.
IQ is short for "Intelligence Quotient." It is measured by various tests. For all the tests, a score of 100 is considered average, while scores above 100 are above average and scores below 100 are below average. Whether IQ tests really measure intelligence is debatable and highly political. How much IQ is influenced by heredity as opposed to environment is also a question on which many people have strong opinions.
In the Longitudinal IQ Data, the IQs of adopted children were measured at ages 2, 4, 8 and 13. The birth mother's IQ was assessed at the time of adoption, and the adoptive mother's education (in years) was also recorded. The variables (with my variable names) are:
People who raise large numbers of birds inhale potentially dangerous material, especially tiny fragments of feathers. Can this be a risk factor for lung cancer, controlling for other possible risk factors? Which of those other possible risk factors are important? Here are the variables in the file, along with my variable names.
Variable My variable Name Values Lung Cancer cancer 1=Yes, 0=No Gender sex 1=Female, 0=Male Socioeconomic Status ses 1=High, 0=Low Birdkeeping birdkeep 1=Yes, 0=No Age age Years smoked yrsmoke Cigarettes per day cigday
In this study, pairs of university students came to a Psychology laboratory to eat a meal together. They were either friends or strangers, they ate from either small or large plates, and the food was in either a common bowl or separate bowls. Before the meal, they rated how hungry they were. The total amount of food they served out onto their plates and the total amount of food they actually ate were recorded, in grams. Here are the variables in the file, along wth my variable names.
Variable Values Friend 1 = Friends 2 = Strangers Plate 1 = Large Plate 2 = Small Plate Share 1 = Common Bowl 2 = Separate bowls Hunger Mean of the two ratings FoodSrv Grams FoodEat GramsHere are some hints. What are the natural dependent variables? What is the natural covariate? I did not make my own dummy variables.
In a dichotic listening experiment, subjects wear stereo headphones that allow the presentation of different sound tracks to each ear, at the same time. In this example, right-handed female university students listened to short lectures on art history in the presence of background noise. After each lecture, they answered a set of multiple choice questions.
Two factors were varied experimentally:.
Each subject in the experiment experienced all nine treatment combinations, in a balanced order that was different for each subject, and randomly assigned. Thus, there are nine data values for each subject: number of questions answered correctly in each experimental condition. The first part of the raw data appears below; it shows just the first three cases. Follow the link above to get the whole data set. You should use my SAS variable names (test11, etc.).
Signal in Signal in Signal in Left Ear Right Ear Both Ears ____________________ ____________________ ____________________ HipHop Classc Radio HipHop Classc Radio HipHop Classc Radio ------ ------ ------ ------ ------ ------ ------ ------ ------ test11 test12 test13 test21 test22 test23 test31 test32 test33 1 13 12 10 15 14 14 14 13 14 2 4 8 8 6 5 8 6 3 4 3 13 15 11 11 13 15 11 13 12
Christine marks the quizzes, and I mark the final examination. We have basically the same standards and objectives, but we are not identical (lucky for her). You might say that this section is about my personal peculiarities -- just in the way I mark exams, of course. It is helpful for you to know about this, so your exam-taking strategy will not conflict with my exam-marking strategy.
The purpose of STA442 is for you to learn to use statistical methods to draw reasonable conclusions from numerical data. Often, the first several parts of a question will ask for technical details, and the last part will ask for a conclusion. If the technical part is missing, it does not matter what you conclude. Similarly, an answer that has most of the technical details right but gets the conclusion wrong (or leaves it off, or states it incompletely) is almost worthless, and will get few marks. On the other hand, if you make technical mistakes but draw reasonable conclusions from what you have, you can still get substantial marks.
When I read an answer, my main goal is to verify that you know what's going on. Here are some more details, mostly about what to avoid.
In a real-world situation (and in the artificial world we presently inhabit, too), you don't get part marks for an answer that (correctly) indicates a relationship is present, but does not say what it is. Imagine you are working in marketing, and you leave a voice mail that says "Consumers recalled one of the commercials better than the other one." Click. Are you trying to frustrate your boss? Are you trying to get fired?
But this strategy backfires when I mark an exam, because (except for simple numerical answers) I usually do not give marks for things that are correct; I take off marks for things that are wrong or missing. So, if a student writes a long answer that includes the correct conclusion, the wrong conclusion (based on the same information!) and something irrelevant, all I really see is the contradiction between the two conclusions, and I will probably give the answer a zero. Yet it might be that the student understands everything perfectly, but is just writing all the crazy stuff as insurance against the unlikely possibility that maybe that's what I am looking for. Let's make sure that you don't fall into this trap!