STA429/1007 Assignment 10
Quiz on Friday March 30 at 12:10 p.m.
The file noise.dat comes from a study in which men and women in 3 different age groups are tested on their ability to understand a conversation about politics under 5 different levels of background noise. There are 10 women and 10 men in each age group for a total n = 60. Order of presentation of noise levels was randomized for each subject, and the subjects themselves were tested in random order.
There are 5 lines of data for each case.
We are only going to use three variables in this assignment: Age group, sex and discrimination score at noise level 5. As long as you read these three variables for n=60 cases, you are fine. To make sure we are doing it the same way, my sample mean for discrimination score at noise level 5 is 31.445.
There are two natural ways to read the data. One way is to read all 5*7=35 variables per case, as in tuberead.sas. Another, less tedious way is to read just the data you want, taking advantage of the fixed column format. To use this approach see the input statement of senicread.sas. In the noise data we have "n=5" lines of data per case. Specify #1 to read from line 1, then read sex and age group from the appropriate columns. Still in the input statement, specify #5 to read from line 5, and then read the discrimination score from the appropriate columns. To make sure it worked, I recommend a cross-tabulation of sex by age and a proc means on discrimination score.
Make a single categorical independent variable consisting of all the age-sex combinations. This variable takes on 6 values. You will be testing contrasts of the group means.
The dependent variable is discrimination score at noise level 5. We'll just call it "discriminaton score."
Later, we are going to do a set of custom contrasts and convert them to Scheffé tests, so you'll need a table of critcal values. Produce one by modifying the proc iml code in kenton.sas. I found it better to put this at the end of my program.
First, do an overall one-way F test. Are there significant differences among the 6 group means? Be able to specify the numerical value of the test statistic, the p-value, etc..
Follow up with Scheffé tests. First do all pairwise comparisons of group means; you can and should do these with the means statement rather than setting up custom contrasts. Because the sample sizes are all equal, you get tests in a convenient format, not confidence intervals. Which comparisons, if any, are statistically significant? Give a one-sentence description of the results in plain language.
The table below will help in specifying the hypotheses you are to test.
Age Group Sex 1 2 3 F mu11 mu12 mu13 M mu21 mu22 mu23Now set up custom tests of contrasts to answer these questions, converting the tests to Scheffé tests by comparing the test statistics to modified critical values. For any significant results, you should be able to state the results in simple, non-statistical language.
Of course many more tests are possible, but at this point I have my main conclusion.
Please bring your log file and list file to the quiz.