For this question, the analyses will be based on fixed effects models using proc mixed. The model is linear regression, possibly with dummy variables:
Yi = β0 + β1xi1 + β2xi2 + ... + βpxip + εi,
where the error terms εi are normally distributed with expected value zero. Error terms for the same case have a covariance matrix Σ. The matrix Σ could be have all σ2 values on the main diagonal and zeros on the off-diagonals. In this case we have ordinary regression with independent errors.
Quite a few other structures are available for Σ, including the following:
Unknown (type=un) |
|
Variance Components (type=vc) |
|
Compound Symmetry (type=cs) |
|
Autoregressive (type=ar(1)) |
|
|
|
|
|
|
|
These formulas will be provided with the last quiz (in 2016), and they will be on the formula sheet for the final exam. For any set of proc mixed output you generate, you should be able to
- Give p, the number of explanatory variables in the regression model.
- Give the dimension (number of rows and columns) in Σ.
- Give the total number of parameters in the model you fit. That's the number of β values plus the number of unique elements in Σ.
- Locate any parameter estimates on your printout.
- For every test (and there may quite a few) be able to state the null hypothesis in terms of the symbols of your model.
In a test of how well people remember instructional materials, subjects of various educational levels were presented with training materials that were either in Black & White or in Colour. Their ability to recall the material was tested with both Cartoon and Realistic testing materials at two points in time -- immediately after training, and several weeks later. Scores on an IQ test (the Otis Mental Ability Test) were available for all subjects. The variables are
- Subject identification number
- Colour of training materials: 0 = Black & White, 1 = Colour
- Education: 0 = Pre-professional, 1 = Professional, 2 = Student.
- Location: 1 = Hospital A, 2 = Hospital B, 3 = Hospital C, 4 = Penn State University.
- Otis Test of Mental Ability (IQ)
- Recall at Time One, Cartoon testing materials
- Recall at Time One, Realistic testing materials
- Recall at Time Two, Cartoon testing materials
- Recall at Time Two, Realistic testing materials
The data are available in the file
cartoon.data.txt. This is a Minitab data set.
- We will limit the analysis to students, who are all from Penn State University. I used an if statement, but I had to put if before output.This leaves us with a three-factor analysis of covariance.
- What is the covariate?
- What are the factors?
- Label each factor as within-cases or between-cases.
- Write the full regression model, with products of dummy variables for all possible interactions. I get 9 βs.
- Give the dimension (number of rows and columns) in Σ.
- Let's not assume anything about the structure of the covariance matrix. How many unique elements are there in Σ?
- Carry out the analysis using proc mixed. Just so you can see if we are on the same page, my -2 Restricted Log Likelihood was 878.6.
- For each F-test, be able to state the null hypothesis in Greek letters.
- Locate the estimate of each element of Σ on your printout. Don't bother with the β-hats.
- For each significant F-test, be able to state the conclusion in plain, non-statistical language. This will require some lsmeans.
- What is the null hypothesis for the "Null Model Likelihood Ratio Test?" Why does it have 9 degrees of freedom?
- Describe an original study that would call for a multivariate regression in
which you control for an unordered categorical explanatory variable and then test the relationship
of two quantitative explanatory variables to two response variables simultaneously. Be sure to
specify which are the explanatory variables and which are the response variables, and which of
the variables you are controlling for. If a variable is to be represented by dummy variables you
should mention this and say how many dummy variables, but you don't need to say exactly how
they are defined or go into further detail. Don't bother to say anything about test statistics or pvalues. The study must be original or you will get no marks.
- Design an original study that would use a three-way multivariate analysis of covariance.
- Design an original study that would use a logistic regression with one quantitative explanatory variable and dummy variables for a categorical explanatory variable with 3 categories.
- Is it possible to have a study with repeated measures and a categorical response variable? If it is possible, make up an original example. If it is impossible, explain why.
- Make up an original example of a study that is multivariate, and both response variables are categorical.
- Invent and briefly describe (in a few sentences at most) original studies with the following characteristics. Do not use any examples from lecture or the class notes. If the requested example is impossible, say so and explain why it is impossible. The word original is important. If you give an example that is overly similar to one from lecture or the class notes, your answer will receive a zero. If two people give exactly the same example, they will both get a zero for the question.
- A categorical explanatory variable and a continuous response variable.
- A continuous explanatory variable and a continuous response variable.
- A nominal scale explanatory variable and an ordinal scale response variable.
- An explanatory variable that is both quantitative and nominal scale, and a response variable that is continuous.
- Two categorical explanatory variables and two categorical response variables.
- A single categorical explanatory variable and two quantitative response variables