This question uses data from the Toronto Raptors' 2006-2007 season. For
each regular season and playoff game, the following variables were recorded:
- Date
- Home or Away game
- Opponent
- Won or lost
- Days since last game
- Points scored by the Raptors
- Points scored by opponents
- Opponents' won-lost record the preceding year.
The data are available in the file
Raptors06-07.data.txt.
The response variable will be point spread, defined as Raptors' score
minus opponents' score. The explanatory variables will be Opponents' won-lost record, Home vs Away game, Days since last game, and a binary variable that equals one if the game was on a weekend, and zero if on a weekday.
Here a few hints and reminders intended to make it easier for you to read and process the data.
An ordinary regression on these data lacks credibility, because it's obviously a time series and the assumption of independent random sampling (which means no sequential association between observations) is implausible without further evidence. So please follow these steps:
- Start with an ordinary least squares (OLS) regression using proc reg, with the explanatory and response variables given above. Request the Durbin-Watson statistic and associated p-value. What is the estimated first-order auto-corrrelation of the residuals? The answer is a number.
- P = 0.0595 is not quite evidence for autocorrelation, but it's not a completely clean bill of health either. Fit a first-order autoregressive model with proc autoreg.
- Are the beta-hat values different when you fit the first-order autoregressive model? Do the conclusions change (using the 0.05 significance level)?
- Two of the time series plots indicate that the first-order autoregressive model is okay. Which ones?
- Based on the first-order autoregressive model, when the Raptors played at home it was worth an estimated ______ points.
- Did they do better when they played on the weekend, or worse?
- It's probably not necessary, but fit an AR(6) model (a model with 6 autoregressive parameters). Is there evidence of higher-order autocorrelation? Was the ordinary regression acceptable for these time series data?