Assignment Seven: Quiz
on Friday March 9th
Telephone sales representatives use computer software to help them locate potential customers, answer questions, take credit card information and place orders. Twelve sales representatives were randomly assigned to each of three new software packages the company was thinking of purchasing. The data for each sales representative include the software package (1, 2 or 3), sales last quarter with the old software, and sales this quarter with one of the new software packages. Sales are in number of units sold. The data are in the file
sales.data. We will treat sales this quarter as the dependent variable and sales last quarter as a covariate.
- First, ignoring which new software package the sales representative employed, are sales better on average this quarter than last quarter? Do the simplest, most basic test you can. Give the numerical value of the test statistic, the p-value, and state your conclusions in plain, non-statistical language.
- Now write E[Y|X] for a regression model in which the slope of the regression line relating sales this quarter to sales last quarter might depend on which software was used. This is pure paper-and-pencil, not SAS.
- Make a table showing how you defined the dummy variables for software package. Make number 3 the reference category. Add a column showing E[Y|X] for each software. This is pure paper-and-pencil, not SAS.
- Using SAS, fit your model to the sales data. Using a calculator or proc iml and the numbers from proc reg, give
- The estimated slope and intercept of the regression line relating sales this quarter to sales last quarter, for sales representatives using software package 1.
- The estimated slope and intercept of the regression line relating sales this quarter to sales last quarter, for sales representatives using software package 2.
- The estimated slope and intercept of the regression line relating sales this quarter to sales last quarter, for sales representatives using software package 3.
- Estimated sales this quarter for a representative with average (sample mean) sales last quarter, using software package 2.
- Make a plot of the three regression lines over the range of the data. A rough plot with SAS is okay, or you can use some other software. Bring the plot to the quiz.
- Now, test the null hypothesis of equal regressions. That is, the null hypothesis is that the threee regression lines are right on top of each other, and there is no difference in expected sales as a function of software used, for any value of sales last quarter.
- In terms of the symbols of your regression model, what is the null hypothesis?
- Give the test statistic (a number from your printout) and the p-value.
- Agree or Disagree: There is no evidence that expected sales are different using the 3 software packages, for any value of sales last quarter.
- Now, test whether an assumption of parallel regression lines is justified.
- In terms of the symbols of your regression model, what is the null hypothesis?
- Give the test statistic (a number from your printout) and the p-value.
- Agree or Disagree: There seems to be an interaction between software package used and sales last quarter.
- We want to know whether there is a difference in mean sales this quarter for sales representatives with average (sample mean) sales last quarter. (I suppose that theoetically, there is a large sub-population of such sales representatives.)
- In terms of the symbols of your regression model, what is the null hypothesis?
- Give the test statistic (a number from your printout) and the p-value. Do this the easiest, fastest way you can!
- Give mean sales for each software package, adjusted for sales last quarter. Yes, you are being asked for least squares means. Get them the easiest, fastest way you can.
- Agree or Disagree: For sales representatives wth average sales last quarter, there is good evidence that software package 2 is better.
- Give a prediction of sales for a sales representative who sold 70 units last quarter, and was using software package 1. Also, give a 95 percent prediction interval. There is an example of how to do this in "Regression on the Math Data with SAS, Part Three." The prediction interval is a bit tough, and will be worth one point at most on the quiz if it is asked.
Bring your log and list files to the quiz. Do not write anything
on the printouts in advance except your name and student number. You may be
asked to hand them in. The log and list files for each data set must
be generated by the same SAS program or you may lose a lot of marks. There
must be no errors or warnings in your log files. There must be no
notes about invalid data. Bring a calculator to the quiz.