Scab Disease Data

Scab disease is a fungal infection that affects potatoes. The fungus does not grow well in acidic soil, so investigators designed a study to see whether adding sulphur to the soil would reduce the scab disease. In a completely randomized design, plots of land were randomly assigned to either a control condition or to several levels of sulphur that was spread on the land in the Spring or Fall. The amounts of sulphur were either 300 pounds per acre, 600 pounds per acre or 1200 pounds per acre. The potatoes were harvested at the end of the growing season. One hundred potatoes were randomly selected from each plot of land. The potatoes were washed, and then a lab assistant estimated the percent of each potato's surface that was infected with scab disease. The response variable is, for each plot of land, the mean percent of the potato's surface covered with scab disease. The explanatory variable is pounds of sulphur, in hundreds of pounds; the control is zero.

The data are in a spreadsheet that looks like this.

Scab Disease Spreadsheet

Click here to download the spreadsheet.

Here are some questions we would like to answer. Use SAS. For each question, start by stating the null hypothesis in terms of μj values.

  1. Is amount of scab disease affected by experimental treatment?
  2. Is the average amount of scab disease for each treatment different from the average amount of scab disease in the control condition? That's six tests.
  3. Is the average amount of scab disease in the 3 Spring conditions different from the control condition? This is one test.
  4. Is the average amount of scab disease in the 3 Fall conditions different from the control condition? This is one test.
  5. Is the average amount of scab disease in the 3 Spring conditions different from the average amount of scab disease in the 3 Fall conditions? This is one test.
  6. Is amount of scab disease affected by the amount of sulfur when the sulfur is applied in the Spring? This is one test.
  7. Is amount of scab disease affected by the amount of sulfur when the sulfur is applied in the Fall? This is one test.
  8. For each amount of sulphur, is the average amount of scab disease different depending on whether the treatment is applied in Fall or the Spring? That's three tests.

Suppose we want to protect all these tests simultaneously at a joint significance level of 0.05.

Of course for each question you should be able to give the test statistic, p-value, and state the conclusion in plain, non-statistical language.

 

There are so many good questions and discussion topics that could be based on this example.

  1. Is n=4 per treatment really enough?
  2. Why not use the individual potatoes as experimental units?
  3. Why is the normal assuption pretty reasonable? (CLT, but not iid)
  4. Is there a case for one-sided tests here?
  5. Is the amount of reduction worth the expense?
  6. Monitoring actual soil acidity, at various depths. Mediating variable, if that's what it's called.
  7. Degree of infestation is likely not controlled, and could be the most powerful influence. Consider c ontrolling for it, perhaps by blocking.
  8. Other potential response variables, like size and flavour of the potatoes!
  9. Possible interactions with fertilizers and other pesticides. At least these should be held constant.
  10. How would things be different if this were a purely observational study? If would be easier, you know. Just ask the farmers.

 


This document is based on an example in Cochran and Cox's (1958) classic text Experimental design.. The original data appear on page 97 of Cochran and Cox's book. The data above are carefully designed to give the same results as the original, without actually using their numbers. The R function used to reconstruct the data appears in a comment statement at the end of this document. View the html source to see it.

This document, including the data and the R function, is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License. Use any part of it almost any way you like, as long as you share the results freely. See the license for details.