Design and analysis of spit plot experiments

Split plot design

Design and fieldbook template

In a field experiment to test for effects of fungicide on crop, treatment of fungicides may be distinguised into multiple factors – based on chemical constituent, based on formulation, based on the mode of spray, etc. In a general case scenario where two former factors could be controlled, factor combinations may be organized in several different ways. When fully crossed implementation is not possible, split plot design comes to the rescue.

It is fair to assume that fungicide constituent is relatively difficult to allocate in highly isolated patches, so we can allocate a larger plot parcel to this factor and allocate different levels of formulation to sub-plots.

The design fieldbook seems somewhat similar to that shown in Table 1.

Table 1: Split plot design with main and subplot factors in three replication blocks
plotssplotsblocktrt1trt2mainplot
10111Mancozeb + MetalaxylSeed + Foliar1
10121Mancozeb + MetalaxylControl1
10131Mancozeb + MetalaxylSeed Treatment1
10141Mancozeb + MetalaxylFoliar Spray1
10211TrichodermaFoliar Spray2
10221TrichodermaSeed + Foliar2
10231TrichodermaSeed Treatment2
10241TrichodermaControl2

Layout plan

An example grid layout plan of the aboveshown design is shown below.

Analyis of split plot design

Let us take a grain yield dataset. The dataset contains 48 observations. Below, (in Table 2) data head have been shown after import, type-conversion and factor recoding.

Table 2: Split plot design with main and subplot factors in three replication blocks
repdensitynitrogenyield
1low11503
1medium_low11866
1medium_high12469
1high13786
2low12299
2medium_low11892
2medium_high13517
2high12851

Calculating variance, and setting hypothesis: A case involving single factor

In the most primitive scenario, manual calculation of variance components could just as easily be done. However, as the number of treatment factors rise, so does the complexity of computation. Manual calculation of sum of squares and the test statistic could be done as shown below. This, however, only remains valid as long as no grouping factors besides nitrogen are present, thus making it a classical scenario of single factor variance partitioning.

mu <- mean(grain_yld$yield)  # whole sample mean
ssto <- sum((grain_yld$yield - mu)^2)  # total sum of squares
mu.i <- tapply(grain_yld$yield, grain_yld$nitrogen, 
    mean)  # nitrogen(factor) means 
sstr <- sum(table(grain_yld$nitrogen) * (mu.i - mu)^2)  # nitrogen(factor) sum of squares
sse <- ssto - sstr  # error sum of squares
fstat <- (sstr/3)/(sse/45)  # F-statistic
SourceSums of SquaresDegrees of FreedomMean SquareF-Statp-value
Treatment6.167^{7}32.056^{7}18.4655.925^{-8}
Error5.01^{7}451.113^{6}

\[ \begin{align*} H_0 & : \mu_1 = \mu_2 = \mu_3 \\ H_A & : \mbox{At least one pair of means not equal} \end{align*} \]

Before proceeding for an inference, It is worthwhile to be acquainted with what the distribution looks like.

Getting back to our specific split-plot design case, we develop model and generate the ANOVA table (Table 3). A split plot design is modeled with main plot factor nested within replication and a sub plot factor nested within main plot factor. This essentially partitions the main effects of replication and the main plot factor.

Table 3: Model assumption 1: ANOVA of split plot design with main and subplot factors in three replication blocks
DfSum SqMean SqF valuePr(>F)
rep2341376617068835.980.008
density321451719715057325.050.000
rep:density6657645510960763.840.008
density:nitrogen1273482134612351121.450.000
Residuals246849756285406

Alternatively, following model specification could be made by regarding response (yield) as a product of main plot effect and sub plot effect, wherein main plot is nested inside replication (block) (ANOVA shown in Table 4).

Table 4: Model assumption 2: ANOVA of split plot design with main and subplot factors in three replication blocks
DfSum SqMean SqF valuePr(>F)
density321451719715057325.050.000
nitrogen3616733052055776872.030.000
rep2341376617068835.980.008
density:nitrogen91180882913120924.600.001
density:rep6657645510960763.840.008
Residuals246849756285406

It should be the right time, now, to use plotting libraries and generate some beautiful graphs.

Best to take a look at some diagnostic plots now, just to make sure model assumptions and validity are not being flouted.

Box-Cox plots helps determine whether or not a transformation is required. To recapitulate the importance of Box-Cox plot, below is an statement quoted from http://www.itl.nist.gov/div898/handbook/eda/section3/eda336.htm, which goes:

The Box-Cox normality plot shows that the maximum value of the correlation coefficient exists at λ = (x-axis value | maximum y-axis height). The histogram of the data after applying the Box-Cox transformation(were it not indicative of normal) with λ = (x-axis value | maximum y-axis height) shows a data set for which the normality assumption is reasonable. This can be verified with a normal probability plot of the transformed data.

Mean separation should proceed as follows.

## 
## Study: Duncan multiple comparison among levels of nitrogen
## 
## Duncan's new multiple range test
## for yield 
## 
## Mean Square Error:  285406 
## 
## nitrogen,  means
## 
##   yield  std  r  Min  Max
## 1  2384  710 12 1503 3786
## 2  3714  757 12 2050 5281
## 3  5021 1268 12 3317 7090
## 4  5195 1367 12 3858 7798
## 
## Alpha: 0.05 ; DF Error: 24 
## 
## Critical Range
##   2   3   4 
## 450 473 487 
## 
## Means with the same letter are not significantly different.
## 
##   yield groups
## 4  5195      a
## 3  5021      a
## 2  3714      b
## 1  2384      c

Visualize the means resulting from Duncan’s test.

comments powered by Disqus

Related