-Design can be used when experimental units are essentially homogeneous. ... Very flexible design (i.e. number of treatments and replicates is only limited by.
COMPLETELY RANDOM DESIGN (CRD)
Description of the Design -Simplest design to use. -Design can be used when experimental units are essentially homogeneous. -Because of the homogeneity requirement, it may be difficult to use this design for field experiments. -The CRD is best suited for experiments with a small number of treatments.
Randomization Procedure -Treatments are assigned to experimental units completely at random. -Every experimental unit has the same probability of receiving any treatment. -Randomization is performed using a random number table, computer, program, etc. Example of Randomization -Given you have 4 treatments (A, B, C, and D) and 5 replicates, how many experimental units would you have?
1
D 11
C
2
D 12
B
3
B 13
A
4
C 14
B
5
D 15
C
6
C 16
B
7
A 17
C
8
A
9
B
18
D
19
A
-Note that there is no “blocking” of experimental units into replicates. -Every experimental unit has the same probability of receiving any treatment.
10
D 20
A
Advantages of a CRD 1. Very flexible design (i.e. number of treatments and replicates is only limited by the available number of experimental units). 2. Statistical analysis is simple compared to other designs. 3. Loss of information due to missing data is small compared to other designs due to the larger number of degrees of freedom for the error source of variation. Disadvantages 1. If experimental units are not homogeneous and you fail to minimize this variation using blocking, there may be a loss of precision. 2. Usually the least efficient design unless experimental units are homogeneous. 3. Not suited for a large number of treatments.
Analysis of the Fixed Effects Model Notation Statistical notation can be confusing, but use of the Y-dot notation can help simplify things.
The dot in the Y-dot notation implies summation across over the subscript it replaces.
For example, n
yi. yij Treatment total, where n number of observations in a treatment j 1
yi. yi. n Treatment mean y.. i 1 a
n
y j 1
ij
Experiment total, where a number of treatments
y.. y.. N Experiment mean, where N total number of observations in the experiment .
Linear Additive Model for the CRD
Y ij i ij where: Yij is the jth observation of the ith treatment, is the population mean,
i is the treatment effect of the ith treatment, and ij is the random error. -Using this model we can estimate i or ij for any observation if we are given Yij and . Example
Yi. Y i. Y i. Y ..
Treatment 1 4 5 6 15 5 -3
Treatment 2 9 10 11 30 10 2
Treatment 3 8 11 8 27 9 1
Y.. = 72 Y .. = 8
-We can now write the linear model for each observation ( Y ij ). -Write in for each observation.
Yi. Y i. Y i. Y ..
Treatment 1 4=8 5=8 6=8 15 5 -3
Treatment 2 9=8 10 = 8 11 = 8 30 10 2
Treatment 3 8=8 11 = 8 8=8 27 9 1
Y.. = 72 Y .. = 8
-Write in the respective i for each observation where i Y i. Y ..
Yi. Y i. Y i. Y ..
Treatment 1 4=8–3 5=8–3 6=8–3 15 5 -3
Treatment 2 9=8+2 10 = 8 + 2 11 = 8 + 2 30 10 2
Treatment 3 8=8+1 11 = 8 + 1 8=8+1 27 9 1
Y.. = 72 Y .. = 8
-Write in the ij for each observation.
Yi. Y i. Y i. Y ..
Treatment 1 4=8–3-1 5=8–3 +0 6=8–3+1 15 5 -3
-Note for each treatment
Treatment 2 9=8+2-1 10 = 8 + 2 + 0 11 = 8 + 2 + 1 30 10 2
ij
Treatment 3 8=8+1-1 11 = 8 + 1 + 2 8=8+1-1 27 9 1
Y.. = 72 Y .. = 8
0.
-If you are asked to solve for 3 , what is the answer? -If you are asked to solve for 23 , what is the answer? -Question: If you are given just the treatment totals ( Yi. ’s), how would you fill in the values for each of the observations such that the Error SS = 0. Answer: Remember that the Experimental Error is the failure of observations treated alike to be the same. Therefore, if all treatments have the same value in each replicate, the Experimental Error SS =0. Example Given the following information, fill in the values for all Y ij ’s such that the Experimental Error SS = 0.
Yi. Y i.
Treatment 1
Treatment 2
Treatment 3
15 5
30 10
27 9
Y.. = 72 Y .. = 8
Answer Treatment 1 5 5 5 15 5
Yi. Y i.
Treatment 2 10 10 10 30 10
Note in the previous two examples that
Given H 0 : 1 2 ... t
Treatment 3 9 9 9 27 9
i
Y.. = 72 Y .. = 8
0. This is true for all situations.
H A : i i ' for at least one pair of treatments (i, i' ) t
i 1
i
t
(i.e., the sum of the treatment means divided by thenumber of treatments equals the experiment mean).
t
This definition implies that i 0 (Yi. Y.. ). i 1
The hypothesis written above can be rewritten in terms of the treatment effects τ i as:
H 0 : 1 2 ... a 0
H A : i 0 for at least i. Thus, when we are testing the null hypothesis that all treatments means are the same, we are testing at the same time the null hypothesis that all treatment effects, τi, are zero.
ANOVA for Any Number of Treatments with Equal Replication Given the following data:
Replicate 1 2 3 4 Yi.
Y
2 ij
Treatment B 23 42 36 26 31 47 33 34 123 149
A
3,875
5,805
C 47 43 43 39 172 Y...=444 7,428
Step 1. Write the hypotheses to be tested.
H o : 1 2 3 H A : 1 2 3
Ho: All three means are equal.
or
HA: At least one of the means is different from the other means.
1 2 3 or
1 2 3 Step 2. Calculate the Correction Factor. CF
Y..2 444 2 16,428.0 rt 4*3
Step 3. Calculate the Total SS TotalSS Yij2 CF (232 36 2 312 ... 39 2 ) CF 17,108 16,428 680.0
Step 4. Calculate the Treatment SS (TRT SS) Yi.2 TRTSS CF r 123 2 149 2 172 2 16,428 4 4 4 16728.5 16428.0 300.5
Step 5. Calculate the Error SS Error SS = Total SS – Treatment SS = 680 – 300.5 = 379.5 Step 6. Complete the ANOVA table
Sources of variation Treatment Error Total
Df t-1 = 2 t(r-1) = 9 rt-1 = 11
SS 300.5 379.5 680.0
MS 150.25 42.167
F 3.563NS
Step 7. Look up Table F-values. F0.05;2,9 = 4.26 F0.01;2,9 = 8.02 Step 8. Make conclusions.
-Since F-calc (3.563) < 4.26 we fail to reject Ho: at the 95% level of confidence. -Since F-calc (3.563) < 8.02 we fail to reject Ho: at the 99% level of confidence. 0
3.563
4.26
8.02
Step 9. Calculate Coefficient of Variation (CV). %CV
s *100 Y
Remember that the Error MS = s2.
%CV
42.167 *100 444 4*3
6.494 / 37 *100 17.6%
SAS commands for a CRD with no sampling; options pagenot=1;
data crd; input Trt $ yield; datalines; a 23 a 36 a 31 a 33 b 42 b 26 b 47 b 34 c 47 c 43 c 43 c 39 ;; ods rtf file='crd.rtf'; proc anova; class trt; model yield=trt; means trt/lsd; title 'ANOVA for a CRD with no Sampling'; run; ods rtf close;
ANOVA for a CRD with no Sampling The ANOVA Procedure Class Level Information Class Levels Values Trt
3 abc
Number of Observations Read
12
Number of Observations Used
12
ANOVA for a CRD with no Sampling The ANOVA Procedure Dependent Variable: yield
Source
DF
Sum of Squares Mean Square F Value
Model
2 300.5000000 150.2500000
Error
9 379.5000000
Corrected Total
Pr > F
3.56 0.0725
42.1666667
11 680.0000000
R-Square Coeff Var Root MSE yield Mean
0.441912 17.55023 6.493587
Source DF Trt
37.00000
Anova SS Mean Square F Value
2 300.5000000 150.2500000
Pr > F
3.56 0.0725
ANOVA for a CRD with no Sampling The ANOVA Procedure
Note This test controls the Type I comparisonwise error rate, not the experimentwise error rate. :
Alpha
0.05
Error Degrees of Freedom
9
Error Mean Square
42.1666 7
Critical Value of t
2.26216
Least Significant Difference
10.387
Means with the same letter are not significantly different. t Grouping
A
Mean N Trt
43.000
4 c
37.250
4 b
30.750
4 a
A B
A
B B
ANOVA for Any Number of Treatments with Unequal Replication Given the following data:
Replicate 1 2 3 4 5 Yi.
Y
2 ij
A 2.0 2.2 1.8 2.3 1.7 10 20.26
Treatment B C 1.7 2.0 1.9 2.4 1.5 2.7 2.5 2.4 5.1 12 8.75
29.06
D 2.1 2.2 2.2 1.9 8.4 Y..=35.5 17.7
Step 1. Write the hypotheses to be tested.
H o : 1 2 3 4 HA: At least one of the means is different from one of the other means. Step 2. Calculate the Correction Factor.
Y..2 35.52 CF 74.132 ri 17
Step 3. Calculate the Total SS TotalSS Yij2 CF (2.0 2 2.2 2 1.8 2 ... 1.9 2 ) CF 75.77 74.132 1.638
Step 4. Calculate the Treatment SS (TRT SS) Y2 TRTSS i. CF ri
10 2 5.12 12 2 8.4 2 74.132 3 5 4 5 75.110 74.132 0.978
Step 5. Calculate the Error SS Error SS = Total SS – Treatment SS = 1.638 – 0.978 = 0.660
Step 6. Complete the ANOVA table Sources of variation Treatment Error Total
Df t-1 = 3 By subtraction = 13 Total number of observations -1 = 16
SS 0.978 0.660 1.638
MS 0.326 0.051
F 6.392**
Step 7. Look up Table F-values. F0.05;3,13 = 3.41 F0.01;3,13 = 5.74 Step 8. Make conclusions.
0
3.41 5.74 6.39
Step 9. Calculate Coefficient of Variation (CV). %CV
s *100 Y
Remember that the Error MS = s2.
%CV
0.051 *100 35.5 17
0.2259 / 2.088 *100 10.82%
Since F-calc (6.392) > 3.41 we reject Ho: at the 95% level of confidence. Since F-calc (6.392) > 5.74 we reject Ho: at the 99% level of confidence.
ANOVA with Sampling (Equal Number of Samples Per Experimental Unit) Linear Model Yijk i ij ijk Where: Yijk is the kth sample of the jth observation of the ith treatment, is the population mean, i is the treatment effect of the ith treatment,
ij is the random error, and
ijk is the sampling error.
ANOVA table SOV Treatment Experimental error Sampling Error Total
Df t-1 (tr-1) - (t-1) (trs-1) - (tr-1) trs-1
F Treatment MS/Experimental Error MS
Facts about ANOVA with Sampling
There are two sources of variation that contribute to the variance appropriate to comparisons among treatment means. 1. Sampling Error = variation among sampling units treated alike ( s2 ). 2. Experimental Error = variation among experimental units treated alike ( s2 r E2 ).
The Experimental Error MS is expected to be larger than the Sampling Error MS.
If the Experimental Error variance component is not important, the Sampling Error MS and the Experimental Error MS will be of the same order of magnitude.
If the Experimental Error variance component is important, the Experimental Error MS will be much larger than the Sampling Error MS.
Example (without using a pooled error mean square) Temperature 12o Pot number 2 3 3.5 4.5 3.5 4.0 3.0 4.0 4.0 5.0 14.0 17.5 49.5
o
8 Pot number Plant 1 2 3 1 1 3.5 2.5 3.0 5.0 2 4.0 4.5 3.0 5.5 3 3.0 5.5 2.5 4.0 4 4.5 5.0 3.0 3.5 Yij. 15.0 17.5 11.5 18.0 Yi.. 44.0 Y…=156.0 Note i = treatment, j = replicate, and k = sample. Step 1. Calculate correction factor: Y...2 156 2 676 rts 3(3)(4)
Step 2. Calculate the Total SS: TotalSS Yijk2 CF
3.5 2 4.0 2 3.0 2 ... 5.5 2 CF 712.5 676.0 36.5
Step 3. Calculate the Treatment SS: TreatmentSS
Yi..2 CF rs
44 2 49.5 2 62.5 2 676.0 3 ( 4 ) ( 3 ( 4 ) 3 ( 4 ) 691.04 676.0 15.042
1 5.0 4.5 5.0 4.5 19.0
16o Pot number 2 5.5 6.0 5.0 5.0 21.5 62.5
3 5.5 4.5 6.5 5.5 22.0
Step 4. Calculate the SS Among Experimental Units Total (SSAEUT)
SSAEUT
Yij2. s
CF
15 2 17.5 2 11.5 2 22.0 2 676.0 ... 4 4 4 4 699.25 676.0 23.25 Step 5. Calculate the Experimental Error SS: Experimental Error SS = SSAEUT – SS TRT = 23.25 – 15.042 = 8.208 Step 6. Calculate the Sampling Error SS: Sampling Error SS = Total SS – SSAEUT = 36.5 – 23.25 = 13.25 Step 7. Complete the ANOVA Table: SOV Treatment Experimental Error Sampling Error Total
Df t-1 = 2 (tr-1) - (t-1) = 6 (trs-1) - (tr-1) = 27 trs-1 = 35
Step 8 Look up Table F-values. F0.05;2,6 = 5.14 F0.01;2,6 = 10.92
SS 15.042 8.208 13.25 36.5
MS 7.521 1.368
F 5.498*
F-value = Trt MS/Expt Error MS
Step 8. Make conclusions.
0
5.14
Since F-calc (5.498) > 5.14 we reject Ho: at the 95% level of confidence. Since F-calc (5.498) < 10.92 we fail to reject Ho: at the 99% level of confidence.
10.92
5.498
OPTIONAL MATERIAL: ANOVA of CRD With Sampling (Use of the Pooled Error MS)
We will use the data from the previous example for performing the F-test on treatments using a Pooled Error Mean Square.
The advantage of using the Pooled Error Mean Square as the denominator of the F-test is that you will have more degrees of freedom associated with denominator.
This may allow you to detect smaller significant differences between treatment means than you may have by using the Experimental Error Mean Square.
Example Steps 1-6 are the same as in the previous example. Step 7. Complete the ANOVA, but don’t calculate the F-value. SOV Treatment Experimental Error Sampling Error Total Pooled Error
Df t-1 = 2 (tr-1) - (t-1) = 6 (trs-1) - (tr-1) = 27 trs-1 = 35 33
SS 15.042 8.208 13.25 36.5 21.458
MS 7.521 1.368 0.491
F
0.65
Step 7.1 Calculate the Sampling Error Mean Square. Step 7.2 Test the homogeneity of the Sampling and Experimental Error Mean Square using the folded F-test. If they are homogeneous a Pooled Error MS can be calculated and used as the denominator of the F-test on treatments.
We need to remember that the expected mean square for the Sampling Error = s2 and the expected mean square for the Experimental Error = s2 r E2 . Thus the Experimental Error MS is expected to be > the Sampling Error MS.
Remember from the t-test that the Folded F-test involves the larger MS/smaller MS. Thus, in our case the Folded F = Experimental Error MS / Sampling Error MS.
If the variances are similar, the component of r E2 in the Experimental Error Mean Square will approach the value zero.
F
1.368 0.491
2.786
Step 7.3 Look up the table F-value This F-test is a one-tail test because there is the expectation that the Experimental Error MS S2 s E2 is going to be larger than the Sampling Error MS S2 .
Thus, if you are testing = 0.01, then you need to use the F-table for α = 0.01.
F 0.01,( ExptErrdf )( SampErrdf) = F 0.01;6, 27 3.558
Step 7.4 Make conclusions: Since the calculated value of F (2.786) is less than the Table-F value (3.558), we fail to reject Ho: Sampling Error MS = Experimental Error MS at the 99% level of confidence.
Therefore, we can calculate a Pooled Error MS
Step 7.5: Calculate the Pooled Error df and the Pooled Error MS Pooled Error df = Sampling Error df + Experimental Error df = (26+7) = 33 Pooled Error MS = Sampling Error SS + Experimental Error SS Sampling Error df + Experimental Error df =
13.25 8.208 0.650 26 7
Step 8. Complete the ANOVA by calculating the F-value using the Pooled Error MS as the denominator of the F-test. SOV Treatment Experimental Error Sampling Error Total Pooled Error
Df t-1 = 2 (tr-1) - (t-1) = 6 (trs-1) - (tr-1) = 27 trs-1 = 35 33
SS 15.042 8.208 13.25 36.5 21.458
MS 7.521 1.368 0.491
F 11.57**
0.65
Step 9 Look up Table F-values. F-value = Trt MS/Pooled Error MS
F0.05;2,33 = 3.30 F0.01;2,33 = 5.37 Step 10. Make conclusions.
0
Since F-calc (11.57) > 3.30 we reject Ho: at the 95% level of confidence. Since F-calc (11.57) > 5.37 we reject Ho: at the 99% level of confidence.
3.30 5.37
11.57
ANOVA When the Number of Subsamples are Not Equal TotalSS Yijk2
Y...2 total # ofobservations
df = #observations – 1
TreatmentSS
Yi..2 Y...2 r j sk total # ofobs.
df = # treatments –1
SSAEUT
Yij2. sk
Y...2 total # ofobs.
df = # Experimental units – 1
SS Experimental Error = SSAEUT – SS TRT
df = SSAEUT df – TRT df
SS Sampling Error = Total SS – SSAEUT
df = Total df – SSAEUT df
CRD with Sampling (Using the Experimental Error and Pooled Error MS as the denominator of the F-tests) options pageno=1; data crdsamp; input trt rep sample kwt; datalines; 8 1 1 3.5 8 1 2 4 8 1 3 3 8 1 4 4.5 8 2 1 2.5 8 2 2 4.5 8 2 3 5.5 8 2 4 5 8 3 1 3 8 3 2 3 8 3 3 2.5 8 3 4 3 12 1 1 5 12 1 2 5.5 12 1 3 4 12 1 4 3.5 12 2 1 3.5 12 2 2 3.5 12 2 3 3 12 2 4 4 12 3 1 4.5 12 3 2 4 12 3 3 4 12 3 4 5 16 1 1 5 16 1 2 4.5 16 1 3 5 16 1 4 4.5 16 2 1 5.5 16 2 2 6 16 2 3 5 16 2 4 5 16 3 1 5.5 16 3 2 4.5 16 3 3 6.5 16 3 4 5.5 ;; ods rtf file='example.rtf'; run; proc anova; class trt rep sample; model kwt=trt rep*trt;
*Comment The rep*trt commmand = the experimental error; test h=trt e=rep*trt; *The previous statement tells SAS to use the Experimental Error MS as the denominator of the F-test; means trt; title 'CRD with Sampling - Using Experimental Error as the Denominator'; run; proc anova; class trt rep sample; model kwt=trt; *The previous statement does not have the rep*trt statement, which is equivalent to the Expt. Error MS; means trt; title 'CRD with Sampling - Using Pooled Error as the Denominator'; run; ods rtf close;
CRD with Sampling - Using Experimental Error as the Denominator The ANOVA Procedure Class Level Information Class
Levels Values
trt
3 8 12 16
rep
3 123
sample
4 1234
Number of Observations Read
36
Number of Observations Used
36
CRD with Sampling - Using Experimental Error as the Denominator The ANOVA Procedure Dependent Variable: kwt Source
DF
Sum of Squares Mean Square F Value
Model
8
23.25000000
2.90625000
Error
27
13.25000000
0.49074074
Corrected Total
35
36.50000000
5.92
Pr > F
0.0002
This MS is the Sampling Error MS
R-Square Coeff Var Root MSE kwt Mean
0.636986 16.16605 0.700529 4.333333
Source DF trt trt*rep
Anova SS Mean Square F Value
2
15.04166667
7.52083333
6
8.20833333
1.36805556
Pr > F
15.33 F
0.0440
Correct result of the Ftest using the Expt. Error MS as the denominator of the Ftest.
CRD with Sampling - Using Experimental Error as the Denominator The ANOVA Procedure Level of trt
kwt N
Mean
Std Dev
8
12
3.66666667
1.00754728
12
12
4.12500000
0.74238559
16
12
5.20833333
0.62005620
CRD with Sampling - Using Pooled Error as the Denominator The ANOVA Procedure Class Level Information Class
Levels Values
trt
3 8 12 16
rep
3 123
sample
4 1234
Number of Observations Read
36
Number of Observations Used
36
CRD with Sampling - Using Pooled Error as the Denominator The ANOVA Procedure Dependent Variable: kwt Source
DF
Sum of Squares Mean Square F Value
Model
2
15.04166667
7.52083333
Error
33
21.45833333
0.65025253
Corrected Total
35
36.50000000
11.57
Pr > F
0.0002
Pooled Error MS R-Square Coeff Var Root MSE kwt Mean
0.412100 18.60882 0.806382 4.333333
Source DF trt
2
Anova SS Mean Square F Value
15.04166667
7.52083333
11.57
Pr > F
0.0002
Correct value for F-test using the Pooled Error MS as the denominator of the F-test.
CRD with Sampling - Using Pooled Error as the Denominator The ANOVA Procedure Level of trt
kwt N
Mean
Std Dev
8
12
3.66666667
1.00754728
12
12
4.12500000
0.74238559
16
12
5.20833333
0.62005620