Overview

Review Last week’s Lecture
Review Block Design
Randomized Complete Block Design with R Programming

1.1 In last week (1)

We reviewed different types of randomized experiment design:
- Block Design: Complete block design (CBD) vs. Randomized Complete Block Design (RCBD)
  - Difference: A “complete block design” simply refers to an experimental design where every treatment is applied to every block, while a “randomized complete block design” takes that same concept and adds the element of randomly assigning treatments within each block
- Block Design with more block factors: Latin square design, Repeated LSD, Greco-Roman Squares
  - Benefit: (1) Account for more explained variances and leads to lower residual variances; (2) Make the effect size of treatment more accurate;
  - Limitation: require block factors to have same number of levels.

1.2 In last week (2)

We discussed about why we need Block design compared to using simple treatment-control design.
- Potential confounding effects that can become nuisance factors
- Heterogeneity of samples (variability across gender, schools, age groups)
- Greater generalization of results
Assumptions of Block Design
- Continuous outcome
- Experimental units are randomly sampled
- No interactions between treatment factor(s) and blocking factor(s)
- Each block group’s outcome is normally distributed
- Each block group has “equal” or “close” variances in outcome

1.3 Features of RCBD

Think about our example of the effects of teaching methods (M1, M2, M3, M4) and measurement forms (F1, F2, F3, F4) to math performance.

In randomized complete block design (RCBD), each block size is the same and is equal to the number of treatments (i.e. factor levels or factor level combinations).
- For those who using measurement form (same block), they will be randomly assigned to 4 teaching methods.
Each treatment will be randomly assigned to exactly one experimental unit (i.e., students) within every block.
The assignments of treatment levels (teaching methods) to the experimental units (students) have to be done within each block separately.

1.4 Random Effects of RCBD

It is important to mention that blocks are usually ( $U_{\rho_j}$ , but not always) treated as random effects as they typically represent the population of all possible blocks.
In other words, the mean comparison among specific blocks is not of interest. The variability could be large or small depending on your context.
However, the variation between blocks must be incorporated into the model.

1.5 Exercises

A poultry experiment was run to investigate the effect of diet and antibiotics on egg production. They evaluated 2 diets of interest and 2 specific antibiotics that are on the market. The feed and antibiotic were combined and used to fill the feeding trays in barns. They chose 3 poultry farms at random and randomly assigned the combinations of diet and antibiotic to 4 barns within each farm. Total egg production by the chickens was recorded after 4 weeks.
1. What is the experimental design (hint: think about the randomization process)?
2. Identify which factors are treatment and block.

Answer

RCBD.
treatment: combination of Diet and Antibiotic; block: Farms.

1.6 Other Aspects of the RCBD

The RCBD utilizes an additive model (two-way ANOVA without interaction)
- one in which there is no interaction between treatments and blocks. The error term in a randomized complete block model reflects how the treatment effect varies from one block to another.
Both the treatments and blocks can be considered as random effects rather than fixed effects, if the levels were selected at random from a population of possible treatments or blocks. We consider this case later, but it does not change the test for a treatment effect.
What are the consequences of not blocking if we should have? Generally the unexplained error in the model will be larger, and therefore the test of the treatment effect less powerful.
How to determine the sample size in the RCBD?
- The Operating Characteristic (OC) curve approach can be used to determine the number of blocks to run. The number of blocks, b, represents the number of replications (they are exchangable from the point of researchers’ view). The power calculations that we looked at before would be the same, except that we use b rather than n, and we use the estimate of error, $\sigma^2$ , that reflects the improved precision based on having used blocks in our experiment. So, the major benefit or power comes not from the number of replications but from the error variance which is much smaller because you removed the effects due to block.

1.7 Statistical form of RCBD

The mean comparison among specific blocks ( $\mu_{form1}$ , $\mu_{form2}$ , $\mu_{form3}$ , $\mu_{form4}$ ) is not of interest
- In a RCBD, the variation between blocks is partitioned out of the MSE, resulting in a smaller MSE for testing hypotheses about the treatments.

$Y_{ij} = \mu + \tau_i + \rho_j + \epsilon_{ij}$

where:

$Y_{ij}$ : math scores for Method i and Form j
$\mu$ : grand mean
$\tau_i$ : Method i with i = 1, …, 4
$\rho_j$ : From j with j = 1, 2, 3
$\rho_j$ and $\epsilon_{ij}$ are independent random variables such that $\rho_j \sim \mathcal{N}(0, \sigma^2_{\rho})$ and $\epsilon_{ij} \sim \mathcal{N}(0, \sigma^2_{\epsilon})$

1.8 A little bit statistics ¹

Can partition $\mathrm{SS}_{\mathrm{T}}=\sum \sum\left(y_{i j}-\bar{y}_{. .}\right)^{2}$ into:

$\mathrm{SS}_{\mathrm{T}}= n_b \sum\left(\bar{y}_{i .}-\bar{y}_{. .}\right)^{2}+ n_a \sum\left(\bar{y}_{. j}-\bar{y}_{. .}\right)^{2}+\sum \sum\left(y_{i j}-\bar{y}_{i .}-\bar{y}_{. j}+\bar{y}_{. .}\right)^{2}$

$\mathrm{SS}_{\mathrm{treatment}}= n_b \sum\left(\bar{y}_{i .}-\bar{y}_{. .}\right)^{2}$ with $\mathrm{df} = a -1$
$\mathrm{SS}_{\mathrm{block}}= n_a \sum\left(\bar{y}_{. j}-\bar{y}_{. .}\right)^{2}$ with $\mathrm{df} = b -1$
$\mathrm{SS}_{\mathrm{Residual}}= \sum \sum\left(y_{i j}-\bar{y}_{i .}-\bar{y}_{. j}+\bar{y}_{. .}\right)^{2}$ with $\mathrm{df} = (n_a-1)(n_b -1)$

$\mathrm{SS}_{\mathrm{T}} = \mathrm{SS}_{\mathrm{treatment}} + \mathrm{SS}_{\mathrm{block}} + \mathrm{SS}_{\mathrm{Residual}}$

¹ Original post can be found here

1.9 A littble bit more statistics

Assume treatment factor has $n_a$ levels and blocking factor has $n_b$ levels:

$SS_{Total} = \sum_{i=1}^{n_a}\sum_{j=1}^{n_b}(y_{ij})^2-(\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}y_{ij})^2/N$

Mean of “sum of square” of marginal sums minus the mean of “square of sum”
- Marginal Sums of treatment: $y_{i.}$

$SS_{Treatment} = \frac{1}{n_b}\sum{(y_{i.})}^2 -(\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}y_{ij})^2/N$

Mean of “sum of square” of marginal sums minus the mean of “square of sum”
- Marginal Sums of block: $y_{.j}$

$SS_{Block} = \frac{1}{n_a}\sum{(y_{.j})}^2 -(\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}y_{ij})^2/N$

1.10 Example: Performance of detergents

Background

An experiment was designed to study the performance of four different detergents in cleaning clothes. The following “cleanness” readings (higher=cleaner) were obtained with specially designed equipment for three different types of common stains. Is there a difference between the detergents?

Click to see R code

library(tidyverse)
detergents <- tribble(
  ~Detergent, ~Stain1, ~Stain2, ~Stain3,
  1, 45, 43, 51,
  2, 47, 46, 52,
  3, 48, 50, 55,
  4, 42, 37, 49
)
kableExtra::kable(detergents)

Detergent	Stain1	Stain2	Stain3
1	45	43	51
2	47	46	52
3	48	50	55
4	42	37	49

Marginal Sums of treatment: $y_{i.}$ ; R code: rowSums(detergents[, 2:4])
Marginal Sums of Stain: $y_{.j}$ ; R code: colSums(detergents[, 2:4])

1.11 Example: Total Sum of Square

Sum of square of all values: $(\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}y_{ij})^2$ = 26867
Square of sum of all values per level: $\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}(y_{ij})^2$ = 26602.0833333333
Total Sum of Squares: $SS_{Total}$ = 264.916666666668

sum((detergents[, 2:4])^2) - (sum(detergents[, 2:4]))^2 / 12

[1] 264.9167

1.12 Example: Sum of squares for Detergent

treatment_marginal_Sums = rowSums(detergents[, 2:4])
grand_mean <- mean(unlist(detergents[, 2:4]))
## Method 1
3 * sum((treatment_marginal_Sums/3 - grand_mean)^2)

[1] 110.9167

## Method 2
sum(treatment_marginal_Sums^2) / 3 - (sum(detergents[, 2:4]))^2 / 12

[1] 110.9167

Sum of square per level: $\frac{1}{n_b}\sum{(y_{i.})}^2$ = 26713
Square of sum of all values: $\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}(y_{ij})^2$ = 319225
Sum of Squares for treatment: $SS_{treatment}$ = 110.917

1.13 Example: Sum of squares for block

block_marginal_Sums = colSums(detergents[, 2:4])
## Method 1
4 * sum((block_marginal_Sums/4 - grand_mean)^2)

[1] 135.1667

## Method 2
(1 / 4) * sum(block_marginal_Sums^2) - (sum(detergents[, 2:4]))^2 / 12

[1] 135.1667

Sum of square per level: $\frac{1}{n_b}\sum{(y_{i.})}^2$ = 26737.25
Square of sum of all values: $\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}(y_{ij})^2$ = 707670837.673611
Sum of Squares for treatment: $SS_{treatment}$ = 135.166666666668

1.14 F-statistics

$F = \frac{SS_{\mathrm{treatment}}/n_a}{SS_\mathrm{residual}/ ((n_a-1)*(n_b-1))}$

SS_total <- sum((detergents[, 2:4])^2) - (sum(detergents[, 2:4]))^2 / 12
SS_treatment <- 3 * sum((treatment_marginal_Sums/3 - grand_mean)^2)
SS_block <- 4 * sum((block_marginal_Sums/4 - grand_mean)^2)
SS_residual = SS_total - SS_treatment - SS_block
SS_residual

[1] 18.83333

F_stat = (SS_treatment / (4-1)) / (SS_residual / ((4-1)*(3-1)))
F_stat

[1] 11.77876

1.15 R Code for Sum of Squares

detergents_aov <- detergents |> 
  pivot_longer(starts_with("Stain"), names_to = "Stain") |>
  mutate(Detergent = factor(Detergent, levels = 1:4))

fit <- aov(value ~ Detergent+Stain, data= detergents_aov)
summary(fit)

            Df Sum Sq Mean Sq F value  Pr(>F)   
Detergent    3 110.92   36.97   11.78 0.00631 **
Stain        2 135.17   67.58   21.53 0.00183 **
Residuals    6  18.83    3.14                   
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

1.16 Example: Tutoring session

Let’s look at a simple example:
Research question: What is the effect of time of day of tutoring session on midterm grades?
The Primary Investigator (PI) of the study wants to control for tutor, believing some tutors may be better in the subject matter than others. Thus, they design a completely randomized block design.
Each tutor works with students in the morning and afternoon.
- In this example, student select their favorite tutor, but are then randomly assigned to either a morning (AM) or afternoon (PM) session.
- 200 total students: 91 in the morning, 109 in the afternoon

Note

DV: Midterm Score (in cells)

1.16.1 Example: Variables and Null Hypothesis

IV: Time (a = 2 )
- 2 levels: AM and PM
Nuisance factor: Tutor (b = 4 )
- 4 levels: Booby, Julia, Monique, and Ned
DV: Midterm Score
Null hypothesis pertaining to the IV of interest:
- $H_0:\mu_{𝐴𝑀} = \mu_{𝑃𝑀} → a= 2$
We will also have a null hypothesis pertaining to the blocking factor:
- $H_0:\mu_{Bobby} = \mu_{Julia} = \mu_{Monique} =\mu_{Ned} → b=4$
Two Nulls = two values of $F_{obs}$ , two values of $F_{crit}$ , two decisions

Note

DV: Midterm Score (in cells)

1.16.2 Example: Sum of Squares

IV: Time (𝑎= 2 )
Nuisance factor: Tutor (𝑏= 4 )
DV: Midterm Score
Now:
- $𝑆𝑆_{Total} =\color{red}{𝑆𝑆_{𝑀𝑜𝑑𝑒𝑙}}+\color{purple}{𝑆𝑆_{𝐵𝑙𝑜𝑐𝑘}}+\color{blue}{𝑆𝑆_{𝐸𝑟𝑟𝑜𝑟}}$
Thus, we can partition the effects into three parts:
- Sum of squares due to treatments (IV = Time),
- Sum of squares due to the blocking factor,
- and Sum of squares due to error.
We do not model an interaction with blocked designs. (we will talk about it later.)

1.16.3 Example: Mean of Squares and F-statistics

IV: Time (𝑎= 2 )
Nuisance factor: Tutor (𝑏= 4 )
DV: Midterm Score
Model: $𝑆𝑆_{Total} =\color{red}{𝑆𝑆_{𝑀𝑜𝑑𝑒𝑙}}+\color{purple}{𝑆𝑆_{𝐵𝑙𝑜𝑐𝑘}}+\color{blue}{𝑆𝑆_{𝐸𝑟𝑟𝑜𝑟}}$
ANOVA table:

1.16.4 Example: Sum of Square Formula

$SS_{\mathrm{Total}}=\sum_{i=1}^{n}(y_{ij}-\bar{y}_{..})^2$

This is the same as before: take each individual score ( $y_{ij}$ ), subtract the grand mean ( $\bar{y}_{..}$ ), and square it $(y_{ij}-\bar{y}_{..})^2$

scores <- c(9.1, 9.3, 9.4, 9.5) # four individuals' scores
mean(scores) # grand mean

[1] 9.325

sum((scores - mean(scores))^2) # sum of squares of total indivisuals

[1] 0.0875

Do this for everyone and then sum over all people $\sum_{i=1}^{n}(y_{ij}-\bar{y}_{..})^2$
However, when calculating Sum of Squares for IVs: $SS_{Model}$ , $SS_{Block}$ , $SS_{error}$ , we need to compute “marginal means”
- A marginal mean is the mean for one level of the variable, ignoring the other variable

1.16.5 Example: Marginal Means

A marginal mean is the mean for one level of the variable, ignoring the other variable

For example:

The AM marginal mean is the average of all students’ midterm scores in the morning, ignoring who they have as a tutor $\bar{y}_{AM.} = 22.95$
The Bobby marginal mean is the average of all students’ midterm scores who had Bobby as a tutor, ignoring time of day $\bar{y}_{.Bobby} = 11.91$

1.16.6 Example: SS for Total and Model

$SS_{Total} = \sum_{i=1}^{n}(y_{ij}-\bar{y}_{..})^2=15060.48$

$SS_{Model(time)} = \sum_{a=1}^{a}n_a(\bar{y}_{a.}-\bar{y}_{..})^2=4489.02$

where $n_a$ is the group size for AM/PM and $\bar{y}_{a.}$ are the marginal means for AM and PM {22.95, 13.43}
This is similar to how we computed $𝑆𝑆_{𝑀𝑜𝑑𝑒𝑙}$ before: marginal group mean subtract off the grand mean and square it. Sum over all groups.

$SS_{Block} = \sum_{b=1}^{b}n_{b}(\bar{y}_{.b}-\bar{y}_{..})^2=3239.43$

Technically, the blocking factor is just another IV (but we are not interested in or is not within the scope of research question).
- $𝑆𝑆_{𝐸𝑟𝑟𝑜𝑟} = 𝑆𝑆_{𝑇𝑜𝑡𝑎𝑙}− 𝑆𝑆_{𝑀𝑜𝑑𝑒𝑙} +𝑆𝑆_{𝐵𝑙𝑜𝑐𝑘}$ = 7332. 03

1.16.7 Example: Details of ANOVA Table

Then, we can fill out the ANOVA table:

Note

Under $\alpha=.05$ , for “Model” factor – Time, we have $df_{Model}$ = 1, $df_{error} = 195$ : $F_{crit}=3.89$ so sig.

Similarly, for “Blocking” - Tutor, we have $df_{block}$ = 3, $df_{error} = 195$ : $F_{crit}=2.65$ so sig.

1.16.8 Interpretation

A randomized block design was used to test the effect of tutoring time on midterm scores. For each tutor, participants were randomly assigned to either morning (𝑛 = 91 ) or afternoon (𝑛= 109 ) tutoring sessions.
The effect of tutoring time was significant $(𝐹(1,195) = 119. 39, 𝑝<. 001 ,\eta^2_{𝑇𝑖𝑚𝑒} =. 298$ ) with a large effect. Using a Tukey’s test, morning is significantly higher than afternoon sessions (𝑝 <. 05 ).
The effect of the blocking factor, tutor, was significant ( $𝐹(3 , 195) = 28. 72 ,𝑝< . 001 ,\eta^2_{𝑇𝑢𝑡𝑜𝑟} =. 215$ ) with a large effect. Using a Tukey’s test, Monique’s students were significantly higher than other students, and Bobby’s students were significantly lower than other students in the midterm scores (𝑝<. 05 ).

1.17 Example: Hardness Reading²

In this example we wish to determine whether 4 different tips (the treatment factor) produce different (mean) hardness readings on a Rockwell hardness tester.
- The treatment factor is the design of the tip for the machine that determines the hardness of metal. The tip is one component of the testing machine.

The Rockwell hardness test

The Rockwell hardness test is a hardness test based on indentation hardness of a material. The Rockwell test measures the depth of penetration of an indenter under a large load (major load) compared to the penetration made by a preload (minor load).

To conduct this experiment we assign the tips to an experimental unit; that is, to a test specimen (called a coupon), which is a piece of metal on which the tip is tested.
- The blocking factor is the block of test specimens. The test specimens are blocks of metal that are similar in hardness. The test specimens are used to block the variation in hardness of the metal from the variation in the tips.

1.17.1 Example: Block Design - CRD

If the structure were a completely randomized experiment (CRD) that we discussed in lecture 7, we would assign the tips to a random piece of metal for each test. In this case, the test specimens would be considered a source of nuisance variability.

Click to see R code

set.seed(1234)
data.frame(
  Metal = paste0("Metal", 1:8),
  Tip = rep(c("Tip1", "Tip2", "Tip3", "Tip4"), each = 2),
  Hardness = sample(seq(9, 10, by =.1), 8)
) |> 
  kableExtra::kable()

Metal	Tip	Hardness
Metal1	Tip1	9.9
Metal2	Tip1	9.5
Metal3	Tip2	9.4
Metal4	Tip2	9.3
Metal5	Tip3	9.6
Metal6	Tip3	9.0
Metal7	Tip4	9.8
Metal8	Tip4	9.1

1.17.2 Example: Block Design - RCBD

If we conduct this as a blocked experiment, we would assign all four tips to the same test specimen, randomly assigned to be tested on a different location on the specimen. Since each treatment occurs once in each block, the number of test specimens is the number of replicates.
Back to the hardness testing example, the experimenter may very well want to test the tips (treatment) across specimens (block) of various hardness levels. This shows the importance of blocking. To conduct this experiment as a RCBD, we assign all 4 tips to each specimen.
- In this experiment, each specimen is called a “block”; thus, we have designed a more homogenous set of experimental units on which to test the tips.

1.17.3 Example: Block Design Table - RCBD

Suppose that we use b = 4 blocks as shown in the table below:
We are primarily interested in testing the equality of treatment means, but now we have the ability to remove the variability associated with the nuisance factor (the blocks) through the grouping of the experimental units prior to having assigned the treatments.

Click to see R code

tribble(
  ~`1`,     ~`2`,   ~`3`,   ~`4`,
  "Tip 3",  "Tip 3",    "Tip 2",    "Tip 1",
  "Tip 1",  "Tip 4",    "Tip 1",    "Tip 4",
  "Tip 4",  "Tip 2",    "Tip 3",    "Tip 3",
  "Tip 2",  "Tip 1",    "Tip 4",    "Tip 3"
) |> 
  gt() |> 
  tab_header(
    title = "The Hardness Testing Experiment",
    subtitle = "Randomized Complete Block Design"
  ) |> 
  tab_spanner(
    label = "Test Coupon (Block)",
    columns = everything()
  ) |> 
  tab_options(
    table.width = px(500),
    table.font.size = px(20)
  )

The Hardness Testing Experiment
Randomized Complete Block Design
Test Coupon (Block)
1	2	3	4
Tip 3	Tip 3	Tip 2	Tip 1
Tip 1	Tip 4	Tip 1	Tip 4
Tip 4	Tip 2	Tip 3	Tip 3
Tip 2	Tip 1	Tip 4	Tip 3

Important

Notice the two-way structure of the experiment. Here we have four blocks and within each of these blocks is a random assignment of the tips within each block.

1.17.4 Example: ANOVA Results (1)

Remember, the hardness of specimens (coupons) is tested with 4 different tips.

Click to see R code

library(here)
dat <- read.csv(here::here("teaching/2025-01-13-Experiment-Design/Lecture08", "tip_hardness.csv"))
kableExtra::kable(dat)

Obs	Tip	Hardness	Coupon
1	1	9.3	1
2	1	9.4	2
3	1	9.6	3
4	1	10.0	4
5	2	9.4	1
6	2	9.3	2
7	2	9.8	3
8	2	9.9	4
9	3	9.2	1
10	3	9.4	2
11	3	9.5	3
12	3	9.7	4
13	4	9.7	1
14	4	9.6	2
15	4	10.0	3
16	4	10.2	4

1.17.5 Example: ANOVA Results (2)

Here is the output from R aov(). We can see four levels of the Tip and four levels for Coupon:

Click to see R code

dat$Tip <- factor(dat$Tip)
dat$Coupon <- factor(dat$Coupon)
fit_exp2 <- aov(Hardness ~ Tip + Coupon, data = dat)

Note

The Analysis of Variance table shows three degrees of freedom for Tip three for Coupon, and the residual (error) degrees of freedom is nine.
The ratio of mean squares of treatment over error gives us an F ratio that is equal to 14.44 which is highly significant since it is greater than the .001 percentile of the F distribution with three and nine degrees of freedom.
Our 2-way analysis also provides a test for the block factor, Coupon. The ANOVA shows that this factor is also significant with an F-test = 30.94. So, there is a large amount of variation in hardness between the pieces of metal.
This is why we used specimen (or coupon) as our blocking factor. We expected in advance that it would account for a large amount of variation. By including block in the model and in the analysis, we removed this large portion of the variation, such that the residual error is quite small. By including a block factor in the model, the error variance is reduced, and the test on treatments is more powerful.

1.17.6 Example: ANOVA Results (3)

The test on the block factor is typically not of interest except to confirm that you used a good blocking factor. The results are summarized by the table of means given below.

Click to see R code

cbind(
dat |> 
  group_by(Tip) |>
  summarize(
    N_Tip = n(),
    Hardness_Tip = mean(Hardness))
,
dat |> 
  group_by(Coupon) |>
  summarize(
    N_Coupon = n(),
    Hardness_Coupon = mean(Hardness)) 
) |> 
  kableExtra::kable()

Tip	N_Tip	Hardness_Tip	Coupon	N_Coupon	Hardness_Coupon
1	4	9.575	1	4	9.400
2	4	9.600	2	4	9.425
3	4	9.450	3	4	9.725
4	4	9.875	4	4	9.950

² Please check the original post here: STAT 503

³ Please check the original post here: STAT 502 Analysis of Variance and Design of Experiments

1.18 Example: Plant Fertilizer³

In a greenhouse experiment, there was a single factor (fertilizer) with 4 levels (i.e. 4 treatments), six replications, and a total of 24 experimental units (potted plants). Suppose the image below is the greenhouse bench (viewed from above) that was used for the experiment.
To use CBD, we need to randomly assign each of the treatment levels to 6 potted plants. To do this, we first assign numbers to the physical position of the pots on the bench. Each column of plants will be use one Fertilizer.
To further expand it to RCBD, we need to put them into different farms (say we have six farms). Each farms will have 4 plans with each will use one type of fertilizer.

1.18.1 Read in R Data

After saving the exp1_data.csv into the same directory of your R file. You should be able to import dataset from Files Panel in Rstudio:

1.18.2 Block Design

gt(dat, 
   rownames_to_stub = TRUE, 
   groupname_col = "Block", 
   row_group_as_column = TRUE) |> 
   tab_options(table.width = px(500))

		Fertilizer	Height	Plant
Block1	1	Control	19.5	1
	7	F1	25.0	7
	13	F2	22.5	13
	19	F3	27.5	19
Block2	2	Control	20.5	2
	8	F1	27.5	8
	14	F2	25.2	14
	20	F3	28.0	20
Block3	3	Control	21.0	3
	9	F1	28.0	9
	15	F2	26.0	15
	21	F3	29.2	21
Block4	4	Control	21.0	4
	10	F1	28.6	10
	16	F2	26.5	16
	22	F3	29.5	22
Block5	5	Control	21.5	5
	11	F1	30.5	11
	17	F2	27.0	17
	23	F3	30.0	23
Block6	6	Control	22.5	6
	12	F1	32.0	12
	18	F2	28.0	18
	24	F3	31.0	24

1.18.3 ANOVA Results

Let us obtain the ANOVA table for the RCBD. To run the model with Block Design in R we can use the aov() function with the formula:

<Outcome>~<Treatment>+<Block>.

fit_rcbd <- aov(Height ~ Fertilizer + Block, data = dat)                 
summary(fit_rcbd)

            Df Sum Sq Mean Sq F value   Pr(>F)    
Fertilizer   3 251.44   83.81  162.96 1.14e-11 ***
Block        5  53.32   10.66   20.73 2.99e-06 ***
Residuals   15   7.72    0.51                     
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

For comparison, let us obtain the ANOVA table for the CRD for the same data. We use the following R code with the formula:

<Outcome>~<Treatment>

fit_cbd <- aov(Height ~ Fertilizer, data = dat)              
summary(fit_cbd)

            Df Sum Sq Mean Sq F value   Pr(>F)    
Fertilizer   3 251.44   83.81   27.46 2.71e-07 ***
Residuals   20  61.03    3.05                     
---
Signif. codes:  0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1

1.18.4 Interpretation

Important

Comparing the two ANOVA tables, we see that the MSE in RCBD has decreased considerably in comparison to the CRD. This reduction in MSE can be viewed as the partition in SSE for the CRD (61.033) into SSBlock (53.32) + SSE (7.715). The potential reduction in SSE by blocking is offset to some degree by losing degrees of freedom for the blocks. But more often than not, is worth it in terms of the improvement in the calculated F-statistic. In our example, we observe that the F-statistic for the treatment has increased considerably for RCBD in comparison to CRD. It is reasonable to assume that the result from the RCBD is more valid than that from the CRD as the MSE value obtained after accounting for the block to block variability is a more accurate representation of the random error variance.

1.19 Example: Performance of Students at varied environment

Background: Comparing the performances of students (male and female) blocks in different environments (at home and at college). To represent this experiment in the figure will be as follows:

Where AC: At College, AH: At Home

stud <- factor(rep(c("male", "female"), each = 2)) 
perf <- factor(rep(c("ah", "ac" ), times = 2)) 
perf

[1] ah ac ah ac
Levels: ac ah

y <- c(5.5, 5, 
    4, 6.2) 

# y is the hours students 
# studied in specific places 
results <- data.frame(y, stud, perf) 

fit <- aov(y ~ perf+stud, data = results)                
summary(fit)

            Df Sum Sq Mean Sq F value Pr(>F)
perf         1 0.7225  0.7225   0.396  0.642
stud         1 0.0225  0.0225   0.012  0.930
Residuals    1 1.8225  1.8225

Explanation: The value of Mean Sq is 0.7225<<1.8225,i.e, here blocking wasn’t necessary. And as Pr value is 0.642 > 0.05 (5% significance) and the hypothesis is accepted - there is no sufficient evidence suggesting females and males have significant differences in performance.

2 Blocking factor or not?

Effectiveness of Health Promotion Programs

Researchers are interested in comparing the effectiveness of three health promotion programs with nursing students.
The researcher has the following research question: Which of the three health promotion programs are most effective at reducing unhealthy coping behaviors?
1. Unhealthy coping behavior is measured as a composite score from the “Poor Coping Behavior” survey (PCB). High scores on the PCB indicate higher levels of unhealthy coping behaviors. Low scores indicate low levels of unhealthy coping behaviors. PCB scores can range from 0 (no unhealthy behaviors) to 100 (multiple unhealthy behaviors at high frequency and high intensity).
2. The researcher thinks that the health promotion programs for nursing students may have an effect on academic and well-being outcomes.
3. However, gender and status (traditional vs. non-traditional student) differences may influence the effectiveness of health promotion programs.

2.0.1 Blocking Factors?

The researcher has a sample of 200 nursing students from the state of Arkansas. 50 students are assigned to each program: Program A, Program B, and Program C. In addition, 50 students are assigned to a control group that receives the status quo educational model.

data.frame(
  Health_Program = c("Program A", "Program B", "Program C", "Control"),
  N = c(50, 50, 50, 50)
) |> kableExtra::kable()

Health_Program	N
Program A	50
Program B	50
Program C	50
Control	50

2.0.3 Factors

What is the dependent variable? → PCB SCORES
What is the independent variable of interest? → HEALTH PROGRAM (4 LEVELS)
What is (are) the confounding variables, that the researcher might statistically control for? → GENDER, STATUS
What is (are) the blocking factors? → Schools, Classes

--- title: "Lecture 08: Block Design II" subtitle: "Experimental Design in Education" date: "2025-03-07" date-modified: "`{r} Sys.Date()`" execute: eval: true echo: true warning: false message: false format: html: code-tools: true code-line-numbers: false code-fold: false code-summary: "Click to see R code" number-offset: 1 fig.width: 10 fig-align: center message: false grid: sidebar-width: 350px uark-revealjs: chalkboard: true embed-resources: false code-fold: false number-sections: true number-depth: 1 footer: "ESRM 64503" slide-number: c/t tbl-colwidths: auto scrollable: true output-file: slides-index.html mermaid: theme: forest --- ::: objectives ## Overview {.unnumbered} 1. Review Last week's Lecture 2. Review Block Design 3. Randomized Complete Block Design with R Programming ::: ```{r setup, include=FALSE} library(tidyverse) library(gt) ``` ## In last week (1) 1. We reviewed different types of randomized experiment design: - Block Design: Complete block design (CBD) vs. Randomized Complete Block Design (RCBD) - **Difference**: [A "complete block design" simply refers to an experimental design where every treatment is applied to every block, while a "randomized complete block design" takes that same concept and adds the element of randomly assigning treatments within each block]{.mohu} - Block Design with more block factors: Latin square design, Repeated LSD, Greco-Roman Squares - **Benefit**: [(1) Account for more explained variances and leads to lower residual variances; (2) Make the effect size of treatment more accurate;]{.mohu} - **Limitation**: [require block factors to have same number of levels.]{.mohu} ## In last week (2) 2. We discussed about why we need Block design compared to using simple treatment-control design. - [Potential confounding effects that can become nuisance factors]{.mohu} - [Heterogeneity of samples (variability across gender, schools, age groups)]{.mohu} - [Greater generalization of results]{.mohu} 3. Assumptions of Block Design - [Continuous outcome]{.mohu} - [Experimental units are randomly sampled]{.mohu} - [No interactions between treatment factor(s) and blocking factor(s)]{.mohu} - [Each block group's outcome is normally distributed]{.mohu} - [Each block group has "equal" or "close" variances in outcome]{.mohu} ## Features of RCBD ::: discussion Think about our example of the effects of teaching methods (M1, M2, M3, M4) and measurement forms (F1, F2, F3, F4) to math performance. ::: 1. In **randomized complete block design (RCBD)**, each block size is the same and is equal to the number of treatments (i.e. factor levels or factor level combinations). - For those who using measurement form (same block), they will be randomly assigned to 4 teaching methods. 2. Each treatment will be randomly assigned to exactly one experimental unit (i.e., students) within every block. 3. The assignments of treatment levels (teaching methods) to the experimental units (students) have to be done within each block separately. ## Random Effects of RCBD 1. It is important to mention that blocks are usually ($U_{\rho_j}$, but not always) treated as random effects as they typically represent the population of all possible blocks. 2. In other words, the mean comparison among specific blocks is not of interest. The variability could be large or small depending on your context. 3. However, the variation between blocks must be incorporated into the model. ## Exercises :::: discussion - A poultry experiment was run to investigate the effect of **diet** and **antibiotics** on egg production. They evaluated 2 diets of interest and 2 specific antibiotics that are on the market. The feed and antibiotic were combined and used to fill the feeding trays in barns. They chose 3 poultry farms at random and randomly assigned the combinations of diet and antibiotic to 4 barns within each farm. Total egg production by the chickens was recorded after 4 weeks. 1. What is the experimental design (hint: think about the randomization process)? 2. Identify which factors are treatment and block. ::: {.callout-note .mohu} ## Answer a. [RCBD.]{.mohu} b. [treatment: combination of Diet and Antibiotic; block: Farms.]{.mohu} ::: :::: ## Other Aspects of the RCBD - The RCBD utilizes an *additive model* (two-way ANOVA without interaction) - one in which there is *no interaction* between treatments and blocks. The error term in a randomized complete block model reflects how the treatment effect varies from one block to another. - Both the treatments and blocks can be considered as *random effects* rather than fixed effects, if the levels were selected at random from a population of possible treatments or blocks. We consider this case later, but it does not change the test for a treatment effect. - What are the consequences of not blocking if we should have? Generally the **unexplained error** in the model will be larger, and therefore the test of the treatment effect less powerful. - How to determine the sample size in the RCBD? - The [Operating Characteristic (OC) curve](https://online.stat.psu.edu/stat503/lesson/3/3.2#paragraph--298) approach can be used to determine the number of blocks to run. The number of blocks, *b*, represents the number of replications (they are exchangable from the point of researchers' view). The power calculations that we looked at before would be the same, except that we use b rather than n, and we use the estimate of error, $\sigma^2$, that reflects the improved precision based on having used blocks in our experiment. So, the major benefit or power comes not from the number of replications but from the error variance which is much smaller because you removed the effects due to block. ## Statistical form of RCBD - The mean comparison among specific blocks ($\mu_{form1}$, $\mu_{form2}$, $\mu_{form3}$, $\mu_{form4}$) is not of interest - In a RCBD, the variation between blocks is partitioned out of the MSE, resulting in a smaller MSE for testing hypotheses about the treatments. $$ Y_{ij} = \mu + \tau_i + \rho_j + \epsilon_{ij} $$ where: 1. $Y_{ij}$: math scores for Method *i* and Form *j* 2. $\mu$: grand mean 3. $\tau_i$: Method *i* with *i* = 1, ..., 4 4. $\rho_j$: From *j* with *j* = 1, 2, 3 5. $\rho_j$ and $\epsilon_{ij}$ are independent random variables such that $\rho_j \sim \mathcal{N}(0, \sigma^2_{\rho})$ and $\epsilon_{ij} \sim \mathcal{N}(0, \sigma^2_{\epsilon})$ ## A little bit statistics [^1] [^1]: Original post can be found [here](https://www.stat.purdue.edu/~xbw/courses/stat514/notes/lec6.blockdesign.pdf) - Can partition $\mathrm{SS}_{\mathrm{T}}=\sum \sum\left(y_{i j}-\bar{y}_{. .}\right)^{2}$ into: $$ \mathrm{SS}_{\mathrm{T}}= n_b \sum\left(\bar{y}_{i .}-\bar{y}_{. .}\right)^{2}+ n_a \sum\left(\bar{y}_{. j}-\bar{y}_{. .}\right)^{2}+\sum \sum\left(y_{i j}-\bar{y}_{i .}-\bar{y}_{. j}+\bar{y}_{. .}\right)^{2} $$ - $\mathrm{SS}_{\mathrm{treatment}}= n_b \sum\left(\bar{y}_{i .}-\bar{y}_{. .}\right)^{2}$ with $\mathrm{df} = a -1$ - $\mathrm{SS}_{\mathrm{block}}= n_a \sum\left(\bar{y}_{. j}-\bar{y}_{. .}\right)^{2}$ with $\mathrm{df} = b -1$ - $\mathrm{SS}_{\mathrm{Residual}}= \sum \sum\left(y_{i j}-\bar{y}_{i .}-\bar{y}_{. j}+\bar{y}_{. .}\right)^{2}$ with $\mathrm{df} = (n_a-1)(n_b -1)$ $$ \mathrm{SS}_{\mathrm{T}} = \mathrm{SS}_{\mathrm{treatment}} + \mathrm{SS}_{\mathrm{block}} + \mathrm{SS}_{\mathrm{Residual}} $$ ## A littble bit more statistics - Assume treatment factor has $n_a$ levels and blocking factor has $n_b$ levels: $$ SS_{Total} = \sum_{i=1}^{n_a}\sum_{j=1}^{n_b}(y_{ij})^2-(\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}y_{ij})^2/N $$ - *Mean of "sum of square" of marginal sums* minus the mean of "square of sum" - Marginal Sums of treatment: $y_{i.}$ $$ SS_{Treatment} = \frac{1}{n_b}\sum{(y_{i.})}^2 -(\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}y_{ij})^2/N $$ - *Mean of "sum of square" of marginal sums* minus the mean of "square of sum" - Marginal Sums of block: $y_{.j}$ $$ SS_{Block} = \frac{1}{n_a}\sum{(y_{.j})}^2 -(\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}y_{ij})^2/N $$ ## Example: Performance of detergents ::: callout-important ## Background An experiment was designed to study the performance of four different detergents in cleaning clothes. The following “cleanness” readings (higher=cleaner) were obtained with specially designed equipment for three different types of common stains. Is there a difference between the detergents? ::: ```{r} #| code-fold: true library(tidyverse) detergents <- tribble( ~Detergent, ~Stain1, ~Stain2, ~Stain3, 1, 45, 43, 51, 2, 47, 46, 52, 3, 48, 50, 55, 4, 42, 37, 49 ) kableExtra::kable(detergents) ``` - Marginal Sums of treatment: $y_{i.}$; R code: `rowSums(detergents[, 2:4])` - Marginal Sums of Stain: $y_{.j}$; R code: `colSums(detergents[, 2:4])` ------------------------------------------------------------------------ ## Example: Total Sum of Square 1. *Sum of square* of all values: $(\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}y_{ij})^2$ = `{r} sum((detergents[, 2:4])^2) |> as.character()` 2. *Square of sum* of all values per level: $\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}(y_{ij})^2$ = `{r} ((sum(detergents[, 2:4]))^2/12) |> as.character()` 3. **Total Sum of Squares**: $SS_{Total}$ = `{r} (sum((detergents[, 2:4])^2) - (sum(detergents[, 2:4]))^2 / 12 )|> as.character()` ```{r} #| code-fold: false sum((detergents[, 2:4])^2) - (sum(detergents[, 2:4]))^2 / 12 ``` ## Example: Sum of squares for Detergent ```{r} treatment_marginal_Sums = rowSums(detergents[, 2:4]) grand_mean <- mean(unlist(detergents[, 2:4])) ## Method 1 3 * sum((treatment_marginal_Sums/3 - grand_mean)^2) ## Method 2 sum(treatment_marginal_Sums^2) / 3 - (sum(detergents[, 2:4]))^2 / 12 ``` 1. *Sum of square* per level: $\frac{1}{n_b}\sum{(y_{i.})}^2$ = `{r} (sum(treatment_marginal_Sums^2) / 3) |> as.character()` 2. *Square of sum* of all values: $\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}(y_{ij})^2$ = `{r} (sum(detergents[, 2:4]))^2 |> as.character()` 3. **Sum of Squares for treatment**: $SS_{treatment}$ = `{r} round(sum(treatment_marginal_Sums^2) / 3 - (sum(detergents[, 2:4]))^2 / 12, 3) |> as.character()` ## Example: Sum of squares for block ```{r} block_marginal_Sums = colSums(detergents[, 2:4]) ## Method 1 4 * sum((block_marginal_Sums/4 - grand_mean)^2) ## Method 2 (1 / 4) * sum(block_marginal_Sums^2) - (sum(detergents[, 2:4]))^2 / 12 ``` 1. *Sum of square* per level: $\frac{1}{n_b}\sum{(y_{i.})}^2$ = `{r} ((1 / 4) * sum(block_marginal_Sums^2)) |> as.character()` 2. *Square of sum* of all values: $\sum_{i=1}^{n_a}\sum_{j=1}^{n_b}(y_{ij})^2$ = `{r} ((sum(detergents[, 2:4]))^2 / 12)^2 |> as.character()` 3. **Sum of Squares for treatment**: $SS_{treatment}$ = `{r} ((1 / 4) * sum(block_marginal_Sums^2) - (sum(detergents[, 2:4]))^2 / 12) |> as.character()` ## F-statistics $$ F = \frac{SS_{\mathrm{treatment}}/n_a}{SS_\mathrm{residual}/ ((n_a-1)*(n_b-1))} $$ ```{r} SS_total <- sum((detergents[, 2:4])^2) - (sum(detergents[, 2:4]))^2 / 12 SS_treatment <- 3 * sum((treatment_marginal_Sums/3 - grand_mean)^2) SS_block <- 4 * sum((block_marginal_Sums/4 - grand_mean)^2) SS_residual = SS_total - SS_treatment - SS_block SS_residual ``` ```{r} F_stat = (SS_treatment / (4-1)) / (SS_residual / ((4-1)*(3-1))) F_stat ``` ------------------------------------------------------------------------ ## R Code for Sum of Squares ```{r} detergents_aov <- detergents |> pivot_longer(starts_with("Stain"), names_to = "Stain") |> mutate(Detergent = factor(Detergent, levels = 1:4)) fit <- aov(value ~ Detergent+Stain, data= detergents_aov) summary(fit) ``` ## Example: Tutoring session :::::: columns ::: {.column width="60%"} - Let’s look at a simple example: - Research question: **What is the effect of time of day of** **tutoring session on midterm grades?** - The Primary Investigator (PI) of the study wants to control for tutor, believing some tutors may be better in the subject matter than others. Thus, they design a completely randomized block design. - Each tutor works with students in the morning and afternoon. - In this example, student select their favorite tutor, but are then randomly assigned to either a morning (AM) or afternoon (PM) session. - 200 total students: 91 in the morning, 109 in the afternoon ::: :::: {.column width="40%"} ![](images/clipboard-3708810514.png) ::: callout-note DV: Midterm Score (in cells) ::: :::: :::::: ------------------------------------------------------------------------ ### Example: Variables and Null Hypothesis :::::: columns ::: {.column width="60%"} - IV: Time (*a* = 2 ) - [2 levels]{style="color: orange"}: AM and PM - Nuisance factor: Tutor (*b* = 4 ) - [4 levels]{style="color: purple"}: Booby, Julia, Monique, and Ned - DV: Midterm Score - Null hypothesis pertaining to the IV of interest: - $H_0:\mu_{𝐴𝑀} = \mu_{𝑃𝑀} → a= 2$ - We will also have a null hypothesis pertaining to the blocking factor: - $H_0:\mu_{Bobby} = \mu_{Julia} = \mu_{Monique} =\mu_{Ned} → b=4$ - Two Nulls = two values of $F_{obs}$, two values of $F_{crit}$, two decisions ::: :::: {.column width="40%"} ![](images/clipboard-3708810514.png) ::: callout-note DV: Midterm Score (in cells) ::: :::: :::::: ------------------------------------------------------------------------ ### Example: Sum of Squares ::::: columns ::: {.column width="70%"} - IV: Time (𝑎= 2 ) - Nuisance factor: Tutor (𝑏= 4 ) - DV: Midterm Score - Now: - $𝑆𝑆_{Total} =\color{red}{𝑆𝑆_{𝑀𝑜𝑑𝑒𝑙}}+\color{purple}{𝑆𝑆_{𝐵𝑙𝑜𝑐𝑘}}+\color{blue}{𝑆𝑆_{𝐸𝑟𝑟𝑜𝑟}}$ - Thus, we can partition the effects into three parts: - Sum of squares due to [treatments (IV = Time)]{style="color: red"}, - Sum of squares due to the [blocking factor]{style="color: purple"}, - and Sum of squares due to [error]{style="color: blue"}. - We **do not** model an **interaction** with blocked designs. (we will talk about it later.) ::: ::: {.column width="30%"} ![](images/clipboard-624916915.png) ::: ::::: ------------------------------------------------------------------------ ### Example: Mean of Squares and F-statistics - IV: Time (𝑎= 2 ) - Nuisance factor: Tutor (𝑏= 4 ) - DV: Midterm Score - Model: $𝑆𝑆_{Total} =\color{red}{𝑆𝑆_{𝑀𝑜𝑑𝑒𝑙}}+\color{purple}{𝑆𝑆_{𝐵𝑙𝑜𝑐𝑘}}+\color{blue}{𝑆𝑆_{𝐸𝑟𝑟𝑜𝑟}}$ - ANOVA table: ![](images/clipboard-505786804.png) ------------------------------------------------------------------------ ### Example: Sum of Square Formula $$ SS_{\mathrm{Total}}=\sum_{i=1}^{n}(y_{ij}-\bar{y}_{..})^2 $$ - This is the same as before: take each individual score ($y_{ij}$), subtract the grand mean ($\bar{y}_{..}$), and square it $(y_{ij}-\bar{y}_{..})^2$ ```{r} scores <- c(9.1, 9.3, 9.4, 9.5) # four individuals' scores mean(scores) # grand mean sum((scores - mean(scores))^2) # sum of squares of total indivisuals ``` - Do this for everyone and then sum over all people $\sum_{i=1}^{n}(y_{ij}-\bar{y}_{..})^2$ - However, when calculating **S**um of **S**quares for IVs: $SS_{Model}$, $SS_{Block}$, $SS_{error}$, we need to compute "**marginal means**" - A **marginal mean** is the mean for one level of the variable, ignoring the other variable ------------------------------------------------------------------------ ### Example: Marginal Means - A **marginal mean** is the mean for one level of the variable, ignoring the other variable ::::: columns ::: {.column width="50%"} **For example**: - The AM marginal mean is the average of all students’ midterm scores in the morning, ignoring who they have as a tutor $$ \bar{y}_{AM.} = 22.95 $$ - The Bobby marginal mean is the average of all students’ midterm scores who had Bobby as a tutor, ignoring time of day $$ \bar{y}_{.Bobby} = 11.91 $$ ::: ::: {.column width="50%"} ![](images/clipboard-560146152.png) ::: ::::: ------------------------------------------------------------------------ ### Example: SS for Total and Model $$ SS_{Total} = \sum_{i=1}^{n}(y_{ij}-\bar{y}_{..})^2=15060.48 $$ $$ SS_{Model(time)} = \sum_{a=1}^{a}n_a(\bar{y}_{a.}-\bar{y}_{..})^2=4489.02 $$ - where $n_a$ is the group size for AM/PM and $\bar{y}_{a.}$ are the marginal means for AM and PM {22.95, 13.43} - This is similar to how we computed $𝑆𝑆_{𝑀𝑜𝑑𝑒𝑙}$ before: marginal group mean subtract off the grand mean and square it. Sum over all groups. $$ SS_{Block} = \sum_{b=1}^{b}n_{b}(\bar{y}_{.b}-\bar{y}_{..})^2=3239.43 $$ - Technically, the blocking factor is just another IV (but we are not interested in or is not within the scope of research question). - $𝑆𝑆_{𝐸𝑟𝑟𝑜𝑟} = 𝑆𝑆_{𝑇𝑜𝑡𝑎𝑙}− 𝑆𝑆_{𝑀𝑜𝑑𝑒𝑙} +𝑆𝑆_{𝐵𝑙𝑜𝑐𝑘}$ = 7332. 03 ------------------------------------------------------------------------ ### Example: Details of ANOVA Table - Then, we can fill out the ANOVA table: ![](images/clipboard-122981133.png) ::: callout-note Under $\alpha=.05$, for "Model" factor – Time, we have $df_{Model}$ = 1, $df_{error} = 195$: $F_{crit}=3.89$ so sig. Similarly, for "Blocking" - Tutor, we have $df_{block}$ = 3, $df_{error} = 195$: $F_{crit}=2.65$ so sig. ::: ------------------------------------------------------------------------ ### Interpretation - A randomized block design was used to test the effect of tutoring time on midterm scores. For each tutor, participants were randomly assigned to either morning (𝑛 = 91 ) or afternoon (𝑛= 109 ) tutoring sessions. - The effect of tutoring time was significant $(𝐹(1,195) = 119. 39, 𝑝<. 001 ,\eta^2_{𝑇𝑖𝑚𝑒} =. 298$) with a large effect. Using a Tukey’s test, morning is significantly higher than afternoon sessions (𝑝 \<. 05 ). - The effect of the blocking factor, tutor, was significant ($𝐹(3 , 195) = 28. 72 ,𝑝< . 001 ,\eta^2_{𝑇𝑢𝑡𝑜𝑟} =. 215$) with a large effect. Using a Tukey’s test, Monique’s students were significantly higher than other students, and Bobby’s students were significantly lower than other students in the midterm scores (𝑝\<. 05 ). ## Example: Hardness Reading[^2] [^2]: Please check the original post here: [STAT 503](https://online.stat.psu.edu/stat503/lesson/4/4.1) - In this example we wish to determine whether 4 different tips (the treatment factor) produce different (mean) hardness readings on a Rockwell hardness tester. - The **treatment factor** is the design of the tip for the machine that determines the hardness of metal. The tip is one component of the testing machine. ::: callout-note ## The Rockwell hardness test **The Rockwell hardness test** is a hardness test based on indentation hardness of a material. The Rockwell test measures the depth of penetration of an indenter under a large load (major load) compared to the penetration made by a preload (minor load). ::: - To conduct this experiment we assign the tips to an experimental unit; that is, to a **test specimen** (called a coupon), which is a piece of metal on which the tip is tested. - The **blocking factor** is the block of test specimens. The test specimens are blocks of metal that are similar in hardness. The test specimens are used to block the variation in hardness of the metal from the variation in the tips. ------------------------------------------------------------------------ ### Example: Block Design - CRD - If the structure were a completely randomized experiment (CRD) that we discussed in lecture 7, we would assign the tips to a random piece of metal for each test. In this case, the test specimens would be considered a source of nuisance variability. ```{r} #| code-fold: true set.seed(1234) data.frame( Metal = paste0("Metal", 1:8), Tip = rep(c("Tip1", "Tip2", "Tip3", "Tip4"), each = 2), Hardness = sample(seq(9, 10, by =.1), 8) ) |> kableExtra::kable() ``` ------------------------------------------------------------------------ ### Example: Block Design - RCBD - If we conduct this as a blocked experiment, we would assign all four tips to the same test specimen, randomly assigned to be tested on a different location on the specimen. Since each treatment occurs once in each block, the number of test specimens is the number of replicates. - Back to the hardness testing example, the experimenter may very well want to *test the tips* (treatment) across *specimens* (block) of various hardness levels. This shows the importance of blocking. To conduct this experiment as a RCBD, we assign all 4 tips to each specimen. - In this experiment, each specimen is called a “block”; thus, we have designed a more homogenous set of experimental units on which to test the tips. ------------------------------------------------------------------------ ### Example: Block Design Table - RCBD ::::: columns ::: column - Suppose that we use b = 4 blocks as shown in the table below: - We are primarily interested in testing the equality of treatment means, but now we have the ability to remove the variability associated with the nuisance factor (the blocks) through the grouping of the experimental units prior to having assigned the treatments. ::: ::: column ```{r} #| code-fold: true tribble( ~`1`, ~`2`, ~`3`, ~`4`, "Tip 3", "Tip 3", "Tip 2", "Tip 1", "Tip 1", "Tip 4", "Tip 1", "Tip 4", "Tip 4", "Tip 2", "Tip 3", "Tip 3", "Tip 2", "Tip 1", "Tip 4", "Tip 3" ) |> gt() |> tab_header( title = "The Hardness Testing Experiment", subtitle = "Randomized Complete Block Design" ) |> tab_spanner( label = "Test Coupon (Block)", columns = everything() ) |> tab_options( table.width = px(500), table.font.size = px(20) ) ``` ::: ::::: ::: callout-important Notice the two-way structure of the experiment. Here we have four blocks and within each of these blocks is a random assignment of the tips within each block. ::: ------------------------------------------------------------------------ ### Example: ANOVA Results (1) - Remember, the hardness of specimens (coupons) is tested with 4 different tips. ```{r} #| code-fold: true library(here) dat <- read.csv(here::here("teaching/2025-01-13-Experiment-Design/Lecture08", "tip_hardness.csv")) kableExtra::kable(dat) ``` ------------------------------------------------------------------------ ### Example: ANOVA Results (2) - Here is the output from R `aov()`. We can see four levels of the Tip and four levels for Coupon: ```{r} #| code-fold: true dat$Tip <- factor(dat$Tip) dat$Coupon <- factor(dat$Coupon) fit_exp2 <- aov(Hardness ~ Tip + Coupon, data = dat) ``` ::: callout-note - The Analysis of Variance table shows three degrees of freedom for Tip three for Coupon, and the residual (error) degrees of freedom is nine. - The ratio of mean squares of treatment over error gives us an F ratio that is equal to 14.44 which is highly significant since it is greater than the .001 percentile of the F distribution with three and nine degrees of freedom. - Our 2-way analysis also provides a test for the block factor, Coupon. The ANOVA shows that this factor is also significant with an F-test = 30.94. So, there is a large amount of variation in hardness between the pieces of metal. - This is why we used specimen (or coupon) as our blocking factor. We expected in advance that it would account for a large amount of variation. By including block in the model and in the analysis, we removed this large portion of the variation, such that the residual error is quite small. By including a block factor in the model, the error variance is reduced, and the test on treatments is more powerful. ::: ### Example: ANOVA Results (3) - The test on the block factor is typically not of interest except to confirm that you used a good blocking factor. The results are summarized by the table of means given below. ```{r} #| code-fold: true cbind( dat |> group_by(Tip) |> summarize( N_Tip = n(), Hardness_Tip = mean(Hardness)) , dat |> group_by(Coupon) |> summarize( N_Coupon = n(), Hardness_Coupon = mean(Hardness)) ) |> kableExtra::kable() ``` ## Example: Plant Fertilizer[^3] [^3]: Please check the original post here: [STAT 502 Analysis of Variance and Design of Experiments](https://online.stat.psu.edu/stat502/lesson/7/7.3) ::::: columns ::: column 1. In a greenhouse experiment, there was a single factor (fertilizer) with 4 levels (i.e. 4 treatments), six replications, and a total of 24 experimental units (potted plants). Suppose the image below is the greenhouse bench (viewed from above) that was used for the experiment. 2. To use CBD, we need to randomly assign each of the treatment levels to 6 potted plants. To do this, we first assign numbers to the physical position of the pots on the bench. Each column of plants will be use one Fertilizer. 3. To further expand it to RCBD, we need to put them into different farms (say we have six farms). Each farms will have 4 plans with each will use one type of fertilizer. ::: ::: column ![](images/clipboard-1929254708.png){width="100%"} ::: ::::: ------------------------------------------------------------------------ ### Read in R Data - After saving the **exp1_data.csv** into the same directory of your R file. You should be able to import dataset from Files Panel in Rstudio: ![](images/clipboard-1848460327.png) ![](images/clipboard-1116098669.png) ------------------------------------------------------------------------ ### Block Design ```{r} #| include: false library(tidyverse) dat <- tribble( ~Block, ~Fertilizer, ~Height, "Block1", "Control", 19.5, "Block2", "Control", 20.5, "Block3", "Control", 21, "Block4", "Control", 21, "Block5", "Control", 21.5, "Block6", "Control", 22.5, "Block1", "F1", 25, "Block2", "F1", 27.5, "Block3", "F1", 28, "Block4", "F1", 28.6, "Block5", "F1", 30.5, "Block6", "F1", 32, "Block1", "F2", 22.5, "Block2", "F2", 25.2, "Block3", "F2", 26, "Block4", "F2", 26.5, "Block5", "F2", 27, "Block6", "F2", 28, "Block1", "F3", 27.5, "Block2", "F3", 28, "Block3", "F3", 29.2, "Block4", "F3", 29.5, "Block5", "F3", 30, "Block6", "F3", 31 ) dat$Plant <- 1:nrow(dat) ``` ```{r} #| echo: true gt(dat, rownames_to_stub = TRUE, groupname_col = "Block", row_group_as_column = TRUE) |> tab_options(table.width = px(500)) ``` ------------------------------------------------------------------------ ### ANOVA Results :::::: columns ::: column - Let us obtain the ANOVA table for the **RCBD**. To run the model with Block Design in R we can use the `aov()` function with the formula: `<Outcome>~<Treatment>+<Block>`. ```{r} fit_rcbd <- aov(Height ~ Fertilizer + Block, data = dat) summary(fit_rcbd) ``` ::: ::: column - For comparison, let us obtain the ANOVA table for the **CRD** for the same data. We use the following R code with the formula: `<Outcome>~<Treatment>` ```{r} fit_cbd <- aov(Height ~ Fertilizer, data = dat) summary(fit_cbd) ``` ::: ------------------------------------------------------------------------ ### Interpretation ::: callout-important Comparing the two ANOVA tables, we see that the MSE in RCBD has decreased considerably in comparison to the CRD. This reduction in MSE can be viewed as the partition in SSE for the CRD (61.033) into SSBlock (53.32) + SSE (7.715). The potential reduction in SSE by blocking is offset to some degree by losing degrees of freedom for the blocks. But more often than not, is worth it in terms of the improvement in the calculated *F*-statistic. In our example, we observe that the *F*-statistic for the treatment has increased considerably for RCBD in comparison to CRD. It is reasonable to assume that the result from the RCBD is more valid than that from the CRD as the MSE value obtained after accounting for the block to block variability is a more accurate representation of the random error variance. ::: :::::: ## Example: Performance of Students at varied environment - **Background:** Comparing the performances of students (male and female) blocks in different environments (at home and at college). To represent this experiment in the figure will be as follows: ![](https://media.geeksforgeeks.org/wp-content/uploads/20201015122147/RCBDexample2.PNG) - Where *AC*: At College, *AH*: At Home ```{r} stud <- factor(rep(c("male", "female"), each = 2)) perf <- factor(rep(c("ah", "ac" ), times = 2)) perf ``` ```{r} y <- c(5.5, 5, 4, 6.2) # y is the hours students # studied in specific places results <- data.frame(y, stud, perf) fit <- aov(y ~ perf+stud, data = results) summary(fit) ``` - **Explanation**: The value of Mean Sq is `0.7225<<1.8225`,i.e, here blocking wasn’t necessary. And as Pr value is 0.642 \> 0.05 (5% significance) and the hypothesis is accepted - there is no sufficient evidence suggesting females and males have significant differences in performance. # Blocking factor or not? ::: discussion ## Effectiveness of Health Promotion Programs {.unnumbered} - Researchers are interested in comparing the effectiveness of three health promotion programs with nursing students. - The researcher has the following research question: *Which of the three health promotion programs are most effective at reducing unhealthy coping behaviors?* 1. Unhealthy coping behavior is measured as a composite score from the “Poor Coping Behavior” survey (PCB). High scores on the PCB indicate higher levels of unhealthy coping behaviors. Low scores indicate low levels of unhealthy coping behaviors. PCB scores can range from 0 (no unhealthy behaviors) to 100 (multiple unhealthy behaviors at high frequency and high intensity). 2. The researcher thinks that the health promotion programs for nursing students may have an effect on academic and well-being outcomes. 3. However, gender and status (traditional vs. non-traditional student) differences may influence the effectiveness of health promotion programs. ::: ------------------------------------------------------------------------ ### Blocking Factors? - The researcher has a sample of 200 nursing students from the state of Arkansas. 50 students are assigned to each program: Program A, Program B, and Program C. In addition, 50 students are assigned to a control group that receives the status quo educational model. ```{r} data.frame( Health_Program = c("Program A", "Program B", "Program C", "Control"), N = c(50, 50, 50, 50) ) |> kableExtra::kable() ``` ------------------------------------------------------------------------ ### Blocking Factors? ::: panel-tabset ## Visualization ![](images/clipboard-1200507560.png) ## Table ![](images/clipboard-2055304403.png) ::: ------------------------------------------------------------------------ ### Factors 1. What is the dependent variable? → [PCB SCORES]{.mohu} 2. What is the independent variable of interest? → [HEALTH PROGRAM (4 LEVELS)]{.mohu} 3. What is (are) the confounding variables, that the researcher might statistically control for? → [`GENDER`, `STATUS`]{.mohu} 4. What is (are) the blocking factors? → [`Schools`, `Classes`]{.mohu}

Overview

1.1 In last week (1)

1.2 In last week (2)

1.3 Features of RCBD

1.4 Random Effects of RCBD

1.5 Exercises

1.6 Other Aspects of the RCBD

1.7 Statistical form of RCBD

1.8 A little bit statistics 1

1.9 A littble bit more statistics

1.10 Example: Performance of detergents

1.11 Example: Total Sum of Square

1.12 Example: Sum of squares for Detergent

1.13 Example: Sum of squares for block

1.14 F-statistics

1.15 R Code for Sum of Squares

1.16 Example: Tutoring session

1.16.1 Example: Variables and Null Hypothesis

1.16.2 Example: Sum of Squares

1.16.3 Example: Mean of Squares and F-statistics

1.16.4 Example: Sum of Square Formula

1.16.5 Example: Marginal Means

1.16.6 Example: SS for Total and Model

1.16.7 Example: Details of ANOVA Table

1.16.8 Interpretation

1.17 Example: Hardness Reading2

1.17.1 Example: Block Design - CRD

1.17.2 Example: Block Design - RCBD

1.17.3 Example: Block Design Table - RCBD

1.17.4 Example: ANOVA Results (1)

1.17.5 Example: ANOVA Results (2)

1.17.6 Example: ANOVA Results (3)

1.18 Example: Plant Fertilizer3

1.18.1 Read in R Data

1.18.2 Block Design

1.18.3 ANOVA Results

1.18.4 Interpretation

1.19 Example: Performance of Students at varied environment

2 Blocking factor or not?

Effectiveness of Health Promotion Programs

2.0.1 Blocking Factors?

2.0.2 Blocking Factors?

2.0.3 Factors

1.8 A little bit statistics ¹

1.17 Example: Hardness Reading²

1.18 Example: Plant Fertilizer³