Educational Statistics and Research Methods (ESRM) Program*
University of Arkansas
Published
September 3, 2024
0.1 Presentation Outline
Centering and Coding Predictors
Interpreting Parameters in the Model for the Means
Main Effects Within Interactions
GLM Example 1: “Regression” vs. “ANOVA”
0.2 Today’s Example:
Study examining effect of new instruction method (New: 0=Old, 1=New) on test performance (Test: % correct) in college freshmen vs. senior (Senior: 0=Freshman, 1=Senior), n = 25 per group.
Each person’s expected (predicted) outcome is a function of his/her values on x and z (and their interaction), each measured once person
Estimated parameters are called fixed effects (here, \beta_0, \beta_1, \beta_2, and \beta_3); although they have a sampling distribution, they are not random variables
The number of fixed effects will show up in formulas as k (so k = 4here)
Model for the Variance:
e_p \sim N(0, \sigma^2_e) \rightarrow ONE residual (unexplained) deviation
e_p has a mean of 0 with some estimated constant variance \sigma^2_e, is normally distributed, is unrelated across people
Estimated parameter is the residual variance only (in the model above)
0.4 Representing the Effects of Predictor Variables
From now on, we will think carefully about how the predictor variables enter into the model for the means rather than the scales of predictors
Why don’t people always care about the scale of predictors:
Does NOT affect the amount of outcome variance account for (R^2)
Does NOT affect the outcomes values predicted by the model for the means (so long as the same predictor fixed effects are included)
Why should this matter to us?
Because the Intercept = expected outcome value when X = 0
Can end up with nonsense values for intercept if X = isn’t in the data
We will almost always need to deliberately adjust the scale of the predictor variables so that they have 0 values that could be observed in our data
Is much bigger deal in models with random effects (MLM) or GLM once interactions are included
0.5 Adjusting the Scale of Predictor Variables
For continuous (quantitative) predictors, we will make the intercept interpretable by centering:
Centering = subtract as constant from each person’s variable value so that the 0 value falls within the range of the new centered predictor variable
Typical \rightarrow Center around predictor’s mean: X_1^\prime = X_1 - \bar{X_1}
Better \rightarrow Center around meaningful constant C: X_1^\prime = X_1 - C
x <-rnorm(100, mean =1, sd =1)x_c <- x -mean(x)mean(x)mean(x_c)
[1] 1.063944
[1] 1.776357e-17
For categorical (grouping) predictors, either we or the program will make the intercept interpretable by creating a reference group:
Reference group is given a 0 value on all predictor variable created from the original group variable, such that the intercept is the expected outcome for that reference group specifically
Accomplished via “dummy coding” or “reference group coding”
For categorical predictors with more than two groups
We need to dummy coded the group variable using factor() in R
For example, I dummy coded group variable with group 1 as the reference
d1 = GroupT1 (0, 1, 0, 0) \rightarrow\beta_1 = mean difference between Treatment1 vs. Control
d2 = GroupT2 (0, 0, 1, 0) \rightarrow\beta_2 = mean difference between Treatment2 vs. Control
d3 = GroupT3 (0, 0, 0, 1) \rightarrow\beta_3 = mean difference between Treatment3 vs. Control
How does the model give us all possible group difference?
Control Mean
Treatment 1 Mean
Treatment 2 Mean
Treatment 3 Mean
\beta_0
\beta_0+\beta_1
\beta_0+\beta_2
\beta_0+\beta_3
The model (coefficients with dummy coding) directly provides 3 differences (control vs. each treatment), and indirectly provides another 3 differences (differences between treatments)
0.7 Group differences from Dummy Codes
Set J = 4 as number of groups:
The total number of group differences is J * (J-1) / 2 = 4*3/2 = 6
Control Mean
Treatment 1 Mean
Treatment 2 Mean
Treatment 3 Mean
\beta_0
\beta_0+\beta_1
\beta_0+\beta_2
\beta_0+\beta_3
All group differences
Alt Group
Ref Group
Difference
Control vs. T1
(\beta_0+\beta_1)
- \beta_0
= \beta_1
Control vs. T2
(\beta_0+\beta_2)
- \beta_0
= \beta_2
Control vs. T3
(\beta_0+\beta_3)
- \beta_0
= \beta_3
T1 vs. T2
(\beta_0+\beta_2)
-(\beta_0+\beta_1)
= \beta_2-\beta_1
T1 vs. T3
(\beta_0+\beta_3)
-(\beta_0+\beta_1)
= \beta_3-\beta_1
T2 vs. T3
(\beta_0+\beta_3)
-(\beta_0+\beta_2)
= \beta_3 - \beta_2
0.8 R Code: Estimating All Group Differences in R
⌘+C
library(multcomp)mod_e0 <-lm(Test ~ Group, data = dataTestExperiment)summary(mod_e0)
contrast_model_matrix =matrix(c(0, 1, 0, 0, # Control vs. T10, 0, 1, 0, # Control vs. T20, 0, 0, 1, # Control vs. T30, -1, 1, 0, # T1 vs. T20, -1, 0, 1, # T1 vs. T30, 0,-1, 1# T2 vs. T3), nrow =6, byrow = T)rownames(contrast_model_matrix) <-c( "Control vs. T1", "Control vs. T2", "Control vs. T3", "T1 vs. T2", "T1 vs. T3", "T2 vs. T3" ) contrast_value <-glht(mod_e0, linfct = contrast_model_matrix)summary(contrast_value)
Simultaneous Tests for General Linear Hypotheses
Fit: lm(formula = Test ~ Group, data = dataTestExperiment)
Linear Hypotheses:
Estimate Std. Error t value Pr(>|t|)
Control vs. T1 == 0 7.7600 0.7586 10.229 <0.001 ***
Control vs. T2 == 0 2.1600 0.7586 2.847 0.0274 *
Control vs. T3 == 0 6.8800 0.7586 9.069 <0.001 ***
T1 vs. T2 == 0 -5.6000 0.7586 -7.382 <0.001 ***
T1 vs. T3 == 0 -0.8800 0.7586 -1.160 0.6534
T2 vs. T3 == 0 4.7200 0.7586 6.222 <0.001 ***
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
(Adjusted p values reported -- single-step method)
0.9 What the intercept should mean to you
The model for the means will describe what happens to the predicted outcome Y “as X increases” or “as Z increases” and so forth
But you wont what Y is actually supposed to be unless you know where the predictor variables are starting from!
Therefor, the intercept is the “YOU ARE HERE” sign in the map of your data… so it should be somewhere in the map*!
1 Unit 2: Main Effects Within Interactions
1.1 Interaction Effects
Interaction = Moderation: the effect of a predictor depends on the value of the interacting predictor
Either predictor can be “the moderator” (interpretive distinction only)
Interaction can always be evaluated for any combination of categorical and continuous predictors, although
In “ANOVA”: By default, all possible interactions are estimated
Software does this for you; oddly enough, nonsignificant interactions usually still are kept in the model (even if only significant interactions are interpreted)
In “ANCOVA”: Continuous predictors (“covariates”) do not get to be part of interaction ➡️ make the “homogeneity of regression” assumption
There is no reason to assume this – it is a testable hypothesis!
In “Regression”: No default – effects of predictors are as you specify them
Requires most thought, but gets annoying because in regression programs you usually have to manually create the interaction as an observed variable:
e.g., XZ_interaction = centered_X * centered_Z
1.2 Main Effects in GLM with Interactions
Note
Main effects of predictors within interactions should remain in the model regardless of whether or not they are significant
Reason: the role of two-way interaction is to adjust its main effects
However, the original idea of a “main effect” no longer applied … each main effect is conditional on the interacting predictor as 0 (X_1X_2=0)
Example:
\beta_1 is the “simple” main effect of X1 when X2 = 0
\beta_1 + \beta_3 X_2 is the “conditional” main effect of X1 depending on X2 values
\beta_2 is the “simple” main effect of X2 when X1 = 0
\beta_2 + \beta_3 X_1 is the “conditional” main effect of X2 depending on X1 values
1.3 Model-Implied Simple Main Effects
Tip
The trick is keeping track of what 0 means for every interacting predictor, which depends on the way each predictor is being represented, as determined by you, or by the software without you!
Simple Main Effect = What it is + What modified it
where new conditional main effect\beta_2^{new} = \beta_2 + \beta_3 * T
1.7 Quiz
1.8 Testing the Significance of Model-Implied Fixed Effects
We now know how to calculate any conditional main effect:
Effect of interest (“conditional” main effect) = what it is + what modifies it
Outputed Effect (“simple” main effect) = what it is + what modifies it is 0
But if we want to test whether that new effect is \neq 0, we also need its standard error (SE needed to get Wald test T-value ➡️p-value)
Even if the conditional main effect is not directly given by the model, its estimate and SE are still implied by the model
3 options to get the new conditional main effect estimates and SE
Method 1: Ask the software to give it to you using your original model
e.g., glht function in R package multicomp, ESTIMATE in SAS, TEST in SPSS, NEW in Mplus
Model 2: Re-center your predictors to the interacting value of interest (e.g., make Exam = 3 the new 0 for \text{Exam}_C) and re-estimate your model; repeat as needed for each value of interest
Method 3: Hand calculations (what the program is doing for you in option #1)
Method 3 for example: Effect of Motiv = \beta_1 + \beta_3 * \text{Exam}
We have following formula to calculate the sampling error variance of “conditional” main effect as
Values come from “asymptotic (sampling) covariance matrix”
Variance of a sum of terms always includes covariance among them
Here, this is because what each main effect estimate could be is related to what the other main effect estimates could be
Note that if a main effect is unconditional, its SE^2 = Var(\beta) only
2 Unit 3: GLM Example 1
2.1 GLM via Dummy-Coding in “Regression”
# Model 1: 2 X 2 predictors with 0/1 codingmodel1 <-lm(Test ~ Senior + New + Senior * New, data = dataTestExperiment)# Alternativeformular_mod1 <-as.formula("Test ~ Senior + New + Senior * New")model1 <-lm(formular_mod1, data = dataTestExperiment)summary(model1)
Call:
lm(formula = formular_mod1, data = dataTestExperiment)
Residuals:
Min 1Q Median 3Q Max
-6.36 -2.08 0.04 1.83 6.64
Coefficients:
Estimate Std. Error t value Pr(>|t|)
(Intercept) 80.2000 0.5364 149.513 < 2e-16 ***
Senior 2.1600 0.7586 2.847 0.00539 **
New 7.7600 0.7586 10.229 < 2e-16 ***
Senior:New -3.0400 1.0728 -2.834 0.00561 **
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Residual standard error: 2.682 on 96 degrees of freedom
Multiple R-squared: 0.6013, Adjusted R-squared: 0.5888
F-statistic: 48.26 on 3 and 96 DF, p-value: < 2.2e-16
anova(model1)
Analysis of Variance Table
Response: Test
Df Sum Sq Mean Sq F value Pr(>F)
Senior 1 10.24 10.24 1.4235 0.235762
New 1 973.44 973.44 135.3253 < 2.2e-16 ***
Senior:New 1 57.76 57.76 8.0297 0.005609 **
Residuals 96 690.56 7.19
---
Signif. codes: 0 '***' 0.001 '**' 0.01 '*' 0.05 '.' 0.1 ' ' 1
Note
These ANOVA table is displaying marginal tests (F-test) for the main effects. Marginal tests are for the main effect only and are not conditional on any interacting variables.
2.2 Getting Each of Means as a Contrast
We can get group means of test scores with Senior/Freshman by New/Old combination
glht() requests predicted outcomes from model for the means:
Paste your R syntax that can calculate the conditional main effect of New (New vs. Old) when Senior = 1 (Senior-Old, Senior-New) standard errors. For example for conditional main effect of senior when new = 1:
---title: "Lecture 03: Simple, Marginal, and Interaction Effects"subtitle: "More about general linear model"author: "Jihong Zhang*, Ph.D"institute: | Educational Statistics and Research Methods (ESRM) Program* University of Arkansasdate: "2024-09-03"sidebar: falseexecute: echo: true warning: falseoutput-location: columnformat: html: page-layout: full toc: true toc-depth: 2 toc-expand: true lightbox: true code-fold: false uark-revealjs: scrollable: true chalkboard: true embed-resources: false code-fold: false number-sections: false footer: "ESRM 64503: Lecture 02 - Descriptive Statistics" slide-number: c/t tbl-colwidths: auto output-file: slides-index.html---## Presentation Outline1. Centering and Coding Predictors2. Interpreting Parameters in the Model for the Means3. Main Effects Within Interactions4. GLM Example 1: "Regression" vs. "ANOVA"## Today's Example:- Study examining effect of new instruction method (**New**: 0=Old, 1=New) on test performance (**Test**: % correct) in college freshmen vs. senior (**Senior**: 0=Freshman, 1=Senior), n = 25 per group.$$\text{Test}_p =\beta_0+\beta_1\text{Senior}_p+\beta_2\text{New}_p+ \beta_3 \text{Senior}_p\text{New}_p+ e_p$$```{r}#| code-fold: true#| output-location: defaultlibrary(ESRM64503)library(tidyverse)library(kableExtra)library(gt)data("dataTestExperiment")dataTestExperiment |>group_by(Senior, New) |>summarise(Mean =mean(Test),SD =sd(Test),SE =sd(Test) /sqrt(25) ) |>mutate(Statistics =paste0(signif(Mean, 4), "(", round(SD, 2), ")", "[", round(SE, 2), "]") ) |> dplyr::select(-c(Mean, SD, SE)) |>mutate(Senior =factor(Senior, levels =0:1, labels =c("Freshmen", "Senior")),New =factor(New, levels =0:1, labels =c("Old", "New")), ) |>pivot_wider(names_from = Senior, values_from = Statistics) |>kable()```::: callout-noteTest Mean (SD), \[SE = $\frac{SD}{\sqrt{n}}$\]:::## The Two Sides of a Model$$\text{Test}_p =\color{red}{\beta_0+\beta_1\text{Senior}_p+\beta_2\text{New}_p+ \beta_3 \text{Senior}_p\text{New}_p}+ \color{blue}{e_p}$$- [[Model for the Means (Predicted Values)]{.underline}]{style="color: red"} - Each person's expected (predicted) outcome is a function of his/her values on x and z (and their interaction), each measured once person - **Estimated parameters are called fixed effects** (here, $\beta_0$, $\beta_1$, $\beta_2$, and $\beta_3$); although they have a sampling distribution, they are not random variables - The number of fixed effects will show up in formulas as ***k*** (so ***k = 4***here)- [[Model for the Variance]{.underline}]{style="color: blue"}: - $e_p \sim N(0, \sigma^2_e) \rightarrow$ ONE residual (unexplained) deviation - $e_p$ has a mean of 0 with some estimated constant variance $\sigma^2_e$, is normally distributed, is unrelated across people - Estimated parameter is the residual variance only (in the model above)## Representing the Effects of Predictor Variables- From now on, we will think carefully about how the predictor variables enter into the [model for the means]{style="color: red;"} rather than the scales of predictors- Why don't people always care about **the scale of predictors**: 1. Does NOT affect the amount of outcome variance account for ($R^2$) 2. Does NOT affect the outcomes values predicted by the model for the means (so long as the same predictor fixed effects are included)- Why should this matter to us? 1. Because the **Intercept = expected outcome value when X = 0** 2. Can end up with nonsense values for intercept if X = isn't in the data 3. We will almost always need to deliberately **adjust the scale of the predictor variables** so that they have 0 values that could be observed in our data 4. Is much bigger deal in models with random effects (MLM) or GLM once interactions are included## Adjusting the Scale of Predictor Variables- For **continuous** (quantitative) predictors, we will make the intercept interpretable by centering: - **Centering** = subtract as constant from each person's variable value so that the 0 value falls within the range of the new centered predictor variable - Typical $\rightarrow$ Center around predictor's mean: $X_1^\prime = X_1 - \bar{X_1}$ - Better $\rightarrow$ Center around meaningful constant C: $X_1^\prime = X_1 - C$```{r}#| results: holdx <-rnorm(100, mean =1, sd =1)x_c <- x -mean(x)mean(x)mean(x_c)```------------------------------------------------------------------------- For **categorical** (grouping) predictors, [**either we or the program**]{.underline} will make the intercept interpretable by **creating a reference group:** - Reference group is given a 0 value on all predictor variable created from the original group variable, such that the intercept is the expected outcome for that reference group specifically - Accomplished via "dummy coding" or "reference group coding"- For **categorical** predictors with **more than two groups** - We need to dummy coded the group variable using `factor()` in R - For example, I dummy coded `group` variable with group 1 [as the reference]{style="color:red"}```{r}#| results: holddataTestExperiment$Group <-factor(dataTestExperiment$Group, levels =1:4, labels =c("Ctl", "T1", "T2", "T3"))mod_e0 <-lm(Test ~ Group, data = dataTestExperiment)summary(mod_e0)$coefficients |>kable(digits =3) ```------------------------------------------------------------------------- Variable `edu`: \# dummy variables = \# group -1 - Dummy variables with four levels - Control, Treatment1, Treatment2, Treatment3: - d1 = *GroupT1* ([0, 1, 0, 0]{style="color: tomato;"}) $\rightarrow$ $\beta_1$ = difference between Treatment1 vs. Control - d2 = *GroupT2* ([0, 0, 1, 0]{style="color: tomato"}) $\rightarrow$ $\beta_2$ = difference between Treatment2 vs. Control - d3 = *GroupT3* ([0, 0, 0, 1]{style="color: tomato"}) $\rightarrow$ $\beta_3$ = difference between Treatment3 vs. Control```{r}#| output-location: defaultmodel.matrix(mod_e0) |>cbind(dataTestExperiment) |>showtable(font_size =33)```------------------------------------------------------------------------- Other examples of things people do to categorical predictors: - "**Contrast/effect coding**" $\rightarrow$ Gender: -0.5 = Man, 0.5 = Women - **Effect**: Man vs. Average \| Women vs. Average - Test other contrasts among multiple groups - Four-group variable: Control, Treatment1, Treatment2, Treatment3 - **Effect**: contrast1 = {-1, .33, .33, .34} - Control vs. Any Treatment?## Categorical Predictors: Manual Coding- $$ \hat Y_p = \beta_0+\beta_1\text{GroupT1}_p+\beta_2\text{GroupT2}_p + \beta_3\text{GroupT3}_p $$ - d1 = *GroupT1* ([0, 1, 0, 0]{style="color: tomato;"}) $\rightarrow$ $\beta_1$ = mean difference between Treatment1 vs. Control - d2 = *GroupT2* ([0, 0, 1, 0]{style="color: tomato"}) $\rightarrow$ $\beta_2$ = mean difference between Treatment2 vs. Control - d3 = *GroupT3* ([0, 0, 0, 1]{style="color: tomato"}) $\rightarrow$ $\beta_3$ = mean difference between Treatment3 vs. Control- How does the model give us **all possible group difference**?| Control Mean | Treatment 1 Mean | Treatment 2 Mean | Treatment 3 Mean ||:------------:|:-----------------:|:-----------------:|:-----------------:|| $\beta_0$ | $\beta_0+\beta_1$ | $\beta_0+\beta_2$ | $\beta_0+\beta_3$ |- The model (coefficients with dummy coding) directly provides 3 differences (control vs. each treatment), and indirectly provides another 3 differences (differences between treatments)## Group differences from Dummy CodesSet $J = 4$ as number of groups:The total number of group differences is $J * (J-1) / 2 = 4*3/2 = 6$| Control Mean | Treatment 1 Mean | Treatment 2 Mean | Treatment 3 Mean ||:------------:|:-----------------:|:-----------------:|:-----------------:|| $\beta_0$ | $\beta_0+\beta_1$ | $\beta_0+\beta_2$ | $\beta_0+\beta_3$ |- All group differences| | Alt Group | Ref Group | Difference ||------------------|-----------------:|:-----------------|:-----------------|| Control vs. T1 | $(\beta_0+\beta_1)$ | \- $\beta_0$ | = $\beta_1$ || Control vs. T2 | $(\beta_0+\beta_2)$ | \- $\beta_0$ | = $\beta_2$ || Control vs. T3 | $(\beta_0+\beta_3)$ | \- $\beta_0$ | = $\beta_3$ || T1 vs. T2 | $(\beta_0+\beta_2)$ | \-$(\beta_0+\beta_1)$ | = $\beta_2-\beta_1$ || T1 vs. T3 | $(\beta_0+\beta_3)$ | \-$(\beta_0+\beta_1)$ | = $\beta_3-\beta_1$ || T2 vs. T3 | $(\beta_0+\beta_3)$ | \-$(\beta_0+\beta_2)$ | = $\beta_3 - \beta_2$ |## R Code: Estimating All Group Differences in R```{r}#| code-fold: true#| results: hide#| output-location: defaultlibrary(multcomp)mod_e0 <-lm(Test ~ Group, data = dataTestExperiment)summary(mod_e0)``````{r}#| output-location: defaultcontrast_model_matrix =matrix(c(0, 1, 0, 0, # Control vs. T10, 0, 1, 0, # Control vs. T20, 0, 0, 1, # Control vs. T30, -1, 1, 0, # T1 vs. T20, -1, 0, 1, # T1 vs. T30, 0,-1, 1# T2 vs. T3), nrow =6, byrow = T)rownames(contrast_model_matrix) <-c( "Control vs. T1", "Control vs. T2", "Control vs. T3", "T1 vs. T2", "T1 vs. T3", "T2 vs. T3" ) contrast_value <-glht(mod_e0, linfct = contrast_model_matrix)summary(contrast_value)```## What the intercept should mean to you- **The model for the means** will describe what happens to the predicted outcome Y "as X increases" or "as Z increases" and so forth- But you wont what Y is actually supposed to be unless you know where the predictor variables are starting from!- Therefor, the intercept is the "YOU ARE HERE" sign in the map of your data... so it should be somewhere in the map\*!```{r}#| echo: false#| fig-align: centerTestGroup <- dataTestExperiment |>group_by(Group) |>summarise(GroupMean =mean(Test))ggplot(dataTestExperiment) +geom_point(aes(y = Test, x = Group, col = Group)) +geom_point(aes(y = GroupMean, x = Group, col = Group), data = TestGroup, size =4) +geom_label(aes(x ="Ctl", y =80.20, label ="You are here"), nudge_x = .3, nudge_y =1) +theme_bw() +theme(text =element_text(size =33))```# Unit 2: Main Effects Within Interactions## Interaction Effects- **Interaction = Moderation**: the effect of a predictor depends on the value of the interacting predictor - Either predictor can be "the moderator" (interpretive distinction only)- Interaction can always be evaluated for any combination of categorical and continuous predictors, although 1. In “**ANOVA**”: By default, all possible interactions are estimated - Software does this for you; oddly enough, nonsignificant interactions usually still are kept in the model (even if only significant interactions are interpreted) 2. In “**ANCOVA**”: Continuous predictors (“covariates”) do not get to be part of interaction ➡️ make the “homogeneity of regression” assumption - There is no reason to assume this – it is a testable hypothesis! 3. In “**Regression**”: No default – effects of predictors are as you specify them - Requires most thought, but gets annoying because in regression programs you usually have to manually create the interaction as an observed variable: - e.g., XZ_interaction = centered_X \* centered_Z## Main Effects in GLM with Interactions::: callout-noteMain effects of predictors within interactions should remain in the model regardless of whether or not they are significant:::😶: $Y_p = \beta_0 + \beta_1 X_1 X_2$😄: $Y_p = \beta_0 + \beta_1X_1+\beta_2X_2+\beta_3 X_1 X_2$- **Reason**: the role of two-way interaction is to adjust its main effects- **However**, the original idea of a "main effect" **no longer applied** ... each main effect is **conditional** on the interacting predictor as 0 ($X_1X_2=0$)- Example: - $\beta_1$ is the "**simple**" main effect of X1 when X2 = 0 - $\beta_1 + \beta_3 X_2$ is the "**conditional**" main effect of X1 depending on X2 values - $\beta_2$ is the "simple" main effect of X2 when X1 = 0 - $\beta_2 + \beta_3 X_1$ is the "**conditional**" main effect of X2 depending on X1 values## Model-Implied Simple Main Effects::: callout-tipThe trick is keeping track of what 0 means for every interacting predictor, which depends on the way each predictor is being represented, as determined by you, or by the software without you!:::**Simple Main Effect = What it is + What modified it**$\text{GPA}_p = 30 + 1\times \text{Motiv}_p +2 \times\text{Exam}_p+ 0.5 \times \text{Motiv}\times \text{Exam}_p$- GPA scores of anyone can be predicted by academic motivation, final exam scores, and their interaction by this model- Simple Main Effect of *Motiv* : $(1 + 0.5\times \text{Exam})$ - = 1 if Exam is 0; = 2 if Exam is 1; = 3 if Exam is 4<!-- -->- Simple Main Effect of *Exam* : $(2 + 0.5\times \text{Motiv})$ - = 2 if Motiv is 0; = 3 if Motiv is 1; = 4 if Motiv is 4## Interpretation$$\begin{aligned}\text{GPA}_p = \beta_0 + \beta_1\times \text{Motiv}_p + \beta_2 \times\text{Exam}_p+ \beta_3 \times \text{Motiv}\times \text{Exam}_p\\= 30 + 1\times \text{Motiv}_p +2 \times\text{Exam}_p+ 0.5 \times \text{Motiv}\times \text{Exam}_p\end{aligned}$$- $\beta_0$: Expected GPA when motivation is 0 and exam score is 0- $\beta_1$: Increase in GPA per unit motivation when exam score is 0- $\beta_2$: Increase in GPA per unit exam score when motivation is 0- $\beta_3$: Two ways of interpretation - **Motivation is moderator**: Increase in effect of exam scores per unit motivation - One unit of motivation leads to the effect of exam score $2 \rightarrow 2.5$ - **Exam Score is moderator**: Increase in effect of motivation per unit exam score - One unit of exam score leads to the effect of motivation $1 \rightarrow 1.5$::: callout-noteIf interaction effect is significant, we typically report that motivation significantly moderates the effect of exam score on GPA:::## Why centering mattersWhen we **centered Exam Score** with 3 as centering point, then **intercept and the main effect** of motivation will change:$$\begin{align}\text{GPA}_p = \color{tomato}{30} + \color{tomato}{1}\times \text{Motiv}_p +2 \times\text{Exam}_p+ 0.5 \times \text{Motiv}\times \text{Exam}_p\\= \color{tomato}{\beta_0^\prime} + \color{tomato}{\beta_1\prime} \times \text{Motiv}_p + \beta_2 \times (\text{Exam}_p-3)+ \beta_3 \times \text{Motiv}\times (\text{Exam}_p-3)\\= \color{red}{36} + \color{red}{2.5}\times \text{Motiv}_p +2 \times (\text{Exam}_p-3) + 0.5 \times \text{Motiv}\times (\text{Exam}_p-3)\end{align}$$- **Trick of new coefficients**: Predicted value stay the same - Expected GPA score is [30 + 1\*0 + 2\*3 + 0.5\*0 = 36]{style="color: red"} when Motiv = 0 and Exam = 3 - Expected effect of Motiv is [1 + 0.5\*3 = 2.5]{style="color: red"} when Exam = 3- **Reason**: $\beta_0$ and $\beta_1$ are conditional on exam score while $\beta_2$ and $\beta_3$ are unconditional on exam score## Simple and Conditional Main Effect- **Conditional** main effect for `Motiv` depending on value of `Exam`: - $(1 + 0.5\times\text{Exam})$ or $[2.5 + 0.5 \times(\text{Exam}-3)]$ - **"Simple" main effect** is 1 when Exam = 0; 2.5 when Exam = 3 - Assume centering point of Exam is **T**, we can get **the general form** of simple main effect of Motivation as: - $f(\beta_1,\beta_3,\text{Exam})= (\beta_1 + \beta_3*T) + \beta_3*(\text{Exam} - T) = \beta_1^{new}+\beta_3*\text{Exam}_C$ - where new **conditional main effect** $\beta_1^{new} = \beta_1+\beta_3*T$- Similarly, we can get the general form of conditional main effect of `Exam` as - $f(\beta_2,\beta_3,\text{Motiv})= (\beta_2 + \beta_3*T) + \beta_3*(\text{Motiv} - T)$ - where new **conditional main effect** $\beta_2^{new} = \beta_2 + \beta_3 * T$## Quiz```{=html}<iframe width="100%" height="800px" src="https://forms.office.com/r/WubsScScRz?embed=true" frameborder="0" marginwidth="0" marginheight="0" style="border: none; max-width:100%; max-height:100vh" allowfullscreen webkitallowfullscreen mozallowfullscreen msallowfullscreen></iframe>```## Testing the Significance of Model-Implied Fixed Effects- We now know how to calculate any conditional main effect: - Effect of interest ("**conditional" main effect**) = what it is + what modifies it - Outputed Effect (**"simple" main effect**) = what it is + what modifies it is 0- But if we want to test whether that new effect is $\neq 0$, we also need its standard error (SE needed to get **Wald test T-value ➡️*p*-value**)- Even if the conditional main effect is not *directly* given by the model, its estimate and SE are still *implied* by the model------------------------------------------------------------------------- 3 options to get the new conditional main effect estimates and SE - **Method 1: Ask the software to give it to you** using your original model - e.g., `glht` function in R package `multicomp`, `ESTIMATE` in SAS, `TEST` in SPSS, `NEW` in Mplus - **Model 2: Re-center your predictors** to the interacting value of interest (e.g., make Exam = 3 the new 0 for $\text{Exam}_C$) and re-estimate your model; repeat as needed for each value of interest - **Method 3: Hand calculations** (what the program is doing for you in option #1)------------------------------------------------------------------------- Method 3 for example: Effect of Motiv = $\beta_1 + \beta_3 * \text{Exam}$ - We have following formula to calculate the **sampling error variance** of "conditional" main effect as - $$ \mathbf{SE_{\beta_1^{New}}^2 = Var(\beta_1) + Var(\beta_3) * Exam + 2Cov(\beta_1, \beta_3)*Exam} $$ - Values come from "asymptotic (sampling) covariance matrix" - Variance of a sum of terms always includes covariance among them - Here, this is because what each main effect estimate could be is related to what the other main effect estimates could be - Note that if a main effect is unconditional, its $SE^2 = Var(\beta)$ only# Unit 3: GLM Example 1## GLM via Dummy-Coding in "Regression"```{r}#| output-location: default# Model 1: 2 X 2 predictors with 0/1 codingmodel1 <-lm(Test ~ Senior + New + Senior * New, data = dataTestExperiment)# Alternativeformular_mod1 <-as.formula("Test ~ Senior + New + Senior * New")model1 <-lm(formular_mod1, data = dataTestExperiment)summary(model1)``````{r}#| output-location: defaultanova(model1)```::: callout-noteThese ANOVA table is displaying **marginal tests** (F-test) for the main effects. Marginal tests are for the main effect only and are not conditional on any interacting variables.:::## Getting Each of Means as a ContrastWe can get group means of test scores with Senior/Freshman by New/Old combination`glht()` requests predicted outcomes from model for the means:$$\mathbf{\hat{Test} =\beta_0+\beta_1Senior+\beta_2New+\beta_3SeniorNew}$$- Freshmen-Old: $\mathbf{\hat{Test_1}=\beta_0*1+\beta_1*0+\beta_2*0 +\beta_3*0}$- Freshmen-New: $\mathbf{\hat{Test_2}=\beta_0*1+\beta_1*0+\beta_2*1 +\beta_3*0}$- Senior-Old: $\mathbf{\hat{Test_3}=\beta_0*1+\beta_1*1+\beta_2*0 +\beta_3*0}$- Senior-New: $\mathbf{\hat{Test_4}=\beta_0*1+\beta_1*1+\beta_2*1 +\beta_3*1}$```{r}#| output-location: defaultcontrast_model1_matrix =matrix(c(1, 0, 0, 0, # Freshman-Old1, 0, 1, 0, # Freshman-New1, 1, 0, 0, # Senior-Old1, 1, 1, 1# Senior-New), nrow =4, byrow = T)rownames(contrast_model1_matrix) <-c( "Freshman-Old", "Freshman-New", "Senior-Old", "Senior-New") means_model1 <-glht(model1, linfct = contrast_model1_matrix)summary(means_model1)```## Standard Errors of Group Means- Freshman-new group mean's standard error:$$\mathbf{SE^2(\hat{Test_1}) = Var(\beta_0)}$$- Freshman-old group mean's standard error:$$\mathbf{SE^2(\hat{Test_2}) = Var(\beta_0) + Var(\beta_2) + 2Cov(\beta_0,\beta_2)}$$```{r}#| output-location: defaultvcov(model1)# Variance-covariance of betas# Standard error of freshman-new groupVar_Test_1 <- Var_beta_0 <-vcov(model1)[1,1](SE_Test_1 <-sqrt(Var_Test_1))# Standard error of freshman-old groupVar_beta_0 <-vcov(model1)[1,1]Var_beta_2 <-vcov(model1)[3,3]Cov_beta_0_2 <-vcov(model1)[1,3]Var_Test_2 <- Var_beta_0 + Var_beta_2 +2*Cov_beta_0_2(SE_Test_2 <-sqrt(Var_Test_2))```::: callout-caution## Your Homework 1Paste your R syntax that can calculate the other two groups' (Senior-Old, Senior-New) standard errors. For example for Freshman-New:> Var_beta_0 \<- vcov(model1)\[1,1\]>> Var_beta_2 \<- vcov(model1)\[3,3\]>> Cov_beta_0_2 \<- vcov(model1)\[1,3\]>> Var_Test_2 \<- Var_beta_0 + Var_beta_2 + 2\*Cov_beta_0_2>> (SE_Test_2 \<- sqrt(Var_Test_2)) \[1\] 0.5364078:::## Standard errors of Main Effects```{r}#| output-location: defaulteffect_model1_matrix =matrix(c(0, 1, 0, 0, # Senior vs. Freshmen | Old0, 1, 0, 1, # Senior vs. Freshmen | New0, 0, 1, 0, # New vs. Old | Frenshmen0, 0, 1, 1# New vs. Old | Senior), nrow =4, byrow = T)rownames(effect_model1_matrix) <-c( "beta_1_old", "beta_1_new", "beta_2_fresh", "beta_2_senior") effect_model1 <-glht(model1, linfct = effect_model1_matrix)summary(effect_model1)```------------------------------------------------------------------------**Example: Conditional main effect** of Senior (Senior vs. Freshmen) when using the new method (New = 1)- $$ \mathbf{\beta_1^{new} = \beta_1+\beta_3} $$ $$ \mathbf{SE_{\beta_1^{New}}^2 = Var(\beta_1) + Var(\beta_3) * New + 2Cov(\beta_1, \beta_3)*New} $$```{r}#| output-location: default## conditional main effect of senior when newfixed_effects <-summary(model1)$coefficientsfixed_effectsbeta_1 <- fixed_effects[2, 1]beta_3 <- fixed_effects[4, 1](beta_1_new <- beta_1+beta_3)vcov(model1) # Variance-covariance of betasVar_beta_1 =vcov(model1)[2,2]Var_beta_3 =vcov(model1)[4,4]Cov_beta_1_3 =vcov(model1)[4,2]Var_beta_1_new <- Var_beta_1 + Var_beta_3 *1+2* Cov_beta_1_3 *1(SE_beta_1_new <-sqrt(Var_beta_1_new))```::: callout-caution## Your Homework 1Paste your R syntax that can calculate the conditional main effect of New (New vs. Old) when Senior = 1 (Senior-Old, Senior-New) standard errors. For example for conditional main effect of senior when new = 1:> Var_beta_1 = vcov(model1)\[2,2\]>> Var_beta_3 = vcov(model1)\[4,4\]>> Cov_beta_1_3 = vcov(model1)\[4,2\]>> Var_beta_1_new \<- Var_beta_1 + Var_beta_3 \* 1 + 2 \* Cov_beta_1_3 \* 1>> (SE_beta_1_new \<- sqrt(Var_beta_1_new)) \[1\] 0.7585952:::