Analysis of Variance (ANOVA) is a statistical method for comparing means across multiple groups. It's a powerful tool that extends beyond simple t-tests, allowing researchers to analyze complex experimental designs and identify significant differences between groups.
ANOVA comes in various forms, including one-way, two-way, and repeated measures. Each type addresses different research questions and experimental setups, providing insights into main effects, interactions, and within-subject changes. Understanding ANOVA is crucial for interpreting experimental results across many fields.
Overview of ANOVA
- Analysis of Variance (ANOVA) is a statistical method used to compare means across three or more groups or conditions
- ANOVA helps determine if there are significant differences between the means of different groups
- ANOVA is an essential tool in Probability and Statistics for analyzing data from experiments or observational studies
ANOVA vs other statistical tests
- ANOVA is used when comparing means across three or more groups, while t-tests are used for comparing means between two groups
- ANOVA compares the means of a continuous dependent variable across categorical groups, while chi-square tests compare frequencies or proportions across categories
- ANOVA is more versatile than other tests as it can be used for one-way, two-way, or multi-way designs
Applications of ANOVA
- ANOVA is commonly used in psychology, biology, and social sciences to compare treatment effects or group differences
- In market research, ANOVA can be used to compare customer preferences or satisfaction levels across different product categories or brands
- ANOVA is also used in quality control to test if there are significant differences in product quality across different production batches or manufacturing plants
One-way ANOVA
- One-way ANOVA is used when there is a single categorical independent variable (factor) with three or more levels
- The purpose of one-way ANOVA is to determine if there are significant differences in the means of the dependent variable across the levels of the independent variable
- One-way ANOVA is an extension of the independent samples t-test for situations with more than two groups
Assumptions of one-way ANOVA
- Independence of observations: Observations within each group should be independent of each other
- Normality: The dependent variable should be approximately normally distributed within each group
- Homogeneity of variances: The variance of the dependent variable should be equal across all groups (homoscedasticity)
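These assumptions can be checked before running the test. A minimal sketch, using hypothetical simulated groups, scipy's Shapiro-Wilk test for normality within each group, and Levene's test for homogeneity of variances:

```python
import numpy as np
from scipy import stats

rng = np.random.default_rng(42)
# Hypothetical data: three independent groups drawn from normal distributions
groups = [rng.normal(loc=m, scale=2.0, size=30) for m in (10, 11, 12)]

# Normality within each group: Shapiro-Wilk (null hypothesis: data are normal)
shapiro_ps = [stats.shapiro(g).pvalue for g in groups]

# Homogeneity of variances: Levene's test (null hypothesis: equal variances)
levene_p = stats.levene(*groups).pvalue

print([round(p, 3) for p in shapiro_ps], round(levene_p, 3))
```

Large p-values here mean the data give no evidence against the assumptions, not proof that the assumptions hold.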
Steps in one-way ANOVA
- State the null and alternative hypotheses
- Calculate the total sum of squares (SST), sum of squares between groups (SSB), and sum of squares within groups (SSW)
- Calculate the degrees of freedom for each sum of squares
- Calculate the mean squares between groups (MSB) and mean squares within groups (MSW)
- Calculate the F-statistic by dividing MSB by MSW
- Determine the p-value associated with the F-statistic
- Compare the p-value to the chosen significance level (e.g., 0.05) and make a decision to reject or fail to reject the null hypothesis
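The steps above can be sketched directly with numpy on small hypothetical groups, cross-checking the result against scipy's built-in one-way ANOVA:

```python
import numpy as np
from scipy import stats

# Hypothetical data: k = 3 groups of 5 observations each
g1 = np.array([4.0, 5.0, 6.0, 5.5, 4.5])
g2 = np.array([6.0, 7.0, 6.5, 7.5, 6.0])
g3 = np.array([8.0, 7.5, 9.0, 8.5, 8.0])
groups = [g1, g2, g3]

all_obs = np.concatenate(groups)
grand_mean = all_obs.mean()
k, N = len(groups), all_obs.size

# Sums of squares, following the steps listed above
ssb = sum(len(g) * (g.mean() - grand_mean) ** 2 for g in groups)
ssw = sum(((g - g.mean()) ** 2).sum() for g in groups)
sst = ((all_obs - grand_mean) ** 2).sum()   # equals SSB + SSW

# Degrees of freedom, mean squares, F-statistic, and p-value
df_between, df_within = k - 1, N - k
msb, msw = ssb / df_between, ssw / df_within
F = msb / msw
p = stats.f.sf(F, df_between, df_within)

# Cross-check against scipy's built-in implementation
F_ref, p_ref = stats.f_oneway(g1, g2, g3)
print(round(F, 3), round(p, 4))
```

The identity SST = SSB + SSW is what lets ANOVA partition total variability into between-group and within-group components.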
F-statistic in one-way ANOVA
- The F-statistic in one-way ANOVA is the ratio of the variance between groups to the variance within groups
- A larger F-statistic indicates a greater difference in means between groups relative to the variability within groups
- The F-statistic follows an F-distribution with degrees of freedom determined by the number of groups and the total sample size
P-value interpretation
- The p-value in one-way ANOVA represents the probability of obtaining an F-statistic as extreme as or more extreme than the observed F-statistic, assuming the null hypothesis is true
- A small p-value (typically < 0.05) suggests that there is strong evidence against the null hypothesis, indicating significant differences in means between groups
Limitations of one-way ANOVA
- One-way ANOVA only determines if there are significant differences between groups but does not specify which groups differ
- One-way ANOVA assumes that the groups are independent and that the dependent variable is normally distributed within each group
- Violations of the assumptions (e.g., non-normality, heteroscedasticity) can affect the validity of the results
Two-way ANOVA
- Two-way ANOVA is used when there are two categorical independent variables (factors) and one continuous dependent variable
- The purpose of two-way ANOVA is to examine the main effects of each independent variable and the interaction effect between the two independent variables on the dependent variable
- Two-way ANOVA allows researchers to test if the effect of one independent variable on the dependent variable depends on the level of the other independent variable
Assumptions of two-way ANOVA
- Independence of observations: Observations within each cell should be independent of each other
- Normality: The dependent variable should be approximately normally distributed within each cell
- Homogeneity of variances: The variance of the dependent variable should be equal across all cells (homoscedasticity)
Main effects vs interaction effects
- Main effects are the effects of each independent variable on the dependent variable, ignoring the other independent variable
- Interaction effects occur when the effect of one independent variable on the dependent variable depends on the level of the other independent variable
- A significant interaction effect suggests that the main effects should be interpreted with caution, as they may not tell the whole story
Steps in two-way ANOVA
- State the null and alternative hypotheses for main effects and interaction effect
- Calculate the total sum of squares (SST), sum of squares for factor A (SSA), sum of squares for factor B (SSB), sum of squares for interaction (SSAB), and sum of squares for error (SSE)
- Calculate the degrees of freedom for each sum of squares
- Calculate the mean squares for factor A (MSA), factor B (MSB), interaction (MSAB), and error (MSE)
- Calculate the F-statistics for factor A, factor B, and interaction by dividing their respective mean squares by the mean square for error
- Determine the p-values associated with each F-statistic
- Compare the p-values to the chosen significance level (e.g., 0.05) and make decisions to reject or fail to reject the null hypotheses for main effects and interaction effect
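With statsmodels available, the whole procedure reduces to fitting a factorial linear model and requesting the ANOVA table. A minimal sketch on hypothetical data; the factor names A and B and the simulated effect sizes are illustrative:

```python
import numpy as np
import pandas as pd
import statsmodels.api as sm
from statsmodels.formula.api import ols

rng = np.random.default_rng(0)
# Hypothetical 2x3 factorial design: factor A (2 levels), factor B (3 levels),
# 10 observations per cell
a = np.repeat(["a1", "a2"], 30)
b = np.tile(np.repeat(["b1", "b2", "b3"], 10), 2)
y = rng.normal(size=60) + (a == "a2") * 1.0 + (b == "b3") * 0.5
df = pd.DataFrame({"A": a, "B": b, "y": y})

# Fit the full factorial model (main effects plus interaction)
model = ols("y ~ C(A) * C(B)", data=df).fit()
table = sm.stats.anova_lm(model, typ=2)   # F and p for A, B, and A:B
print(table)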
F-statistics for main and interaction effects
- The F-statistic for a main effect is the ratio of the variance explained by that factor to the unexplained variance (error)
- The F-statistic for the interaction effect is the ratio of the variance explained by the interaction to the unexplained variance (error)
- Larger F-statistics indicate a greater effect of the factor or interaction on the dependent variable
P-value interpretation
- The p-values for main effects and interaction effect in two-way ANOVA represent the probability of obtaining F-statistics as extreme as or more extreme than the observed F-statistics, assuming the respective null hypotheses are true
- Small p-values (typically < 0.05) suggest strong evidence against the null hypotheses, indicating significant main effects or interaction effect
Limitations of two-way ANOVA
- Two-way ANOVA assumes that the groups are independent and that the dependent variable is normally distributed within each cell
- Violations of the assumptions (e.g., non-normality, heteroscedasticity) can affect the validity of the results
- Two-way ANOVA does not provide information about the direction or magnitude of the effects, only their significance
ANOVA post-hoc tests
- Post-hoc tests are conducted after a significant ANOVA result to determine which specific group means differ from each other
- Post-hoc tests help control the familywise error rate, which is the probability of making at least one Type I error when conducting multiple comparisons
Purpose of post-hoc tests
- To identify which specific group means are significantly different from each other
- To provide more detailed information about the nature of the differences between groups
- To control the familywise error rate and maintain the overall Type I error rate at the desired level (e.g., 0.05)
Types of post-hoc tests
- Bonferroni correction: Adjusts the significance level for each comparison by dividing the desired familywise error rate by the number of comparisons
- Tukey's HSD (Honestly Significant Difference) test: Controls the familywise error rate and is more powerful than the Bonferroni correction when the number of comparisons is large
- Dunnett's test: Used when comparing each treatment group to a control group
- Scheffe's test: More conservative than Tukey's HSD test and can be used for both pairwise and complex comparisons
Bonferroni correction
- The Bonferroni correction adjusts the significance level for each comparison to control the familywise error rate
- The adjusted significance level is calculated by dividing the desired familywise error rate (e.g., 0.05) by the number of comparisons
- The Bonferroni correction is simple to implement but can be overly conservative, especially when the number of comparisons is large
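A minimal sketch of the correction, assuming three hypothetical groups compared with independent-samples t-tests:

```python
from itertools import combinations
from scipy import stats

# Hypothetical data: three groups to compare pairwise
groups = {
    "g1": [4.0, 5.0, 6.0, 5.5, 4.5],
    "g2": [6.0, 7.0, 6.5, 7.5, 6.0],
    "g3": [8.0, 7.5, 9.0, 8.5, 8.0],
}
alpha = 0.05
pairs = list(combinations(groups, 2))
alpha_adj = alpha / len(pairs)   # Bonferroni-adjusted per-comparison level

# Run each pairwise t-test and flag significance at the adjusted level
results = {}
for name1, name2 in pairs:
    p = stats.ttest_ind(groups[name1], groups[name2]).pvalue
    results[(name1, name2)] = (p, p < alpha_adj)
print(round(alpha_adj, 4), results)
```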
Tukey's HSD test
- Tukey's HSD test is based on the studentized range distribution and controls the familywise error rate
- The test calculates a critical value based on the number of groups, degrees of freedom for error, and the desired familywise error rate
- Tukey's HSD test is more powerful than the Bonferroni correction when the number of comparisons is large
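Recent versions of scipy (1.8 and later) ship an implementation as stats.tukey_hsd; a minimal sketch on hypothetical groups:

```python
from scipy import stats

# Hypothetical data: three independent groups
g1 = [4.0, 5.0, 6.0, 5.5, 4.5]
g2 = [6.0, 7.0, 6.5, 7.5, 6.0]
g3 = [8.0, 7.5, 9.0, 8.5, 8.0]

# All pairwise comparisons with familywise error control
res = stats.tukey_hsd(g1, g2, g3)
print(res.pvalue)   # symmetric matrix of adjusted p-values
```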
Dunnett's test
- Dunnett's test is used when comparing each treatment group to a control group
- The test calculates a critical value based on the number of treatment groups, degrees of freedom for error, and the desired familywise error rate
- Dunnett's test is more powerful than the Bonferroni correction when the focus is on comparing treatments to a control
Scheffe's test
- Scheffe's test is a conservative post-hoc test that can be used for both pairwise and complex comparisons
- The test calculates a critical value based on the number of groups, degrees of freedom for error, and the desired familywise error rate
- Scheffe's test is more conservative than Tukey's HSD test, making it less likely to detect significant differences between groups
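The Scheffé criterion can be computed from the F-distribution: a contrast is significant when its F-statistic exceeds (k - 1) times the critical F value. A quick sketch with hypothetical design sizes:

```python
from scipy import stats

# Hypothetical design: k = 4 groups, N = 40 total observations
k, N, alpha = 4, 40, 0.05
df_between, df_error = k - 1, N - k

# Scheffé critical value: compare any contrast's F-statistic to this
f_crit = stats.f.ppf(1 - alpha, df_between, df_error)
scheffe_crit = df_between * f_crit
print(round(scheffe_crit, 3))
```

Because the threshold is inflated by the factor (k - 1), Scheffé's test stays valid for any contrast, which is exactly why it is more conservative for simple pairwise comparisons.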
ANOVA vs regression
- Both ANOVA and regression are used to analyze the relationship between independent variables and a dependent variable
- ANOVA is used when the independent variables are categorical, while regression is used when the independent variables are continuous or a mix of categorical and continuous
Similarities between ANOVA and regression
- Both techniques aim to explain the variability in the dependent variable using the independent variables
- Both techniques use hypothesis testing to determine the significance of the relationships between variables
- Both techniques can include multiple independent variables (e.g., two-way ANOVA, multiple regression)
Differences between ANOVA and regression
- ANOVA is used for comparing means across groups, while regression is used for predicting the value of the dependent variable based on the independent variables
- ANOVA uses categorical independent variables, while regression uses continuous or a mix of categorical and continuous independent variables
- ANOVA results in an F-statistic and p-value, while regression results in coefficients, t-statistics, and p-values for each independent variable
Choosing between ANOVA and regression
- Use ANOVA when the research question involves comparing means across groups or conditions
- Use regression when the research question involves predicting the value of the dependent variable based on the independent variables
- Consider the nature of the independent variables (categorical or continuous) when deciding between ANOVA and regression
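The connection between the two techniques can be verified numerically: a one-way ANOVA is equivalent to regressing the dependent variable on dummy-coded group indicators, and both yield the same F-statistic. A sketch on hypothetical data:

```python
import numpy as np
from scipy import stats

# Hypothetical data: three groups, analyzed both ways
g1 = np.array([4.0, 5.0, 6.0, 5.5, 4.5])
g2 = np.array([6.0, 7.0, 6.5, 7.5, 6.0])
g3 = np.array([8.0, 7.5, 9.0, 8.5, 8.0])
y = np.concatenate([g1, g2, g3])
labels = np.repeat([0, 1, 2], 5)

# Regression view: intercept plus dummy variables for groups 1 and 2
X = np.column_stack([np.ones_like(y), labels == 1, labels == 2]).astype(float)
beta, *_ = np.linalg.lstsq(X, y, rcond=None)
sse = ((y - X @ beta) ** 2).sum()            # unexplained variance
sst = ((y - y.mean()) ** 2).sum()
df_model, df_error = 2, y.size - 3
F_reg = ((sst - sse) / df_model) / (sse / df_error)

# ANOVA view: identical F-statistic
F_anova, _ = stats.f_oneway(g1, g2, g3)
print(round(F_reg, 6), round(F_anova, 6))
```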
Repeated measures ANOVA
- Repeated measures ANOVA is used when the same participants are measured on the dependent variable under different conditions or at different time points
- The purpose of repeated measures ANOVA is to determine if there are significant differences in the means of the dependent variable across the different conditions or time points
- Repeated measures ANOVA accounts for the correlation between measurements from the same participants, reducing the unexplained variability and increasing statistical power
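With statsmodels available, a repeated measures ANOVA can be run with AnovaRM. A minimal sketch on hypothetical data, with simulated per-subject baselines standing in for the correlation between repeated measurements:

```python
import numpy as np
import pandas as pd
from statsmodels.stats.anova import AnovaRM

rng = np.random.default_rng(1)
# Hypothetical data: 10 subjects each measured under 3 conditions
n_subj = 10
df = pd.DataFrame({
    "subject": np.repeat(np.arange(n_subj), 3),
    "condition": np.tile(["t1", "t2", "t3"], n_subj),
    "score": rng.normal(size=3 * n_subj)
             + np.repeat(rng.normal(size=n_subj), 3)   # per-subject baseline
             + np.tile([0.0, 0.5, 1.0], n_subj),       # condition effect
})

# Within-subjects ANOVA: the subject factor absorbs individual differences
res = AnovaRM(df, depvar="score", subject="subject",
              within=["condition"]).fit()
print(res.anova_table)
```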
Assumptions of repeated measures ANOVA
- Normality: The dependent variable should be approximately normally distributed within each condition or time point
- Sphericity: The variances of the differences between all pairs of conditions or time points should be equal (see Sphericity in repeated measures ANOVA)
- Independence of observations: Observations between participants should be independent of each other
Benefits of repeated measures design
- Increased statistical power: By using the same participants across conditions or time points, repeated measures ANOVA reduces the unexplained variability and increases the ability to detect significant differences
- Fewer participants required: Because each participant serves as their own control, repeated measures designs require fewer participants than between-subjects designs
- Control for individual differences: Repeated measures designs control for individual differences between participants, as each participant is compared to themselves across conditions or time points
Sphericity in repeated measures ANOVA
- Sphericity is an assumption of repeated measures ANOVA that requires the variances of the differences between all pairs of conditions or time points to be equal
- Violations of sphericity can lead to an increased Type I error rate (rejecting the null hypothesis when it is true)
- Mauchly's test is used to assess the sphericity assumption, with a significant result indicating a violation of sphericity
Greenhouse-Geisser correction
- The Greenhouse-Geisser correction is a method for adjusting the degrees of freedom in repeated measures ANOVA when the sphericity assumption is violated
- The correction factor epsilon ($\epsilon$) is calculated based on the variance-covariance matrix of the repeated measures
- The adjusted degrees of freedom are calculated by multiplying the original degrees of freedom by the correction factor $\epsilon$
Huynh-Feldt correction
- The Huynh-Feldt correction is another method for adjusting the degrees of freedom in repeated measures ANOVA when the sphericity assumption is violated
- The Huynh-Feldt correction is less conservative than the Greenhouse-Geisser correction; a common rule of thumb is to prefer it when the Greenhouse-Geisser estimate of epsilon is above roughly 0.75, where the Greenhouse-Geisser correction tends to be too conservative
- The adjusted degrees of freedom are calculated by multiplying the original degrees of freedom by the Huynh-Feldt correction factor $\tilde{\epsilon}$
Multivariate ANOVA (MANOVA)
- Multivariate ANOVA (MANOVA) is an extension of ANOVA that is used when there are multiple dependent variables
- The purpose of MANOVA is to determine if there are significant differences between groups or conditions on a combination of dependent variables
- MANOVA takes into account the correlations between the dependent variables and provides a more comprehensive analysis than conducting separate ANOVAs for each dependent variable
Purpose of MANOVA
- To determine if there are significant differences between groups or conditions on a combination of dependent variables
- To control the familywise error rate when conducting multiple ANOVAs on related dependent variables
- To provide a more holistic understanding of the effects of the independent variables on the dependent variables
Assumptions of MANOVA
- Independence of observations: Observations within each group should be independent of each other
- Multivariate normality: The dependent variables should follow a multivariate normal distribution within each group
- Homogeneity of covariance matrices: The covariance matrices of the dependent variables should be equal across groups
- Absence of multicollinearity: The dependent variables should not be highly correlated with each other
Steps in MANOVA
- State the null and alternative hypotheses
- Calculate the sample covariance matrices for each group
- Calculate the pooled covariance matrix
- Calculate the test statistics (e.g., Wilks' lambda, Pillai's trace, Roy's largest root, Hotelling's trace)
- Determine the associated F-statistics and p-values
- Compare the p-values to the chosen significance level (e.g., 0.05) and make a decision to reject or fail to reject the null hypothesis
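statsmodels also provides a MANOVA implementation that reports all four test statistics covered below. A minimal sketch on hypothetical data with two correlated dependent variables:

```python
import numpy as np
import pandas as pd
from statsmodels.multivariate.manova import MANOVA

rng = np.random.default_rng(2)
# Hypothetical data: three groups of 20, two correlated dependent variables
n = 20
group = np.repeat(["g1", "g2", "g3"], n)
shift = np.repeat([0.0, 0.5, 1.0], n)
y1 = rng.normal(size=3 * n) + shift
y2 = 0.5 * y1 + rng.normal(size=3 * n)   # y2 correlated with y1
df = pd.DataFrame({"group": group, "y1": y1, "y2": y2})

# Test group differences on the combination of y1 and y2
mv = MANOVA.from_formula("y1 + y2 ~ group", data=df)
res = mv.mv_test()
print(res)
```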
Wilks' lambda
- Wilks' lambda is a test statistic used in MANOVA that measures the proportion of variance in the dependent variables that is not explained by the independent variables
- Smaller values of Wilks' lambda indicate a greater difference between groups on the combination of dependent variables
- Wilks' lambda is the most commonly reported test statistic in MANOVA
Pillai's trace
- Pillai's trace is another test statistic used in MANOVA that measures the amount of variance in the dependent variables that is explained by the independent variables
- Larger values of Pillai's trace indicate a greater difference between groups on the combination of dependent variables
- Pillai's trace is considered the most robust test statistic in MANOVA, especially when assumptions are violated
Roy's largest root
- Roy's largest root is a test statistic used in MANOVA that measures the maximum amount of variance in the dependent variables that can be explained by the independent variables
- Larger values of Roy's largest root indicate a greater difference between groups on the combination of dependent variables
- Roy's largest root is most powerful when the group differences are concentrated along a single dimension (the first discriminant function), but it is the least robust of the four test statistics to assumption violations
Hotelling's trace
- Hotelling's trace is a test statistic used in MANOVA that measures the amount of variance in the dependent variables that is explained by the independent variables, while taking into account the correlations between the dependent variables
- Larger values of Hotelling's trace indicate a greater difference between groups on the combination of dependent variables
- Hotelling's trace is a generalization of Hotelling's T-squared statistic, the multivariate analogue of the t-test
Non-parametric alternatives to ANOVA
- Non-parametric alternatives to ANOVA are used when the assumptions of ANOVA (e.g., normality, homogeneity of variances) are violated and cannot be corrected through data transformations
- Non-parametric tests are based on ranks rather than the original values of the dependent variable
- Non-parametric tests are generally less powerful than parametric tests when the assumptions are met but are more robust to violations of assumptions
Kruskal-Wallis test
- The Kruskal-Wallis test is a non-parametric alternative to one-way ANOVA
- The test is based on ranks and compares the distributions of the dependent variable across three or more groups; when the group distributions have similar shapes, it can be interpreted as a comparison of medians
- The null hypothesis is that all groups come from the same distribution (equivalently, that the medians are equal under the similar-shapes interpretation)
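scipy implements the test as stats.kruskal; a minimal sketch on hypothetical skewed data where ANOVA's normality assumption would be doubtful:

```python
from scipy import stats

# Hypothetical data: three independent groups
g1 = [1.2, 1.5, 0.9, 2.0, 1.1, 1.4]
g2 = [2.1, 2.8, 1.9, 3.0, 2.4, 2.2]
g3 = [3.5, 2.9, 4.1, 3.8, 3.2, 3.6]

# Rank-based test of whether the groups share a common distribution
stat, p = stats.kruskal(g1, g2, g3)
print(round(stat, 3), round(p, 4))
```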
Friedman test
- The Friedman test is a non-parametric alternative to repeated measures ANOVA
- The test is based on within-subject ranks and tests whether three or more related samples come from the same distribution (often described as a comparison of medians)
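scipy implements the test as stats.friedmanchisquare, which takes one sequence per condition with subjects aligned by position; a minimal sketch on hypothetical ratings:

```python
from scipy import stats

# Hypothetical data: 6 subjects rated under 3 related conditions
# (element i of each list belongs to the same subject)
cond1 = [7.0, 5.5, 6.0, 7.5, 6.5, 6.0]
cond2 = [8.0, 6.0, 7.0, 8.0, 7.0, 6.5]
cond3 = [9.0, 7.5, 8.0, 9.5, 8.5, 8.0]

# Ranks are computed within each subject, then compared across conditions
stat, p = stats.friedmanchisquare(cond1, cond2, cond3)
print(round(stat, 3), round(p, 4))
```

In this hypothetical example every subject ranks the conditions in the same order, so the test statistic reaches its maximum for n = 6 subjects and k = 3 conditions.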