📊 Causal Inference Unit 5 Review

5.2 Matching methods

Written by the Fiveable Content Team • Last updated September 2025

Matching methods are powerful tools for estimating causal effects in observational studies. They aim to create balanced treatment and control groups by pairing similar units based on observed characteristics, mimicking randomized experiments.

These techniques rely on key assumptions like SUTVA, unconfoundedness, and positivity. Various algorithms, from greedy to optimal matching, can be used. Assessing balance and estimating treatment effects are crucial steps in the matching process.

Overview of matching methods

  • Matching methods are a set of techniques used in causal inference to estimate treatment effects by creating balanced treatment and control groups
  • These methods aim to mimic the randomization process in observational studies by matching treated and control units on observed covariates
  • Matching methods can be used to estimate various causal estimands, such as the average treatment effect (ATE) and the average treatment effect on the treated (ATT)

Assumptions for matching

Stable unit treatment value assumption (SUTVA)

  • SUTVA assumes that the potential outcomes for each unit are unaffected by the treatment assignment of other units
  • This assumption rules out interference between units and ensures that there are no hidden versions of the treatment
  • Violations of SUTVA can lead to biased estimates of treatment effects

Unconfoundedness assumption

  • The unconfoundedness assumption, also known as the ignorability assumption, states that treatment assignment is independent of potential outcomes given the observed covariates
  • This assumption implies that there are no unobserved confounders that influence both treatment assignment and outcomes
  • Meeting the unconfoundedness assumption is crucial for the validity of matching methods

Positivity assumption

  • The positivity assumption requires that each unit has a non-zero probability of receiving each level of the treatment given their observed covariates
  • This assumption ensures that there is sufficient overlap between the treatment and control groups in terms of their covariate distributions
  • Violations of the positivity assumption can lead to extrapolation and unreliable estimates of treatment effects
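In standard potential-outcomes notation (with treatment $T_i$, covariates $X_i$, and potential outcomes $Y_i(0)$ and $Y_i(1)$), the three assumptions above can be stated compactly:

```latex
\begin{aligned}
&\text{SUTVA:} && Y_i = Y_i(T_i) \quad \text{(no interference, one version of treatment)} \\
&\text{Unconfoundedness:} && \{Y_i(0),\, Y_i(1)\} \perp T_i \mid X_i \\
&\text{Positivity:} && 0 < \Pr(T_i = 1 \mid X_i = x) < 1 \quad \text{for all } x
\end{aligned}
```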

Matching algorithms

Greedy matching

  • Greedy matching is a simple and fast algorithm that matches each treated unit to the nearest control unit based on a distance metric (Mahalanobis distance)
  • This algorithm proceeds sequentially, making the best match for each treated unit without considering the overall balance of the matched sample
  • Greedy matching can be implemented with or without replacement of control units
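A minimal sketch of greedy matching without replacement. For brevity it uses plain Euclidean distance on covariates assumed to be pre-standardized, rather than the Mahalanobis distance mentioned above; the data are toy values for illustration:

```python
import math

def greedy_match(treated, controls):
    """Match each treated unit to its nearest unmatched control.

    treated, controls: lists of covariate vectors (lists of floats).
    Returns a list of (treated_index, control_index) pairs.
    """
    available = set(range(len(controls)))
    pairs = []
    for i, t in enumerate(treated):
        if not available:
            break  # no controls left to match
        # Pick the nearest still-available control by Euclidean distance
        j = min(available, key=lambda k: math.dist(t, controls[k]))
        pairs.append((i, j))
        available.remove(j)  # matching without replacement
    return pairs

# Toy example: one standardized covariate per unit
treated = [[0.1], [1.5]]
controls = [[1.4], [0.0], [3.0]]
print(greedy_match(treated, controls))  # [(0, 1), (1, 0)]
```

Because the algorithm commits to each match sequentially, a different ordering of treated units can produce a different matched sample.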

Optimal matching

  • Optimal matching aims to minimize the total distance between matched pairs while ensuring that each treated unit is matched to a control unit
  • This algorithm uses optimization techniques, such as network flow algorithms, to find the globally optimal match
  • Optimal matching can be computationally intensive for large datasets but often results in better balance than greedy matching
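Optimal pair matching can be framed as an assignment problem; a sketch using scipy's Hungarian-algorithm solver to minimize the total matched distance (toy data, one standardized covariate):

```python
import numpy as np
from scipy.optimize import linear_sum_assignment

# Rows: treated units; columns: control units
treated = np.array([[0.1], [1.5]])
controls = np.array([[1.4], [0.0], [3.0]])

# Pairwise distance matrix between every treated and control unit
cost = np.linalg.norm(treated[:, None, :] - controls[None, :, :], axis=2)

# Find the assignment that minimizes the total matched distance
rows, cols = linear_sum_assignment(cost)
print(list(zip(rows.tolist(), cols.tolist())))
print(cost[rows, cols].sum())  # total distance of the optimal match
```

Unlike greedy matching, this jointly optimizes over all pairs, so an early match is never locked in at the expense of later ones.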

Genetic matching

  • Genetic matching is an adaptive algorithm that uses a genetic search algorithm to find the optimal weight for each covariate in the distance metric
  • This method iteratively improves the balance of the matched sample by modifying the distance metric based on the balance achieved in previous iterations
  • Genetic matching can automatically select the most important covariates for achieving balance

Coarsened exact matching (CEM)

  • CEM involves coarsening the continuous covariates into discrete categories and then performing exact matching on the coarsened data
  • This method ensures that matched units have identical covariate values within each coarsened category
  • CEM can be useful when there are many covariates or when exact matching on the original scale is infeasible
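The coarsen-then-exact-match idea can be sketched in a few lines; the bin widths and toy units below are illustrative choices, not part of the method itself:

```python
from collections import defaultdict

def coarsen(x, width):
    """Coarsen a continuous value into a bin index of the given width."""
    return int(x // width)

def cem(units, widths):
    """Exact-match treated and control units on coarsened covariates.

    units: list of (treatment_indicator, covariate_vector) tuples.
    widths: one bin width per covariate.
    Returns the strata that contain both treated and control units.
    """
    strata = defaultdict(lambda: {"treated": [], "control": []})
    for idx, (t, x) in enumerate(units):
        key = tuple(coarsen(v, w) for v, w in zip(x, widths))
        strata[key]["treated" if t else "control"].append(idx)
    # Units in strata lacking one of the groups are discarded
    return {k: v for k, v in strata.items()
            if v["treated"] and v["control"]}

# Toy example: covariates are (age, income); bin age by 10, income by 20000
units = [(1, (34, 41000)), (0, (37, 45000)),
         (1, (52, 90000)), (0, (23, 30000))]
print(cem(units, widths=(10, 20000)))
```

Here only the first two units share a coarsened stratum, so the other two are pruned from the matched sample.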

Assessing balance after matching

Standardized mean differences

  • Standardized mean differences (SMDs) measure the difference in means between the treatment and control groups for each covariate, divided by the pooled standard deviation
  • SMDs are commonly used to assess balance, with values close to zero indicating good balance
  • A common rule of thumb is that SMDs should be less than 0.1 for the matched sample to be considered well-balanced

Variance ratios

  • Variance ratios compare the variances of each covariate between the treatment and control groups
  • Ideally, the variance ratios should be close to one, indicating that the spread of the covariate is similar in both groups
  • Large deviations from one suggest imbalance and may require further adjustment or a different matching algorithm
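Both diagnostics above (SMDs and variance ratios) take only a few lines to compute; the covariate values here are toy numbers for illustration:

```python
import math
from statistics import mean, variance

def smd(treated, control):
    """Standardized mean difference using the pooled standard deviation."""
    pooled_sd = math.sqrt((variance(treated) + variance(control)) / 2)
    return (mean(treated) - mean(control)) / pooled_sd

def variance_ratio(treated, control):
    """Ratio of sample variances; values near 1 suggest similar spread."""
    return variance(treated) / variance(control)

age_t = [34, 36, 40, 46]
age_c = [33, 37, 41, 44]
print(round(smd(age_t, age_c), 3))            # well under the 0.1 rule of thumb
print(round(variance_ratio(age_t, age_c), 3))  # close to 1
```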

Kolmogorov-Smirnov tests

  • Kolmogorov-Smirnov (KS) tests compare the empirical cumulative distribution functions (ECDFs) of each covariate between the treatment and control groups
  • A significant KS test suggests that the distributions of the covariate differ between the groups, indicating imbalance
  • KS tests can detect differences in the shape of the distributions that may not be captured by SMDs or variance ratios
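A two-sample KS test on a single covariate is available directly in scipy; the income values below are toy data:

```python
from scipy.stats import ks_2samp

# Toy covariate values for matched treated and control groups
income_t = [41, 45, 52, 58, 63, 70]
income_c = [40, 46, 51, 59, 64, 69]

# The KS statistic is the largest gap between the two ECDFs,
# so it is sensitive to shape differences, not just means and variances
result = ks_2samp(income_t, income_c)
print(result.statistic, result.pvalue)
```

A large p-value here would be consistent with balance on this covariate, though with small matched samples the test has limited power.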

Graphical diagnostics

  • Visual inspection of covariate balance can be done using various plots, such as histograms, density plots, or QQ plots
  • These plots allow for a more intuitive assessment of balance and can help identify any remaining differences between the groups
  • Examples of graphical diagnostics include side-by-side boxplots, overlaid density plots, and love plots (a grid of SMDs before and after matching)

Estimating treatment effects

Average treatment effect (ATE)

  • The ATE is the expected difference between a unit's potential outcomes under treatment and under control, averaged over the entire population
  • Estimating the ATE requires that the matched sample is representative of the overall population
  • The ATE can be estimated by taking the difference in mean outcomes between the matched treatment and control groups

Average treatment effect on the treated (ATT)

  • The ATT is the expected difference between potential outcomes under treatment and under control, averaged only over the units that received the treatment
  • Estimating the ATT is often of interest when the treatment is not randomly assigned and there is selection into treatment based on observed covariates
  • The ATT can be estimated by taking the difference in mean outcomes between the treated units and their matched controls
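With one-to-one matching, the ATT estimate above reduces to a mean of within-pair outcome differences; a sketch with toy outcomes:

```python
def att_from_pairs(outcomes_treated, outcomes_control, pairs):
    """Estimate the ATT as the mean outcome difference over matched pairs.

    pairs: list of (treated_index, control_index) matches.
    """
    diffs = [outcomes_treated[i] - outcomes_control[j] for i, j in pairs]
    return sum(diffs) / len(diffs)

# Toy outcomes for three treated units and their matched controls
y_t = [10.0, 12.0, 9.0]
y_c = [8.0, 10.0, 7.0]
pairs = [(0, 0), (1, 1), (2, 2)]
print(att_from_pairs(y_t, y_c, pairs))  # 2.0
```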

Subgroup analysis

  • Subgroup analysis involves estimating treatment effects for specific subpopulations defined by one or more covariates (age groups)
  • This analysis can help identify heterogeneous treatment effects and provide insights into which subgroups benefit most from the treatment
  • When conducting subgroup analysis, it is important to pre-specify the subgroups of interest and adjust for multiple testing to avoid false positives

Advantages of matching methods

Intuitive approach

  • Matching methods mimic the randomization process by creating balanced treatment and control groups, making the causal inference more intuitive and easier to communicate
  • The matched sample can be thought of as a "virtual RCT," where the only remaining difference between the groups is the treatment itself
  • This intuitive approach can be particularly appealing to non-technical audiences and decision-makers

Transparency in matched samples

  • Matching methods provide a clear and transparent way to create balanced treatment and control groups
  • The matched sample can be easily inspected and compared to the original sample to assess the quality of the matching and the resulting balance
  • This transparency allows for greater scrutiny of the causal inference process and can increase confidence in the results

Flexibility in estimands

  • Matching methods can be used to estimate various causal estimands, such as the ATE and ATT, depending on the research question and the population of interest
  • The choice of estimand can be tailored to the specific context and the available data
  • This flexibility allows researchers to answer a wide range of causal questions using the same matching framework

Limitations of matching methods

Curse of dimensionality

  • As the number of covariates used for matching increases, the difficulty of finding good matches grows exponentially
  • This phenomenon, known as the curse of dimensionality, can lead to poor balance and reduced sample size when matching on a large number of covariates
  • Techniques such as coarsened exact matching (CEM) and propensity score matching can help mitigate this issue by reducing the dimensionality of the matching problem

Reduced sample size

  • Matching methods often result in a reduced sample size, as some units may not have suitable matches and are therefore discarded
  • This reduction in sample size can lead to decreased statistical power and wider confidence intervals for the estimated treatment effects
  • The impact of reduced sample size should be considered when interpreting the results and planning the study design

Sensitivity to matching algorithm choice

  • Different matching algorithms can produce different matched samples and, consequently, different estimates of treatment effects
  • The choice of matching algorithm, distance metric, and tuning parameters can have a substantial impact on the results
  • It is important to assess the sensitivity of the results to these choices and to justify the selected matching approach based on the research question and the characteristics of the data

Matching vs other methods

Matching vs regression adjustment

  • Regression adjustment involves controlling for observed covariates in a regression model to estimate treatment effects
  • Matching can be seen as a nonparametric preprocessing step that creates a balanced sample, which can then be analyzed using regression adjustment
  • Matching followed by regression adjustment can help ensure that the regression model is not extrapolating beyond the observed data and can improve the robustness of the estimates
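The matching-then-regression combination can be sketched with an ordinary least-squares fit on the matched sample. The toy data below are generated so that the true treatment effect is exactly 3:

```python
import numpy as np

# Matched sample (toy data): treatment indicator t and one covariate x;
# the outcome is constructed as y = 2 + 3*t + x, so the true effect is 3
t = np.array([1, 0, 1, 0, 1, 0])
x = np.array([1.2, 1.1, 2.0, 2.1, 0.5, 0.4])
y = 2.0 + 3.0 * t + x

# Regression adjustment on the matched sample: y ~ intercept + t + x
X = np.column_stack([np.ones_like(x), t, x])
coef, *_ = np.linalg.lstsq(X, y, rcond=None)
print(coef[1])  # coefficient on the treatment indicator, ~3.0
```

Because matching has already balanced the covariates, the regression here cleans up small residual differences rather than extrapolating across poorly overlapping groups.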

Matching vs propensity score methods

  • Propensity score methods involve estimating the probability of treatment assignment given the observed covariates and using these probabilities for matching, stratification, or weighting
  • Matching can be performed directly on the propensity scores, which reduces the dimensionality of the matching problem
  • Propensity score methods can be more efficient than matching on the original covariates when there are many covariates, but they rely on the correct specification of the propensity score model

Matching vs inverse probability weighting

  • Inverse probability weighting (IPW) involves weighting each unit by the inverse of its probability of receiving the observed treatment, given the covariates
  • IPW can be used to estimate the ATE by creating a pseudo-population in which treatment assignment is independent of the covariates
  • Matching can be more robust to model misspecification than IPW, but IPW can be more efficient when the propensity score model is correctly specified and there is good overlap between the treatment and control groups
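For contrast with matching, a sketch of the Horvitz-Thompson-style IPW estimator of the ATE; the propensity scores are hard-coded toy values that would normally come from a fitted model:

```python
def ipw_ate(y, t, ps):
    """Inverse probability weighting estimate of the ATE.

    y: outcomes; t: 0/1 treatment indicators; ps: propensity scores P(T=1|X).
    """
    n = len(y)
    # Treated units are weighted by 1/ps, controls by 1/(1-ps)
    treated_term = sum(ti * yi / p for yi, ti, p in zip(y, t, ps)) / n
    control_term = sum((1 - ti) * yi / (1 - p) for yi, ti, p in zip(y, t, ps)) / n
    return treated_term - control_term

# Toy data: four units, two treated
y = [5.0, 7.0, 6.0, 9.0]
t = [0, 1, 0, 1]
ps = [0.4, 0.6, 0.5, 0.5]
print(ipw_ate(y, t, ps))
```

Propensity scores near 0 or 1 produce extreme weights, which is why poor overlap makes IPW unstable in exactly the settings where the positivity assumption is strained.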

Sensitivity analysis

Rosenbaum bounds

  • Rosenbaum bounds are a method for assessing the sensitivity of the estimated treatment effects to unobserved confounding
  • This approach calculates the magnitude of hidden bias that would be necessary to explain away the observed treatment effect
  • A study is considered sensitive to unobserved confounding if a relatively small amount of hidden bias could alter the conclusions

Simulation-based sensitivity analysis

  • Simulation-based sensitivity analysis involves simulating the impact of unobserved confounders on the estimated treatment effects
  • This approach requires specifying the distribution and strength of the unobserved confounder and its relationship with the treatment and outcome
  • By varying the characteristics of the simulated confounder, researchers can assess the robustness of the results to different scenarios of unobserved confounding
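A bare-bones version of this simulation idea: generate data with a known treatment effect of 1, add a binary unobserved confounder U whose strength is controlled by two hypothetical parameters (gamma for its effect on treatment, delta for its effect on the outcome), and watch the naive estimate drift:

```python
import random

def naive_estimate(gamma, delta, n=20000, seed=0):
    """Simulate a binary unobserved confounder U that shifts the treatment
    probability by gamma and the outcome by delta; the true effect is 1."""
    rng = random.Random(seed)
    ys_t, ys_c = [], []
    for _ in range(n):
        u = rng.random() < 0.5              # unobserved confounder
        treated = rng.random() < 0.3 + gamma * u
        y = 1.0 * treated + delta * u + rng.gauss(0, 1)
        (ys_t if treated else ys_c).append(y)
    # Naive difference in means, ignoring U
    return sum(ys_t) / len(ys_t) - sum(ys_c) / len(ys_c)

# Stronger confounding pulls the naive estimate away from the true effect of 1
print(naive_estimate(gamma=0.0, delta=2.0))  # ~1: U affects Y but not T
print(naive_estimate(gamma=0.4, delta=2.0))  # biased upward
```

Sweeping gamma and delta over a grid shows how strong an unobserved confounder would have to be before the estimated effect could be explained away.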

Software for matching

R packages for matching

  • Several R packages are available for implementing matching methods, such as MatchIt, Matching, and optmatch
  • These packages provide a wide range of matching algorithms, diagnostic tools, and functions for estimating treatment effects
  • The MatchIt package is particularly popular due to its ease of use and compatibility with other causal inference packages, such as cobalt for balance assessment

Stata commands for matching

  • Stata offers several commands for matching, including teffects nnmatch for nearest-neighbor matching and teffects psmatch for propensity score matching
  • The kmatch command is a versatile tool that implements various matching algorithms, such as Mahalanobis distance matching and coarsened exact matching
  • Stata's tebalance command provides a range of diagnostic tools for assessing balance before and after matching

Python libraries for matching

  • Python has a growing ecosystem of libraries for causal inference, including matching methods
  • The CausalInference library provides a simple interface for propensity score matching and balance assessment
  • The causalml library offers a range of matching algorithms, including nearest-neighbor matching, optimal matching, and coarsened exact matching, along with tools for estimating treatment effects and conducting sensitivity analyses