Fiveable

๐Ÿ›Biostatistics Unit 10 Review

QR code for Biostatistics practice questions

10.2 Kaplan-Meier estimation and log-rank test

๐Ÿ›Biostatistics
Unit 10 Review

10.2 Kaplan-Meier estimation and log-rank test

Written by the Fiveable Content Team โ€ข Last updated September 2025
Written by the Fiveable Content Team โ€ข Last updated September 2025
๐Ÿ›Biostatistics
Unit & Topic Study Guides

Kaplan-Meier estimation is a key method in survival analysis, helping researchers understand how long subjects survive in a study. It handles censored data, where some subjects don't experience the event of interest during the study period.

The log-rank test compares survival curves between groups, determining if differences are statistically significant. This test is crucial for assessing treatment effects or comparing survival rates across different populations in biological studies.

Kaplan-Meier Survival Analysis

Estimating Survival Probabilities

  • The Kaplan-Meier method is a non-parametric statistical technique used to estimate survival probabilities over time in the presence of censored data
    • Censoring occurs when the exact survival time of an individual is unknown
      • Right-censoring: individual has not experienced the event of interest by the end of the study
      • Left-censoring: individual was lost to follow-up during the study period
  • The Kaplan-Meier estimator calculates the probability of survival at each distinct event time
    • Considers the number of individuals at risk and the number of events that occurred at that time
    • Survival probability at a given time point is the product of the conditional probabilities of surviving each preceding time interval
  • The Kaplan-Meier estimator assumes that censoring is non-informative
    • Censoring mechanism is independent of the survival time

Interpreting Survival Probabilities

  • The survival probability estimates the likelihood of an individual surviving beyond a specific time point
    • For example, a survival probability of 0.8 at 12 months indicates an 80% chance of surviving beyond 12 months
  • Survival probabilities can be used to estimate the median survival time
    • The median survival time is the time point at which the estimated survival probability reaches 0.5
    • Represents the time by which 50% of the individuals are expected to have experienced the event of interest
  • Confidence intervals can be calculated for the survival probability estimates
    • Provide a measure of the uncertainty associated with the estimates
    • Wider confidence intervals indicate greater uncertainty in the survival probability estimates

Survival Curves for Groups

Constructing Survival Curves

  • Kaplan-Meier survival curves are graphical representations of the estimated survival probabilities over time for one or more groups
    • Y-axis represents the estimated survival probability
    • X-axis represents the time since the start of the study or a specific event
  • Each step in the survival curve corresponds to an event time
    • Vertical drop in the curve represents the change in the estimated survival probability at that time
  • Censored observations are typically marked on the survival curve using symbols
    • Tick marks or circles distinguish censored observations from actual event times

Comparing Survival Curves

  • When comparing survival curves for different groups, a larger separation between the curves indicates a more substantial difference in survival probabilities between the groups
    • For example, if the survival curve for treatment group A is consistently higher than the survival curve for treatment group B, it suggests that individuals in group A have a higher probability of survival over time
  • The median survival time for each group can be determined from the survival curve
    • Identify the time point at which the estimated survival probability reaches 0.5
    • Allows for a direct comparison of the median survival times between groups
  • Statistical tests, such as the log-rank test, can be used to formally compare the survival distributions between groups
    • Assess whether the observed differences in survival curves are statistically significant

Log-rank Test for Comparisons

Hypothesis Testing

  • The log-rank test is a non-parametric hypothesis test used to compare the survival distributions of two or more groups
    • Null hypothesis: there is no difference in the survival distributions between the groups being compared
    • Alternative hypothesis: there is a difference in the survival distributions between the groups
  • The test statistic for the log-rank test is calculated based on the observed and expected number of events in each group at each distinct event time
    • Expected number of events is determined using the pooled estimate of the hazard rate, assuming that the null hypothesis is true
  • The log-rank test follows a chi-square distribution with degrees of freedom equal to the number of groups minus one
    • A significant log-rank test result indicates a statistically significant difference in the survival distributions between the groups

Interpreting Log-rank Test Results

  • P-value: probability of observing the test statistic or a more extreme value under the null hypothesis
    • A small p-value (typically < 0.05) suggests strong evidence against the null hypothesis and in favor of the alternative hypothesis
    • For example, if the p-value is 0.01, there is a 1% chance of observing the difference in survival distributions between the groups if the null hypothesis were true
  • Hazard ratio: ratio of the hazard rates between two groups
    • Hazard rate is the instantaneous risk of experiencing the event of interest at a given time point
    • A hazard ratio of 1 indicates no difference in hazard rates between the groups
    • A hazard ratio > 1 indicates a higher hazard rate in one group compared to the other
    • A hazard ratio < 1 indicates a lower hazard rate in one group compared to the other

Assumptions of Kaplan-Meier and Log-rank Test

Non-informative Censoring

  • The Kaplan-Meier method assumes that censoring is non-informative
    • Censoring mechanism is independent of the survival time
    • Survival probabilities of censored individuals are similar to those of individuals who remain under observation
  • Violation of the non-informative censoring assumption can lead to biased estimates of survival probabilities
    • For example, if individuals with poor prognosis are more likely to be censored, the survival probabilities may be overestimated

Proportional Hazards Assumption

  • The log-rank test assumes that the hazard ratio between the groups remains constant over time (proportional hazards assumption)
    • The relative risk of experiencing the event of interest remains constant throughout the study period
  • Violation of the proportional hazards assumption can lead to incorrect conclusions about the difference in survival distributions
    • For example, if the hazard ratio between the groups changes over time, the log-rank test may not accurately capture the overall difference in survival distributions

Limitations and Considerations

  • The Kaplan-Meier estimator may be unstable when the number of individuals at risk becomes small, particularly at later time points
    • Confidence intervals for survival probabilities may be wide when the number of individuals at risk is low
  • The log-rank test may have limited power to detect differences in survival distributions when the sample size is small or when the difference between the groups is not constant over time
    • Alternative tests, such as the Wilcoxon test or the Peto-Peto test, may be more appropriate in these situations
  • Both the Kaplan-Meier method and the log-rank test do not account for the potential confounding effects of other variables on the survival outcomes
    • Adjusting for confounders may require the use of more advanced survival analysis techniques, such as Cox proportional hazards regression
    • Stratification or multivariate modeling can be used to control for confounding variables and assess their impact on survival outcomes