🐛Biostatistics Unit 10 Review

10.1 Survival function and hazard rate concepts

🐛Biostatistics
Unit 10 Review

10.1 Survival function and hazard rate concepts

Written by the Fiveable Content Team • Last updated September 2025

🐛Biostatistics

Unit & Topic Study Guides

10.1 Survival function and hazard rate concepts

10.2 Kaplan-Meier estimation and log-rank test

10.3 Cox proportional hazards model

Survival function and hazard rate are key concepts in survival analysis. They help us understand how long individuals survive and their risk of experiencing an event over time. These tools are crucial for analyzing data in medical research, epidemiology, and other biological studies.

By examining survival functions and hazard rates, researchers can compare different groups, assess treatment effectiveness, and identify risk factors. These concepts form the foundation for more advanced survival analysis techniques, enabling scientists to draw meaningful conclusions from time-to-event data in various biological contexts.

Survival function and its properties

Definition and notation

The survival function, denoted as $S(t)$, represents the probability that an individual survives beyond a specific time point $t$ in a given study
$S(0) = 1$ indicates that all individuals are alive at the beginning of the study
$S(∞) = 0$ indicates that no individual survives indefinitely

Estimation methods

The survival function can be estimated using various methods, such as:
- Kaplan-Meier estimator
- Life-table method
The choice of estimation method depends on the nature of the data and the assumptions made

Median survival time

The median survival time is the time point at which the survival function equals 0.5
This indicates that half of the individuals in the study have experienced the event of interest

Properties

The survival function is a non-increasing function of time
It starts at 1 (all individuals alive) and decreases towards 0 (no individual survives indefinitely)
The rate of decrease in the survival function reflects the hazard rate (instantaneous risk) at each time point

Hazard rate and its relationship to survival function

Definition and notation

The hazard rate, also known as the hazard function or instantaneous failure rate, is the instantaneous probability of experiencing an event at a specific time point $t$, given that the individual has survived up to that time point
The hazard rate is denoted as $h(t)$

Relationship to survival function

The hazard rate is related to the survival function through the equation: $h(t) = -\frac{d}{dt} [\log(S(t))]$
This relationship allows for the estimation of the hazard rate from the survival function and vice versa

Types of hazard rates

The hazard rate can be:
- Increasing over time (e.g., aging processes)
- Decreasing over time (e.g., infant mortality)
- Constant over time (e.g., random events)
The shape of the hazard rate reflects the underlying biological processes and the nature of the event being studied

Cumulative hazard function

The cumulative hazard function, denoted as $H(t)$, is the integral of the hazard rate over time
It represents the accumulated risk of experiencing the event up to time $t$
$H(t)$ is related to the survival function through the equation: $S(t) = \exp(-H(t))$

Survival curves and their implications

Graphical representation

Survival curves are graphical representations of the survival function, plotting the estimated probability of survival against time
The y-axis represents the survival probability, and the x-axis represents time

Interpreting survival curves

Steep drops in the survival curve indicate time periods with a higher incidence of the event of interest (e.g., high mortality after surgery)
Plateaus in the survival curve suggest periods of lower risk (e.g., stable survival after successful treatment)

Comparing survival curves

Comparing survival curves between different groups (e.g., treatment vs. control) allows researchers to assess the effectiveness of interventions or the impact of risk factors on the event of interest
The log-rank test is commonly used to compare survival curves between groups, testing the null hypothesis that there is no difference in survival between the groups

Confidence intervals

Confidence intervals can be constructed around the survival curves to quantify the uncertainty in the estimates
These intervals facilitate comparisons between groups and help assess the precision of the survival estimates

Applications in biological research

Survival curves are widely used in cancer research to compare treatment options and assess prognostic factors
In epidemiological studies, survival curves help identify risk factors and evaluate public health interventions
Survival analysis is also applied in reliability engineering and failure time analysis of biological systems (e.g., mechanical heart valves)

Censoring mechanisms and their impact on survival analysis

Types of censoring

Right-censoring: occurs when an individual is still alive or has not experienced the event of interest by the end of the study
- Most common type of censoring in survival analysis
- Examples: study ends before all participants experience the event, or participants are lost to follow-up
Left-truncation: occurs when individuals are not included in the study until they have already been at risk for the event of interest for some time
- Example: studying time to disease recurrence, but only including patients who have survived an initial treatment

Informative vs. non-informative censoring

Censoring mechanisms can be informative or non-informative, depending on whether the censoring is related to the event of interest or not
Informative censoring: the reason for censoring is related to the event of interest (e.g., patients with more severe conditions are more likely to be lost to follow-up)
Non-informative censoring: the reason for censoring is unrelated to the event of interest (e.g., study ends at a predetermined time point)

Accounting for censoring in survival analysis

Survival analysis methods, such as the Kaplan-Meier estimator and Cox proportional hazards model, account for censoring by incorporating information from both censored and uncensored individuals in the estimation process
These methods use the available information from censored individuals up to the time of censoring and then remove them from the risk set

Impact of ignoring or mishandling censoring

Ignoring censoring or incorrectly handling censored observations can lead to biased estimates of the survival function and hazard rate
Biased estimates may affect the validity of the study conclusions and lead to incorrect interpretations of the data
Example: ignoring right-censoring may overestimate the survival probabilities, as the analysis would not account for the fact that some individuals may experience the event after the end of the study

🐛Biostatistics Unit 10 Review

10.1 Survival function and hazard rate concepts

🐛Biostatistics Unit 10 Review

10.1 Survival function and hazard rate concepts

Unit & Topic Study Guides

Survival function and its properties

Definition and notation

Estimation methods

Median survival time

Properties

Hazard rate and its relationship to survival function

Definition and notation

Relationship to survival function

Types of hazard rates

Cumulative hazard function

Survival curves and their implications

Graphical representation

Interpreting survival curves

Comparing survival curves

Confidence intervals

Applications in biological research

Censoring mechanisms and their impact on survival analysis

Types of censoring

Informative vs. non-informative censoring

Accounting for censoring in survival analysis

Impact of ignoring or mishandling censoring

history

social science

english & capstone

arts

science

math & computer science

world languages

high school exams

honors classes

college classes

🐛Biostatistics
Unit 10 Review