Fiveable

๐Ÿ›Biostatistics Unit 11 Review

QR code for Biostatistics practice questions

11.3 Spearman's rank correlation and Kendall's tau

๐Ÿ›Biostatistics
Unit 11 Review

11.3 Spearman's rank correlation and Kendall's tau

Written by the Fiveable Content Team โ€ข Last updated September 2025
Written by the Fiveable Content Team โ€ข Last updated September 2025
๐Ÿ›Biostatistics
Unit & Topic Study Guides

Spearman's rank correlation and Kendall's tau are essential tools for analyzing relationships in biological data. These methods work well with non-normal data and outliers, making them perfect for gene expression studies or disease severity ratings.

Both measure monotonic relationships, ranging from -1 to +1. Spearman's uses rank differences, while Kendall's focuses on concordant and discordant pairs. Choose based on your data's characteristics and research goals for the best results.

Nonparametric Association Measures

Principles and Assumptions

  • Assess the monotonic relationship between two variables, meaning as one variable increases or decreases, the other variable also increases or decreases, but not necessarily in a linear fashion
  • Suitable for non-normally distributed data or data with outliers (gene expression data) as they do not require the assumption of normality
  • The null hypothesis states that there is no monotonic relationship between the two variables, while the alternative hypothesis suggests the presence of a monotonic relationship
  • Commonly used when the data violate the assumptions of parametric tests or when the variables are measured on an ordinal scale (disease severity ratings)

Spearman's Rank Correlation

Calculation and Interpretation

  • Measures the strength and direction of the monotonic relationship between two variables, denoted as ฯ or rs
  • Rank the data for each variable separately, assigning the smallest value a rank of 1, the second smallest a rank of 2, and so on. Tied values receive the average of the ranks they would have received if they were not tied
  • Calculate using the formula: ฯ = 1 - (6 * ฮฃdยฒ) / (n * (nยฒ - 1)), where d is the difference between the ranks of corresponding values, and n is the number of pairs of values
  • Ranges from -1 to +1, with -1 indicating a perfect negative monotonic relationship, +1 indicating a perfect positive monotonic relationship, and 0 indicating no monotonic relationship

Significance Testing

  • Assess the statistical significance of Spearman's rank correlation using a hypothesis test or by constructing a confidence interval
  • The null hypothesis states that there is no monotonic relationship between the two variables, while the alternative hypothesis suggests the presence of a monotonic relationship
  • A p-value less than the chosen significance level (usually 0.05) indicates a statistically significant monotonic relationship between the variables
  • Confidence intervals provide a range of plausible values for the true population correlation coefficient

Kendall's Tau

Calculation and Interpretation

  • Another nonparametric measure of the strength and direction of the monotonic relationship between two variables, denoted as ฯ„
  • Compare each pair of observations and determine whether they are concordant (both values in one observation are higher or lower than the corresponding values in the other observation) or discordant (one value is higher, and the other is lower)
  • Calculate using the formula: ฯ„ = (C - D) / (n (n - 1) / 2), where C is the number of concordant pairs, D is the number of discordant pairs, and n is the number of pairs of values
  • Ranges from -1 to +1, with -1 indicating a perfect negative monotonic relationship, +1 indicating a perfect positive monotonic relationship, and 0 indicating no monotonic relationship
  • Represents the difference between the probability of observing concordant pairs and the probability of observing discordant pairs, providing a more straightforward interpretation than Spearman's rank correlation

Significance Testing

  • Assess the statistical significance of Kendall's tau using a hypothesis test or by constructing a confidence interval
  • The null hypothesis states that there is no monotonic relationship between the two variables, while the alternative hypothesis suggests the presence of a monotonic relationship
  • A p-value less than the chosen significance level (usually 0.05) indicates a statistically significant monotonic relationship between the variables
  • Confidence intervals provide a range of plausible values for the true population correlation coefficient

Spearman's vs Kendall's Tau

Similarities

  • Both are nonparametric measures of the monotonic relationship between two variables, suitable for non-normally distributed data or data with outliers
  • Both range from -1 to +1, with -1 indicating a perfect negative monotonic relationship, +1 indicating a perfect positive monotonic relationship, and 0 indicating no monotonic relationship
  • Both can be used to assess the statistical significance of the monotonic relationship using hypothesis tests or confidence intervals

Differences

  • Spearman's rank correlation is based on the differences between the ranks of the values, while Kendall's tau is based on the concordance and discordance of pairs of observations
  • Kendall's tau has a more intuitive interpretation than Spearman's rank correlation, representing the difference between the probability of observing concordant pairs and the probability of observing discordant pairs
  • Spearman's rank correlation is more sensitive to outliers than Kendall's tau, as it considers the magnitude of the differences between ranks
  • Kendall's tau is generally more robust and efficient than Spearman's rank correlation, especially for small sample sizes or in the presence of tied values (repeated measurements)

Choosing Between Spearman's and Kendall's Tau

  • The choice between Spearman's rank correlation and Kendall's tau depends on the specific characteristics of the data, the research question, and the desired interpretation of the results
  • When the sample size is large, and there are few tied values, Spearman's rank correlation and Kendall's tau tend to produce similar results
  • In the context of biological data analysis, consider the nature of the variables, the presence of outliers, and the desired robustness and efficiency of the measure when selecting between Spearman's rank correlation and Kendall's tau