Fiveable

๐ŸŽฒIntro to Probability Unit 7 Review

QR code for Intro to Probability practice questions

7.2 Properties of variance

๐ŸŽฒIntro to Probability
Unit 7 Review

7.2 Properties of variance

Written by the Fiveable Content Team โ€ข Last updated September 2025
Written by the Fiveable Content Team โ€ข Last updated September 2025
๐ŸŽฒIntro to Probability
Unit & Topic Study Guides

Variance and standard deviation are key measures in probability, helping us understand how data spreads out from the average. These tools are crucial for assessing risk, quality control, and making statistical inferences across various fields.

Calculating variance involves squaring deviations from the mean, while standard deviation is the square root of variance. For both discrete and continuous variables, we'll explore formulas, examples, and important properties that make these concepts fundamental in probability theory.

Variance and Standard Deviation

Definition and Basic Properties

  • Variance measures variability in a random variable quantifying how far numbers spread out from their average value
  • Denoted as Var(X) or ฯƒยฒ(X), defined as expected value of squared deviation from mean: Var(X)=E[(Xโˆ’ฮผ)2]Var(X) = E[(X - ฮผ)ยฒ]
  • Standard deviation equals square root of variance denoted as ฯƒ(X) or SD(X) expressed in same units as original data
  • Variance remains non-negative equaling zero only when random variable is constant (no variability)
  • Demonstrates non-linearity Var(aX)=a2Var(X)Var(aX) = aยฒVar(X) for any constant a
  • For independent random variables X and Y, variance of sum equals sum of individual variances: Var(X+Y)=Var(X)+Var(Y)Var(X + Y) = Var(X) + Var(Y)
  • Standard deviation property for constant a and random variable X: SD(aX)=โˆฃaโˆฃSD(X)SD(aX) = |a|SD(X)

Interpretation and Significance

  • Provides measure of spread or dispersion in dataset
  • Larger variance indicates greater variability in data points (stock prices)
  • Smaller variance suggests data points cluster closely around mean (consistent product quality)
  • Used in risk assessment, quality control, and statistical inference
  • Plays crucial role in hypothesis testing and confidence interval construction
  • Helps in comparing datasets with different units or scales
  • Utilized in various fields (finance, engineering, social sciences) to quantify uncertainty and variability

Calculating Variance and Standard Deviation

Discrete Random Variables

  • For discrete random variable X with probability mass function p(x), variance calculated using formula: Var(X)=ฮฃ(xโˆ’ฮผ)2p(x)Var(X) = ฮฃ(x - ฮผ)ยฒp(x)
  • Expected value (mean) calculated first: ฮผ=E(X)=ฮฃxp(x)ฮผ = E(X) = ฮฃxp(x)
  • Standard deviation obtained by taking square root: SD(X)=โˆšVar(X)SD(X) = โˆšVar(X)
  • Calculation involves summing over all possible values of X in sample space
  • Alternative formula for variance: Var(X)=E(X2)โˆ’[E(X)]2Var(X) = E(Xยฒ) - [E(X)]ยฒ where E(X2)=ฮฃx2p(x)E(Xยฒ) = ฮฃxยฒp(x)
  • Be aware of built-in functions for calculating variance and standard deviation in statistical software
  • Apply formulas to common discrete distributions (Binomial, Poisson, Geometric)

Calculation Examples

  • For a fair six-sided die, calculate variance:
    • p(x) = 1/6 for x = 1, 2, 3, 4, 5, 6
    • ฮผ = E(X) = (1+2+3+4+5+6)/6 = 3.5
    • Var(X) = ฮฃ(x - 3.5)ยฒ(1/6) = 2.917
  • For a biased coin with P(Heads) = 0.6, P(Tails) = 0.4:
    • X = 1 for Heads, X = 0 for Tails
    • ฮผ = E(X) = 1(0.6) + 0(0.4) = 0.6
    • Var(X) = (1 - 0.6)ยฒ(0.6) + (0 - 0.6)ยฒ(0.4) = 0.24

Variance and Standard Deviation for Continuous Variables

Computation Methods

  • For continuous random variable X with probability density function f(x), variance calculated using integral: Var(X)=โˆซ(xโˆ’ฮผ)2f(x)dxVar(X) = โˆซ(x - ฮผ)ยฒf(x)dx
  • Expected value (mean) calculated first: ฮผ=E(X)=โˆซxf(x)dxฮผ = E(X) = โˆซxf(x)dx
  • Standard deviation obtained by taking square root: SD(X)=โˆšVar(X)SD(X) = โˆšVar(X)
  • Calculation involves integrating over entire support of probability density function
  • Alternative formula for variance: Var(X)=E(X2)โˆ’[E(X)]2Var(X) = E(Xยฒ) - [E(X)]ยฒ where E(X2)=โˆซx2f(x)dxE(Xยฒ) = โˆซxยฒf(x)dx
  • Familiarize with specific variance formulas for common continuous distributions (Normal, Exponential, Uniform)
  • Understand integration techniques or statistical software use for complex continuous distributions

Application to Specific Distributions

  • For Uniform distribution U(a,b):
    • Variance formula: Var(X)=(bโˆ’a)2/12Var(X) = (b-a)ยฒ/12
    • Example: U(0,1) has variance (1-0)ยฒ/12 = 1/12 โ‰ˆ 0.0833
  • For Exponential distribution with rate parameter ฮป:
    • Variance formula: Var(X)=1/ฮป2Var(X) = 1/ฮปยฒ
    • Example: Exponential(0.5) has variance 1/(0.5)ยฒ = 4
  • For Normal distribution N(ฮผ,ฯƒยฒ):
    • Variance directly given as ฯƒยฒ
    • Example: N(0,1) (standard normal) has variance 1

Variance Properties for Independent Variables

Additive Properties

  • Variance of sum of independent random variables equals sum of individual variances: Var(X+Y)=Var(X)+Var(Y)Var(X + Y) = Var(X) + Var(Y)
  • Variance of difference of independent random variables also equals sum of variances: Var(Xโˆ’Y)=Var(X)+Var(Y)Var(X - Y) = Var(X) + Var(Y)
  • For linear combinations of independent random variables: Var(aX+bY)=a2Var(X)+b2Var(Y)Var(aX + bY) = aยฒVar(X) + bยฒVar(Y) where a and b are constants
  • Standard deviation of sum or difference of independent variables: SD(XยฑY)=โˆš(Var(X)+Var(Y))SD(X ยฑ Y) = โˆš(Var(X) + Var(Y))
  • Properties extend to more than two variables: Var(X+Y+Z)=Var(X)+Var(Y)+Var(Z)Var(X + Y + Z) = Var(X) + Var(Y) + Var(Z) for independent X, Y, and Z

Applications and Limitations

  • Apply properties to solve problems in portfolio analysis, error propagation, or experimental design with multiple independent variables
  • Recognize situations where random variables are not independent requiring additional considerations
  • Used in financial risk assessment (portfolio diversification)
  • Applied in measurement error analysis (combining multiple independent sources of error)
  • Crucial in experimental design (determining overall variability in multi-factor experiments)
  • Limitations arise when variables exhibit correlation or dependence
  • Caution needed when applying to non-linear combinations of random variables