Andromeda
Note

Variance Covariance Matrix

Definition

The Variance-Covariance Matrix (or simply Covariance Matrix) VV summarizes the variances and pairwise covariances of a set of random variables. For a random vector XX, V=E[(Xm)(Xm)T]V = E[(X-m)(X-m)^T].

  • How to read: “V equals the expected value of (X minus m) times (X minus m) transpose.”
  • Meaning: VV encodes how each variable spreads (diagonal) and how pairs co-move (off-diagonal); mm is the mean vector.

Why It Matters

Data isn’t just a list of numbers; it has a “shape.” The covariance matrix is the mathematical lens for that shape. Without it, you cannot detect hidden relationships or eliminate noise, making high-dimensional analysis impossible.

Core Concepts

  • Diagonal Elements: Vii=σi2V_{ii} = \sigma_i^2, the variance of the ii-th variable.
    • How to read: “V-i-i equals sigma-i squared.”
    • Meaning: Spread of variable ii around its own mean; always non-negative.
  • Off-Diagonal Elements: Vij=σijV_{ij} = \sigma_{ij}, the covariance between variable ii and variable jj.
    • How to read: “V-i-j equals sigma-i-j.”
    • Meaning: Positive means variables tend to increase together; negative means one rises as the other falls.
  • Properties:
    • Symmetric: σij=σji\sigma_{ij} = \sigma_{ji}.
      • How to read: “Sigma-i-j equals sigma-j-i.”
      • Meaning: Covariance is symmetric; the matrix equals its transpose.
    • Positive Semidefinite: xTVx0x^T V x \geq 0 for any vector xx. This ensures variances of linear combinations are non-negative.
      • How to read: “x-transpose V x is greater than or equal to zero.”
      • Meaning: No linear combination of the variables can have negative variance.
  • Independence: If variables are independent, VV is a diagonal matrix.

Connected Concepts