Definition
The Variance-Covariance Matrix (or simply Covariance Matrix) summarizes the variances and pairwise covariances of a set of random variables. For a random vector , .
- How to read: “V equals the expected value of (X minus m) times (X minus m) transpose.”
- Meaning: encodes how each variable spreads (diagonal) and how pairs co-move (off-diagonal); is the mean vector.
Why It Matters
Data isn’t just a list of numbers; it has a “shape.” The covariance matrix is the mathematical lens for that shape. Without it, you cannot detect hidden relationships or eliminate noise, making high-dimensional analysis impossible.
Core Concepts
- Diagonal Elements: , the variance of the -th variable.
- How to read: “V-i-i equals sigma-i squared.”
- Meaning: Spread of variable around its own mean; always non-negative.
- Off-Diagonal Elements: , the covariance between variable and variable .
- How to read: “V-i-j equals sigma-i-j.”
- Meaning: Positive means variables tend to increase together; negative means one rises as the other falls.
- Properties:
- Symmetric: .
- How to read: “Sigma-i-j equals sigma-j-i.”
- Meaning: Covariance is symmetric; the matrix equals its transpose.
- Positive Semidefinite: for any vector . This ensures variances of linear combinations are non-negative.
- How to read: “x-transpose V x is greater than or equal to zero.”
- Meaning: No linear combination of the variables can have negative variance.
- Symmetric: .
- Independence: If variables are independent, is a diagonal matrix.