next up previous
Next: Karhunen-Loeve Transform (KLT) Up: klt Previous: Multivariate Random Signals

Covariance and Correlation

Let $x_i$ and $x_j$ be two random variables in a random vector ${\bf x}=[x_1,\cdots,x_N]^T$. The mean and variance of a variable $x_i$ and the covariance and correlation coefficient (normalized correlation) between two variables $x_i$ and $x_j$ are defined below:

Note that the correlation coefficient $r_{ij}$ can be considered as the normalized version of the covariance $\sigma^2_{ij}$.

To obtain these parameters as expectations of the first and second order functions of the random variables, the joint probability density function $p(x_1,\cdots,x_N)$ is required. However, when $p(x_1,\cdots,x_N)$ is not available, the parameters can still be estimated by averaging the outcomes of a random experiment involving these variables repeated $K$ times:

\begin{displaymath}\hat{\mu}_i=\frac{1}{K}\sum_{k=1}^K x_i^{(k)} \end{displaymath}


\begin{displaymath}\hat{\sigma^2_i}=\frac{1}{K}\sum_{k=1}^K \vert x_i^{(k)}-\hat...
...{K}\sum_{k=1}^K \vert x_i^{(k)}\vert^2-\vert\hat{\mu}_i\vert^2
\end{displaymath}


\begin{displaymath}\hat{\sigma^2_{ij}}=\frac{1}{K}\sum_{k=1}^K (x_i^{(k)}-\hat{\...
...{K}\sum_{k=1}^K x_j^{(k)}x^*_j^{(k)}-\hat{\mu}_i \hat{\mu}_j^*
\end{displaymath}


\begin{displaymath}
\hat{r}_{ij}=\frac{\hat{\sigma^2_{ij}}}{\sqrt{ \hat{\sigma_i...
...=\frac{\hat{\sigma^2_{ij}}}{\hat{\sigma_i}\; \hat{\sigma_j} }
\end{displaymath}

To understand intuitively the meaning of these parameters, we consider the following examples under various situations.

Examples:

Now we see that the covariance $\sigma_{ij}^2$ represents how much the two variables $x_i$ and $x_j$ are positively correlated if $\sigma_{ij}^2>0$, negatively correlated if $\sigma_{ij}^2<0$, or not correlated at all if $\sigma_{ij}^2=0$.

More generally, if a random vector ${\bf x}=[x_1,\cdots,x_N]^T$ is composed of $N$ samples of a random signal $x(t)$, then we can predict that the elements near the main diagonal of the covariance matrix ${\bf\Sigma}_x$ have higher values than those farther away from the diagonal, due to the fact that signal samples close to each other tend to be more correlated than those that are far away.


next up previous
Next: Karhunen-Loeve Transform (KLT) Up: klt Previous: Multivariate Random Signals
Ruye Wang 2013-04-23