next up previous
Next: Maximum Likelihood estimation Up: Probabilistic ICA model Previous: Probabilistic ICA model


Uniqueness

[Rao, 1969] extends the standard factor analysis model such that the common and specific variables are independent non-degenerate random variables and examines the implication for the minimum rank of the mixing matrix $ A$ in equation 2. Earlier work [Rao, 1966] characterised the multivariate normal distribution through the non-uniqueness of its linear structure, a result which within the ICA literature has been restated as the limitation that only one Gaussian source process, at most, may contribute to the observations for the ICA model to be estimable [Comon, 1994,Hyvärinen et al., 2001]. Here, a vector variable $ x$ is said to have a linear structure if it can be decomposed as

$\displaystyle \mbox{\protect\boldmath$x$}$$\displaystyle =$$\displaystyle \mbox{\protect\boldmath$\mu$}$$\displaystyle +$$\displaystyle \mbox{\protect\boldmath$A$}$$\displaystyle \mbox{\protect\boldmath$s$}$$\displaystyle ,$ (3)

where $ s$ is a vector of non-degenerate statistically independent one dimensional random variables and $ A$ is a matrix of full column rank. The linear structure is said to be essentially unique if all the linear decompositions are equivalent in the sense that if the vector variable $ x$ allows for two structural representations

$\displaystyle \mbox{\protect\boldmath$x$}$$\displaystyle =$$\displaystyle \mbox{\protect\boldmath$\mu$}$$\displaystyle _1+$$\displaystyle \mbox{\protect\boldmath$A$}$$\displaystyle _1$$\displaystyle \mbox{\protect\boldmath$s$}$$\displaystyle _1$   and   $\displaystyle \mbox{\protect\boldmath$x$}$$\displaystyle =$$\displaystyle \mbox{\protect\boldmath$\mu$}$$\displaystyle _2+$$\displaystyle \mbox{\protect\boldmath$A$}$$\displaystyle _2$$\displaystyle \mbox{\protect\boldmath$s$}$$\displaystyle _2,$ (4)

then every column of $ A$$ _1$ is a multiple of some column of $ A$$ _2$ and vice versa, i.e. the two matrices are identical modulo scaling and permutation. This again has been noted as a standard restriction within the ICA framework [Comon, 1994].

The main result in [Rao, 1969] is a decomposition theorem that states that if $ x$ is a $ p$-variate random variable with a linear structure $ x$$ =$$ A$$ s$ where all the elements of $ s$ are non-Gaussian variables, then there does not exist a non-equivalent linear structure involving the same number or a smaller number of structural variables than that of $ s$.

Furthermore, if $ x$ is a $ p$-vector random variable with a linear structure $ x$$ =$$ A$$ s$ then $ x$ can be decomposed

$\displaystyle \mbox{\protect\boldmath$x$}$$\displaystyle =$$\displaystyle \mbox{\protect\boldmath$x$}$$\displaystyle _1+$   $\displaystyle \mbox{\protect\boldmath$x$}$$\displaystyle _2$

where $ x$$ _1$ and $ x$$ _2$ are independent, $ x$$ _1$ is non-Gaussian and has an essentially unique linear structure and $ x$$ _2$ is $ p$-variate normal with a non-unique linear structure.

The proofs involve the characteristic functions of the vector random variables $ x$ and $ s$ and as such these results are applicable only if the number of observations (i.e. voxels) is sufficiently large to accurately reflect the distribution of these quantities.

The results show, however, that conditioned on knowing the number of source signals contained in the data and under the assumption that the data are generated according to equation 2, i.e. a linear mixture of independent non-Gaussian source signals confounded by Gaussian noise, there is no non-equivalent decomposition into this number of independent non-Gaussian random variables and an associated mixing matrix; the decomposition into independent components is unique, provided we do not attempt to extract more than $ q$ source signals from the data.


next up previous
Next: Maximum Likelihood estimation Up: Probabilistic ICA model Previous: Probabilistic ICA model
Christian F. Beckmann 2003-08-05