Michael Lewicki, Terry
Sejnowski
Overcomplete ICA: more sources than sensors
In an overcomplete
basis, the number of basis vectors is greater than the dimensionality
of the input. The representation of an input is not a unique combination
of basis vectors, however, overcomplete representations have greater robustness
in the presence of noise, are more sparse, and have greater flexibility
in matching structure in the data. Overcomplete codes have also been proposed
as a model of some of the response properties of neurons in primary visual
cortex. In this paper, we present a learning algorithm for inferring an
overcomplete basis by viewing it as probabilistic model of the observed
data. We show that overcomplete bases allow for better approximation of
the underlying statistical density. Redundancy in the overcomplete representation
is removed by using a Laplacian prior on the basis coefficients which leads
to representations that are sparse and are a nonlinear function of the
data. This work generalizes the technique of independent component analysis
and provides a method for the identification of a greater number of sources
than inputs. We also demonstrate the advantage of learned representations
using natural speech and show that compared to the traditional Fourier
basis the learned basis has greater coding efficiency.
Lewicki, M.S. and Sejnowski, T.J. (1997) ``Learning nonlinear overcomplete
representations for efficient coding.'' Advances in Neural and Information
Processing Systems 10. abstract,
compressed
postscript .
Michael Lewicki, Bruno
Olshausen
Efficient Coding of Natural Images
We apply a general technique for learning overcomplete bases to the problem
of finding efficient image codes. The bases learned by the algorithm are
localized, oriented, and bandpass, consistent with earlier results obtained
using related methods. We show that the learned bases are Gabor-like in
structure and that higher degrees of overcompleteness produce greater sampling
density in position, orientation, and scale. The efficient coding framework
provides a method for comparing different bases objectively by calculating
their probability given the observed data or by measuring the entropy of
the basis function coefficients. Compared to complete and overcomplete
Fourier and wavelet bases, the learned bases have much better coding efficiency.
We demonstrate the improvement in the representation of the learned bases
by showing superior performance in image denoising and filling-in of missing
pixels.
Lewicki, M.S. and Olshausen, B.A. (1998) ``Inferring sparse, overcomplete
image codes using an efficient coding framework.'' J. Opt. Soc. Am. A:
Optics, Image Science, and Vision (submitted). abstract,
compressed
postscript .