Misplaced Pages

Bruno Olshausen

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
American neuroscientist and professor
Bruno Adolphus Olshausen
Alma materStanford University (BA, MA), California Institute of Technology (PhD)
Scientific career
ThesisNeural routing circuits for forming invariant representations of visual objects (1994)

Bruno Adolphus Olshausen is an American neuroscientist and professor at the University of California, Berkeley, known for his work on computational neuroscience, vision science, and sparse coding. He currently serves as a Professor in the Helen Wills Neuroscience Institute and the UC Berkeley School of Optometry, with an affiliated appointment in Electrical Engineering and Computer Sciences. He is also the Director of the Redwood Center for Theoretical Neuroscience at UC Berkeley.

Career

Olshausen received his B.S. and M.S. degrees in Electrical Engineering from Stanford University in 1986 and 1987 respectively. He earned his Ph.D. in Computation and Neural Systems from the California Institute of Technology in 1994. After completing his doctoral studies, he held postdoctoral positions at Department of Psychology, Cornell University and Center for Biological and Computational Learning, Massachusetts Institute of Technology.

Olshausen has served in several editorial and advisory roles. In 2009, he was awarded Fellowship of Wissenschaftskolleg zu Berlin and Fellowship of Canadian Institute for Advanced Research, Neural Computation and Adaptive Perception program.

His academic appointments include:

  • Assistant Professor (1996-2001), Department of Psychology and Center for Neuroscience, University of California, Davis
  • Associate Professor (2001-2005), Department of Psychology and Center for Neuroscience, UC Davis
  • Associate Professor (2005-2010), Helen Wills Neuroscience Institute and School of Optometry, UC Berkeley
  • Professor (2010-present), Helen Wills Neuroscience Institute and School of Optometry, UC Berkeley

Research

Olshausen's research focuses on understanding the information processing strategies employed by the visual system for tasks such as object recognition and scene analysis. His approach combines studying neural response properties with mathematical modeling to develop functional theories of vision. This work aims to both advance understanding of brain function and develop new algorithms for image analysis based on biological principles. He has also contributed to technological applications, including image and signal processing, alternatives to backpropagation for unsupervised learning, memory storage and computation, analog data compression systems, etc.

Neural coding

One of Olshausen's most significant contributions is demonstrating how the principle of sparse coding can explain response properties of neurons in visual cortex. His 1996 paper in Nature with David J. Field showed how simple cells in the V1 cortex receptive field properties could emerge from learning a sparse code for natural images. This paper is based on two previous reports that gave additional technical details.

Features learned by generalized Hebbian algorithm running on 8-by-8 patches of Caltech 101.

The paper argued that simple cells have Gabor-like, localized, oriented, and bandpass receptive fields. Previous methods, such as generalized Hebbian algorithm, obtains Fourier-like receptive fields that are not localized or oriented. But with sparse coding, such receptive fields do emerge.

Specifically, consider an image I {\displaystyle I} and some receptive fields ϕ 1 , , ϕ m {\displaystyle \phi _{1},\dots ,\phi _{m}} . An image can be approximately represented as a linear sum of the receptive fields: I i a i ϕ i {\displaystyle I\approx \sum _{i}a_{i}\phi _{i}} . If so, then the image can be coded as ( a 1 , , a m ) {\displaystyle (a_{1},\dots ,a_{m})} , a code which may have better properties than directly coding for the pixel values of the image.

The algorithm proceeds as follows:

  • Initialize σ i = 1 {\displaystyle \sigma _{i}=1} for all i {\displaystyle i} , initialize λ {\displaystyle \lambda } to a good value.
  • Choose a bell-shaped function S ( x ) {\displaystyle S(x)} . Examples include S ( x ) = | x | , ln ( 1 + x 2 ) , e x 2 {\textstyle S(x)=|x|,\ln(1+x^{2}),-e^{-x^{2}}}
  • Loop
    • Sample a batch of images I {\displaystyle I} .
      • For each image I {\displaystyle I} in the batch, solve for the coefficients a ( I ) = ( a 1 ( I ) , a 2 ( I ) , ) {\displaystyle a(I)=(a_{1}(I),a_{2}(I),\dots )} that minimize the loss function L ( a ) := I i a i ϕ i 2 + λ i S ( a i / σ i ) {\displaystyle L(a):=\|I-\sum _{i}a_{i}\phi _{i}\|^{2}+\lambda \sum _{i}S(a_{i}/\sigma _{i})}
      • Define the reconstructed image I ^ ( I ) := i a i ( I ) ϕ i {\displaystyle {\hat {I}}(I):=\sum _{i}a_{i}(I)\phi _{i}} .
    • Update each feature ϕ i {\displaystyle \phi _{i}} by Hebbian learning: ϕ i ϕ i + η E [ a i ( I I ^ ) ] {\textstyle \phi _{i}\leftarrow \phi _{i}+\eta E} . Here, η {\displaystyle \eta } is the learning rate and the expectation is over all images I {\displaystyle I} in the batch.
    • Update each σ i 2 {\displaystyle \sigma _{i}^{2}} by σ i 2 E [ a i 2 ] {\textstyle \sigma _{i}^{2}\leftarrow E} . Adjust learning rate.

The key part of the algorithm is the loss function L ( a ) := I i a i ϕ i 2 + λ i S ( a i / σ i ) {\displaystyle L(a):=\|I-\sum _{i}a_{i}\phi _{i}\|^{2}+\lambda \sum _{i}S(a_{i}/\sigma _{i})} where the first term is image reconstruction loss, and the second term is the sparsity loss. Minimizing the first term leads to accurate image reconstruction, and minimizing the second term leads to sparse linear coefficients, that is, a vector ( a 1 , , a m ) {\displaystyle (a_{1},\dots ,a_{m})} with many almost-zero entries. The hyperparameter λ {\displaystyle \lambda } balances the importance of image reconstruction vs sparsity.

Based on the 1996 paper, he worked out a theory that the Gabor filters appearing in the V1 cortex performs sparse coding with overcomplete basis set, such that it is optimal for images occurring in the natural habitat of humans.

References

  1. "Bruno's cv". www.rctn.org. Retrieved 2024-11-18.
  2. "Bruno Olshausen | EECS at UC Berkeley". www2.eecs.berkeley.edu. Retrieved 2024-11-18.
  3. Olshausen, Bruno A.; Field, David J. (June 1996). "Emergence of simple-cell receptive field properties by learning a sparse code for natural images". Nature. 381 (6583): 607–609. Bibcode:1996Natur.381..607O. doi:10.1038/381607a0. ISSN 1476-4687. PMID 8637596.
  4. ^ Olshausen, B.; Field, D. (May 1996). "Natural image statistics and efficient coding". Network: Computation in Neural Systems. 7 (2): 333–339. doi:10.1088/0954-898X/7/2/014. ISSN 0954-898X. PMID 16754394.
  5. B. Olshausen, D. Field, "Sparse coding of natural images produces localized, oriented, bandpass receptive fields", Technical Report CCN-110-95, Department of Psychology, Cornell University, Ithaca, New York 14853, 1995.
  6. Olshausen, Bruno A.; Field, David J. (1997-12-01). "Sparse coding with an overcomplete basis set: A strategy employed by V1?". Vision Research. 37 (23): 3311–3325. doi:10.1016/S0042-6989(97)00169-7. ISSN 0042-6989. PMID 9425546.
  7. Simoncelli, Eero P; Olshausen, Bruno A (March 2001). "Natural Image Statistics and Neural Representation". Annual Review of Neuroscience. 24 (1): 1193–1216. doi:10.1146/annurev.neuro.24.1.1193. ISSN 0147-006X. PMID 11520932.

External links

Categories: