Learning Midlevel Auditory Codes from Natural Sound Statistics
Autor: | Wiktor Mlynarski, Joshua H. McDermott |
---|---|
Rok vydání: | 2018 |
Předmět: |
0301 basic medicine
Auditory Pathways Sound Spectrography Cognitive Neuroscience Speech recognition Models Neurological Sensory system Auditory cortex 03 medical and health sciences 0302 clinical medicine Arts and Humanities (miscellaneous) Statistics Animals Learning Natural sounds Auditory Cortex Neurons Models Statistical Artificial neural network Generative model 030104 developmental biology Kernel (image processing) Pattern Recognition Physiological Pattern recognition (psychology) Auditory Perception Cats Spectrogram Neural Networks Computer Noise Psychology 030217 neurology & neurosurgery |
Zdroj: | Neural Computation. 30:631-669 |
ISSN: | 1530-888X 0899-7667 |
DOI: | 10.1162/neco_a_01048 |
Popis: | Interaction with the world requires an organism to transform sensory signals into representations in which behaviorally meaningful properties of the environment are made explicit. These representations are derived through cascades of neuronal processing stages in which neurons at each stage recode the output of preceding stages. Explanations of sensory coding may thus involve understanding how low-level patterns are combined into more complex structures. To gain insight into such midlevel representations for sound, we designed a hierarchical generative model of natural sounds that learns combinations of spectrotemporal features from natural stimulus statistics. In the first layer, the model forms a sparse convolutional code of spectrograms using a dictionary of learned spectrotemporal kernels. To generalize from specific kernel activation patterns, the second layer encodes patterns of time-varying magnitude of multiple first-layer coefficients. When trained on corpora of speech and environmental sounds, some second-layer units learned to group similar spectrotemporal features. Others instantiate opponency between distinct sets of features. Such groupings might be instantiated by neurons in the auditory cortex, providing a hypothesis for midlevel neuronal computation. |
Databáze: | OpenAIRE |
Externí odkaz: | |
Nepřihlášeným uživatelům se plný text nezobrazuje | K zobrazení výsledku je třeba se přihlásit. |