Recurrent neural networks can explain flexible trading of speed and accuracy in biological vision

Autor:	Ian Charest, Tim C. Kietzmann, Courtney J. Spoerer, Johannes Mehrer, Nikolaus Kriegeskorte
Přispěvatelé:	Spoerer, Courtney J [0000-0003-3867-4900], Kietzmann, Tim C [0000-0001-8076-6062], Charest, Ian [0000-0002-3939-3003], Kriegeskorte, Nikolaus [0000-0001-7433-9005], Apollo - University of Cambridge Repository
Rok vydání:	2020
Předmět:	0301 basic medicine Male Physiology Visual System Computer science Entropy Sensory Physiology Social Sciences Infographics Convolutional neural network Feedforward Neural Networks Cognition Learning and Memory Mathematical and Statistical Techniques 0302 clinical medicine Mental chronometry Psychology Biology (General) Recurrent Neural Networks Data Management Principal Component Analysis 0303 health sciences Ecology Artificial neural network Physics Statistics Cognitive neuroscience of visual object recognition Cognitive artificial intelligence Sensory Systems Computational Theory and Mathematics Modeling and Simulation Physical Sciences Visual Perception Thermodynamics Feedforward neural network Female Graphs Research Article Adult Computer and Information Sciences Neural Networks QH301-705.5 Cognitive Neuroscience Models Neurological Research and Analysis Methods Cellular and Molecular Neuroscience 03 medical and health sciences Young Adult Memory Genetics Reaction Time Humans Statistical Methods Molecular Biology Ecology Evolution Behavior and Systematics Vision Ocular Signal-flow graph 030304 developmental biology Computational neuroscience business.industry Data Visualization Cognitive Psychology Feed forward Biology and Life Sciences Computational Biology Pattern recognition 030104 developmental biology Recurrent neural network Multivariate Analysis Cognitive Science Perception Artificial intelligence Neural Networks Computer Visual Object Recognition business Mathematics 030217 neurology & neurosurgery Energy (signal processing) Neuroscience
Zdroj:	Plos Computational Biology, 16, 10 PLoS Computational Biology PLoS Computational Biology, Vol 16, Iss 10, p e1008215 (2020) Plos Computational Biology, 16
ISSN:	1553-7358
Popis:	Deep feedforward neural network models of vision dominate in both computational neuroscience and engineering. The primate visual system, by contrast, contains abundant recurrent connections. Recurrent signal flow enables recycling of limited computational resources over time, and so might boost the performance of a physically finite brain or model. Here we show: (1) Recurrent convolutional neural network models outperform feedforward convolutional models matched in their number of parameters in large-scale visual recognition tasks on natural images. (2) Setting a confidence threshold, at which recurrent computations terminate and a decision is made, enables flexible trading of speed for accuracy. At a given confidence threshold, the model expends more time and energy on images that are harder to recognise, without requiring additional parameters for deeper computations. (3) The recurrent model’s reaction time for an image predicts the human reaction time for the same image better than several parameter-matched and state-of-the-art feedforward models. (4) Across confidence thresholds, the recurrent model emulates the behaviour of feedforward control models in that it achieves the same accuracy at approximately the same computational cost (mean number of floating-point operations). However, the recurrent model can be run longer (higher confidence threshold) and then outperforms parameter-matched feedforward comparison models. These results suggest that recurrent connectivity, a hallmark of biological visual systems, may be essential for understanding the accuracy, flexibility, and dynamics of human visual recognition. Author summary Deep neural networks provide the best current models of biological vision and achieve the highest performance in computer vision. Inspired by the primate brain, these models transform the image signals through a sequence of stages, leading to recognition. Unlike brains in which outputs of a given computation are fed back into the same computation, these models do not process signals recurrently. The ability to recycle limited neural resources by processing information recurrently could explain the accuracy and flexibility of biological visual systems, which computer vision systems cannot yet match. Here we report that recurrent processing can improve recognition performance compared to similarly complex feedforward networks. Recurrent processing also enabled models to behave more flexibly and trade off speed for accuracy. Like humans, the recurrent network models can compute longer when an object is hard to recognise, which boosts their accuracy. The model’s recognition times predicted human recognition times for the same images. The performance and flexibility of recurrent neural network models illustrates that modeling biological vision can help us improve computer vision.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c227145782f7d4b20d04d6b4a6414d75 https://hdl.handle.net/2066/225297 Zobrazit plný text záznamu