Teaching recurrent neural networks to infer global temporal structure from local examples
Authors: Jason Z. Kim, Zhixin Lu, Erfan Nozari, George J. Pappas, Danielle S. Bassett
Year of publication: 2021
Subjects: Recurrent neural network; Lorenz system; Chaotic; Complex dynamics; Period-doubling bifurcation; Representation (mathematics); Structure (mathematical logic); Process (engineering); Computer science; Computer Science::Neural and Evolutionary Computation; Artificial intelligence; Artificial Intelligence; Computer Vision and Pattern Recognition; Computer Networks and Communications; Human-Computer Interaction; Software
Source: Nature Machine Intelligence 3: 316–323
ISSN: 2522-5839
Abstract: The ability to store and manipulate information is a hallmark of computational systems. Whereas computers are carefully engineered to represent and perform mathematical operations on structured data, neurobiological systems adapt to perform analogous functions without needing to be explicitly engineered. Recent efforts have made progress in modelling the representation and recall of information in neural systems. However, precisely how neural systems learn to modify these representations remains far from understood. Here, we demonstrate that a recurrent neural network (RNN) can learn to modify its representation of complex information using only examples, and we explain the associated learning mechanism with new theory. Specifically, we drive an RNN with examples of translated, linearly transformed or pre-bifurcated time series from a chaotic Lorenz system, alongside an additional control signal that changes value for each example. By training the network to replicate the Lorenz inputs, it learns to autonomously evolve about a Lorenz-shaped manifold. Additionally, it learns to continuously interpolate and extrapolate the translation, transformation and bifurcation of this representation far beyond the training data by changing the control signal. Furthermore, we demonstrate that RNNs can infer the bifurcation structure of normal forms and period-doubling routes to chaos, and extrapolate non-dynamical, kinematic trajectories. Finally, we provide a mechanism for how these computations are learned, and replicate our main results using a Wilson–Cowan reservoir. Together, our results provide a simple but powerful mechanism by which an RNN can learn to manipulate internal representations of complex information, enabling the principled study and precise design of RNNs. Recurrent neural networks (RNNs) can learn to process temporal information, such as speech or movement. New work makes such approaches more powerful and flexible by describing theory and experiments demonstrating that RNNs can learn from a few examples to generalize and predict complex dynamics, including chaotic behaviour. An illustrative code sketch of this training setup is given after the record fields below.
Database: OpenAIRE
External link:
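The abstract describes a reservoir-computing style setup: a recurrent network is driven by example Lorenz time series together with a constant control signal for each example, a linear readout is trained to reproduce the input, and the trained network is then run in closed loop with new control values to test interpolation and extrapolation. The sketch below is a minimal illustration of that idea under stated assumptions, not the authors' code: the reservoir size, spectral radius, Euler integration of the Lorenz equations, ridge regularization, and the particular offsets and control values are all illustrative choices.

```python
# Minimal illustrative sketch (not the authors' code): an echo-state reservoir is driven
# by translated Lorenz trajectories plus a constant per-example control signal, a linear
# readout is fit by ridge regression to reproduce the input, and the network is then run
# in closed loop with an unseen control value. All parameter values are assumptions.
import numpy as np

rng = np.random.default_rng(0)

def lorenz_trajectory(n_steps, dt=0.01, offset=0.0, sigma=10.0, rho=28.0, beta=8.0/3.0):
    """Crude Euler integration of the Lorenz system; the x coordinate is shifted by `offset`."""
    out = np.empty((n_steps, 3))
    s = np.array([1.0, 1.0, 1.0])
    for t in range(n_steps):
        ds = np.array([sigma * (s[1] - s[0]),
                       s[0] * (rho - s[2]) - s[1],
                       s[0] * s[1] - beta * s[2]])
        s = s + dt * ds
        out[t] = s
    out *= 0.02                 # scale into a range the tanh reservoir handles comfortably
    out[:, 0] += offset         # translated copy of the attractor (offset in scaled units)
    return out

# Echo-state reservoir: input = 3 Lorenz coordinates + 1 control signal.
N, n_in = 500, 4
A = rng.normal(0.0, 1.0, (N, N)) * (rng.random((N, N)) < 0.02)   # sparse recurrent weights
A *= 0.9 / np.max(np.abs(np.linalg.eigvals(A)))                  # spectral radius ~0.9
W_in = rng.uniform(-0.5, 0.5, (N, n_in))

def run_reservoir(inputs):
    """Drive the reservoir with a sequence of input vectors; return all reservoir states."""
    r = np.zeros(N)
    states = np.empty((len(inputs), N))
    for t, u in enumerate(inputs):
        r = np.tanh(A @ r + W_in @ u)
        states[t] = r
    return states

# Training examples: a few translated attractors, each paired with one constant control value.
offsets = [-0.2, 0.0, 0.2]
controls = [-0.1, 0.0, 0.1]
R, Y = [], []
for off, c in zip(offsets, controls):
    traj = lorenz_trajectory(5000, offset=off)
    drive = np.hstack([traj, np.full((len(traj), 1), c)])
    states = run_reservoir(drive)
    R.append(states[500:-1])        # discard the transient; each state predicts the next input
    Y.append(traj[501:])
R, Y = np.vstack(R), np.vstack(Y)

# Ridge-regression readout mapping reservoir state -> next Lorenz point.
lam = 1e-6
W_out = np.linalg.solve(R.T @ R + lam * np.eye(N), R.T @ Y).T

# Closed-loop run with a control value never used in training, to probe whether the
# learned translation interpolates between the trained offsets.
c_test = 0.05
warm = lorenz_trajectory(1000, offset=0.0)
r = run_reservoir(np.hstack([warm, np.full((1000, 1), c_test)]))[-1]
u = np.append(warm[-1], c_test)
free_run = np.empty((2000, 3))
for t in range(2000):
    r = np.tanh(A @ r + W_in @ u)
    y = W_out @ r
    free_run[t] = y
    u = np.append(y, c_test)
# `free_run` should trace a Lorenz-like attractor whose translation tracks c_test.
```

Plotting `free_run` against the training trajectories is one way to check the behaviour the abstract describes: if the control signal has been associated with the translation, the closed-loop attractor should sit between the trained offsets for an intermediate control value and beyond them for extrapolated ones.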