Hierarchical Conflict Propagation: Sequence Learning in a Recurrent Deep Neural Network

Author:	Simpson, Andrew J. R.
Year of publication:	2016
Subject:	Computer Science - Learning 68Txx
Document type:	Working Paper
Description:	Recurrent neural networks (RNN) are capable of learning to encode and exploit activation history over an arbitrary timescale. However, in practice, state-of-the-art gradient-descent-based training methods are known to suffer from difficulties in learning long-term dependencies. Here, we describe a novel training method that involves concurrent parallel cloned networks, each sharing the same weights, each trained at a different stimulus phase and each maintaining an independent activation history. Training proceeds by recursively performing batch updates over the parallel clones as activation history is progressively increased. This allows conflicts to propagate hierarchically from short-term contexts towards longer-term contexts until they are resolved. We illustrate the parallel clones method and hierarchical conflict propagation with a character-level deep RNN tasked with memorizing a paragraph of Moby Dick (by Herman Melville).
Database:	arXiv
External link:	http://arxiv.org/abs/1602.08118
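
The description says that conflicts move from short-term contexts towards longer-term contexts as activation history grows. The following is a minimal sketch of one way to read that idea, not the paper's training procedure: no network is trained here. Each "clone" is simply a reader that starts at a different phase offset into the text, and a "conflict" is taken to mean the same context being followed by different target characters. The function names, the cyclic treatment of the text, and this definition of a conflict are all assumptions made for illustration.

```python
def clone_windows(text, n_clones, history):
    """Return one (context, target) pair per clone.

    Clone k starts at phase offset k, reads `history` characters as its
    context, and must predict the character that follows.
    Assumption: the text is treated as cyclic so every clone gets a window.
    """
    n = len(text)
    pairs = []
    for k in range(n_clones):
        start = k % n
        context = "".join(text[(start + i) % n] for i in range(history))
        target = text[(start + history) % n]
        pairs.append((context, target))
    return pairs


def conflicts(pairs):
    """Map each context that has more than one distinct target to its targets."""
    seen = {}
    for context, target in pairs:
        seen.setdefault(context, set()).add(target)
    return {c: t for c, t in seen.items() if len(t) > 1}


def history_schedule(text, n_clones, max_history):
    """Increase the history length step by step and count remaining conflicts."""
    report = []
    for history in range(1, max_history + 1):
        pairs = clone_windows(text, n_clones, history)
        report.append((history, len(conflicts(pairs))))
    return report


if __name__ == "__main__":
    # Toy string: "ab" is followed by "c" once and by "d" once.
    for history, n_conflicts in history_schedule("abcabd", 6, 3):
        print(history, n_conflicts)
```

On the toy string, one ambiguous context remains at history lengths 1 and 2 and none at length 3, which mirrors the claim that a longer history is what resolves a conflict a shorter one cannot.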