Zobrazeno 1 - 2
of 2
pro vyhledávání: '"Choe, Dokook"'
Purely character-based language models (LMs) have been lagging in quality on large scale datasets, and current state-of-the-art LMs rely on word tokenization. It has been assumed that injecting the prior knowledge of a tokenizer into the model is ess
Externí odkaz:
http://arxiv.org/abs/1908.10322
LSTMs and other RNN variants have shown strong performance on character-level language modeling. These models are typically trained using truncated backpropagation through time, and it is common to assume that their success stems from their ability t
Externí odkaz:
http://arxiv.org/abs/1808.04444