Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Gawri, Bhavesh"'
As language models have grown in parameters and layers, it has become much harder to train and infer with them on single GPUs. This is severely restricting the availability of large language models such as GPT-3, BERT-Large, and many others. A common
Externí odkaz:
http://arxiv.org/abs/2212.13392