Showing 1 - 1 of 1 for search: '"Qu, Yuanbin"'
Authors:
Zhao, Yu; Qu, Yuanbin; Staniszewski, Konrad; Tworkowski, Szymon; Liu, Wei; Miłoś, Piotr; Wu, Yuxiang; Minervini, Pasquale
Most language model pre-training frameworks concatenate multiple documents into fixed-length sequences and use causal masking to compute the likelihood of each token given its context; this strategy is widely adopted due to its simplicity and efficie
External link:
http://arxiv.org/abs/2402.13991
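
Illustrative sketch (not part of the record): the packing strategy the abstract describes, concatenating documents into fixed-length sequences under a causal mask, might look roughly like the following. The function names, the `eos_id` separator token, and the choice to drop the trailing partial sequence are assumptions made for illustration and are not taken from the paper.

```python
from typing import List


def pack_documents(docs: List[List[int]], seq_len: int, eos_id: int) -> List[List[int]]:
    """Concatenate tokenized documents and cut the stream into fixed-length sequences.

    Assumption: each document is followed by a separator token `eos_id`.
    """
    stream: List[int] = []
    for doc in docs:
        stream.extend(doc)
        stream.append(eos_id)
    # Assumption: the trailing remainder that does not fill a whole sequence is dropped.
    n_full = len(stream) // seq_len
    return [stream[i * seq_len:(i + 1) * seq_len] for i in range(n_full)]


def causal_mask(seq_len: int) -> List[List[bool]]:
    """mask[i][j] is True when position i may attend to position j (j <= i)."""
    return [[j <= i for j in range(seq_len)] for i in range(seq_len)]


# Three toy "documents" of token ids, packed into sequences of length 4.
docs = [[1, 2, 3], [4, 5], [6, 7, 8, 9]]
packed = pack_documents(docs, seq_len=4, eos_id=0)
mask = causal_mask(4)
```

In this toy run the second packed sequence holds tokens from two different documents, and the plain causal mask lets the later tokens attend to the earlier ones regardless of which document they came from.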