Zobrazeno 1 - 5
of 5
pro vyhledávání: '"Provilkov, Ivan"'
Autor:
Provilkov, Ivan, Malinin, Andrey
Neural Machine Translation (NMT) is known to suffer from a beam-search problem: after a certain point, increasing beam size causes an overall drop in translation quality. This effect is especially pronounced for long sentences. While much work was do
Externí odkaz:
http://arxiv.org/abs/2109.06253
Autor:
Malinin, Andrey, Band, Neil, Ganshin, Alexander, Chesnokov, German, Gal, Yarin, Gales, Mark J. F., Noskov, Alexey, Ploskonosov, Andrey, Prokhorenkova, Liudmila, Provilkov, Ivan, Raina, Vatsal, Raina, Vyas, Roginskiy, Denis, Shmatova, Mariya, Tigas, Panos, Yangel, Boris
There has been significant research done on developing methods for improving robustness to distributional shift and uncertainty estimation. In contrast, only limited work has examined developing standard datasets and benchmarks for assessing these ap
Externí odkaz:
http://arxiv.org/abs/2107.07455
Autor:
Qian, Zhen, Belavin, Vladislav, Bokov, Vasily, Brugnera, Riccardo, Compagnucci, Alessandro, Gavrikov, Arsenii, Garfagnini, Alberto, Gonchar, Maxim, Khatbullina, Leyla, Li, Ziyuan, Luo, Wuming, Malyshkin, Yury, Piccinelli, Samuele, Provilkov, Ivan, Ratnikov, Fedor, Selivanov, Dmitry, Treskov, Konstantin, Ustyuzhanin, Andrey, Vidaich, Francesco, You, Zhengyun, Zhang, Yumei, Zhu, Jiang, Manzali, Francesco
The Jiangmen Underground Neutrino Observatory (JUNO) is an experiment designed to study neutrino oscillations. Determination of neutrino mass ordering and precise measurement of neutrino oscillation parameters $\sin^2 2\theta_{12}$, $\Delta m^2_{21}$
Externí odkaz:
http://arxiv.org/abs/2101.04839
Prior Networks are a recently developed class of models which yield interpretable measures of uncertainty and have been shown to outperform state-of-the-art ensemble approaches on a range of tasks. They can also be used to distill an ensemble of mode
Externí odkaz:
http://arxiv.org/abs/2006.11590
Subword segmentation is widely used to address the open vocabulary problem in machine translation. The dominant approach to subword segmentation is Byte Pair Encoding (BPE), which keeps the most frequent words intact while splitting the rare ones int
Externí odkaz:
http://arxiv.org/abs/1910.13267