Výsledky vyhledávání - "Provilkov, Ivan"

Report

Multi-Sentence Resampling: A Simple Approach to Alleviate Dataset Length Bias and Beam-Search Degradation

Neural Machine Translation (NMT) is known to suffer from a beam-search problem: after a certain point, increasing beam size causes an overall drop in translation quality. This effect is especially pronounced for long sentences. While much work was do

Externí odkaz: http://arxiv.org/abs/2109.06253

Zobrazit plný text záznamu

Report

Shifts: A Dataset of Real Distributional Shift Across Multiple Large-Scale Tasks

Autor: Malinin, Andrey, Band, Neil, Ganshin, Alexander, Chesnokov, German, Gal, Yarin, Gales, Mark J. F., Noskov, Alexey, Ploskonosov, Andrey, Prokhorenkova, Liudmila, Provilkov, Ivan, Raina, Vatsal, Raina, Vyas, Roginskiy, Denis, Shmatova, Mariya, Tigas, Panos, Yangel, Boris

There has been significant research done on developing methods for improving robustness to distributional shift and uncertainty estimation. In contrast, only limited work has examined developing standard datasets and benchmarks for assessing these ap

Externí odkaz: http://arxiv.org/abs/2107.07455

Zobrazit plný text záznamu

Report

Vertex and Energy Reconstruction in JUNO with Machine Learning Methods

The Jiangmen Underground Neutrino Observatory (JUNO) is an experiment designed to study neutrino oscillations. Determination of neutrino mass ordering and precise measurement of neutrino oscillation parameters $\sin^2 2\theta_{12}$, $\Delta m^2_{21}$

Externí odkaz: http://arxiv.org/abs/2101.04839

Zobrazit plný text záznamu

Report

Regression Prior Networks

Autor: Malinin, Andrey, Chervontsev, Sergey, Provilkov, Ivan, Gales, Mark

Prior Networks are a recently developed class of models which yield interpretable measures of uncertainty and have been shown to outperform state-of-the-art ensemble approaches on a range of tasks. They can also be used to distill an ensemble of mode

Externí odkaz: http://arxiv.org/abs/2006.11590

Zobrazit plný text záznamu

Report

BPE-Dropout: Simple and Effective Subword Regularization

Autor: Provilkov, Ivan, Emelianenko, Dmitrii, Voita, Elena

Subword segmentation is widely used to address the open vocabulary problem in machine translation. The dominant approach to subword segmentation is Byte Pair Encoding (BPE), which keeps the most frequent words intact while splitting the rare ones int

Externí odkaz: http://arxiv.org/abs/1910.13267

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání