Zobrazeno 1 - 10
of 2 256
pro vyhledávání: '"Troshin , Sergey"'
Language models trained on large amounts of data are known to produce inappropriate content in some cases and require careful tuning to be used in the real world. We revisit the reward augmented decoding (RAD) approach to control the generation from
Externí odkaz:
http://arxiv.org/abs/2407.04615
Autor:
Chirkova, Nadezhda, Troshin, Sergey
Recent works have widely adopted large language model pretraining for source code, suggested source code-specific pretraining objectives and investigated the applicability of various Transformer-based language model architectures for source code. Thi
Externí odkaz:
http://arxiv.org/abs/2308.00683
Autor:
Allal, Loubna Ben, Li, Raymond, Kocetkov, Denis, Mou, Chenghao, Akiki, Christopher, Ferrandis, Carlos Munoz, Muennighoff, Niklas, Mishra, Mayank, Gu, Alex, Dey, Manan, Umapathi, Logesh Kumar, Anderson, Carolyn Jane, Zi, Yangtian, Poirier, Joel Lamy, Schoelkopf, Hailey, Troshin, Sergey, Abulkhanov, Dmitry, Romero, Manuel, Lappert, Michael, De Toni, Francesco, del Río, Bernardo García, Liu, Qian, Bose, Shamik, Bhattacharyya, Urvashi, Zhuo, Terry Yue, Yu, Ian, Villegas, Paulo, Zocca, Marco, Mangrulkar, Sourab, Lansky, David, Nguyen, Huu, Contractor, Danish, Villa, Luis, Li, Jia, Bahdanau, Dzmitry, Jernite, Yacine, Hughes, Sean, Fried, Daniel, Guha, Arjun, de Vries, Harm, von Werra, Leandro
The BigCode project is an open-scientific collaboration working on the responsible development of large language models for code. This tech report describes the progress of the collaboration until December 2022, outlining the current state of the Per
Externí odkaz:
http://arxiv.org/abs/2301.03988
Autor:
Troshin, Sergey, Chirkova, Nadezhda
Deep learning models are widely used for solving challenging code processing tasks, such as code generation or code summarization. Traditionally, a specific model architecture was carefully built to solve a particular code processing task. However, r
Externí odkaz:
http://arxiv.org/abs/2202.08975
Autor:
Bobrov, Evgeny, Troshin, Sergey, Chirkova, Nadezhda, Lobacheva, Ekaterina, Panchenko, Sviatoslav, Vetrov, Dmitry, Kropotov, Dmitry
Channel decoding, channel detection, channel assessment, and resource management for wireless multiple-input multiple-output (MIMO) systems are all examples of problems where machine learning (ML) can be successfully applied. In this paper, we study
Externí odkaz:
http://arxiv.org/abs/2112.14423
The paper studies the multi-user precoding problem as a non-convex optimization problem for wireless multiple input and multiple output (MIMO) systems. In our work, we approximate the target Spectral Efficiency function with a novel computationally s
Externí odkaz:
http://arxiv.org/abs/2107.13440
Autor:
Bobrov, Evgeny, Chinyaev, Boris, Kuznetsov, Viktor, Lu, Hao, Minenkov, Dmitrii, Troshin, Sergey, Yudakov, Daniil, Zaev, Danila
Modern wireless cellular networks use massive multiple-input multiple-output (MIMO) technology. This technology involves operations with an antenna array at a base station that simultaneously serves multiple mobile devices which also use multiple ant
Externí odkaz:
http://arxiv.org/abs/2107.00853
Autor:
Chirkova, Nadezhda, Troshin, Sergey
There is an emerging interest in the application of natural language processing models to source code processing tasks. One of the major problems in applying deep learning to software engineering is that source code often contains a lot of rare ident
Externí odkaz:
http://arxiv.org/abs/2010.12663
Autor:
Chirkova, Nadezhda, Troshin, Sergey
Initially developed for natural language processing (NLP), Transformers are now widely used for source code processing, due to the format similarity between source code and text. In contrast to natural language, source code is strictly structured, i.
Externí odkaz:
http://arxiv.org/abs/2010.07987
Autor:
Troshin, Sergey, Tyurin, Nikolay
We discuss effects of reflective scattering for hadron and heavy nuclei collisions at the LHC and asymptotical energies. It is shown that the reflective scattering might lead to decreasing matter density with energy beyond the LHC energies. Limiting
Externí odkaz:
http://arxiv.org/abs/0909.3926