Výsledky vyhledávání - "Moshkov, P. A."

Report

OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction Data

Autor: Toshniwal, Shubham, Du, Wei, Moshkov, Ivan, Kisacanin, Branislav, Ayrapetyan, Alexan, Gitman, Igor

Mathematical reasoning continues to be a critical challenge in large language model (LLM) development with significant interest. However, most of the cutting-edge progress in mathematical reasoning with LLMs has become \emph{closed-source} due to lac

Externí odkaz: http://arxiv.org/abs/2410.01560

Zobrazit plný text záznamu

Report

Nemotron-4 340B Technical Report

Autor: Nvidia, Adler, Bo, Agarwal, Niket, Aithal, Ashwath, Anh, Dong H., Bhattacharya, Pallab, Brundyn, Annika, Casper, Jared, Catanzaro, Bryan, Clay, Sharon, Cohen, Jonathan, Das, Sirshak, Dattagupta, Ayush, Delalleau, Olivier, Derczynski, Leon, Dong, Yi, Egert, Daniel, Evans, Ellie, Ficek, Aleksander, Fridman, Denys, Ghosh, Shaona, Ginsburg, Boris, Gitman, Igor, Grzegorzek, Tomasz, Hero, Robert, Huang, Jining, Jawa, Vibhu, Jennings, Joseph, Jhunjhunwala, Aastha, Kamalu, John, Khan, Sadaf, Kuchaiev, Oleksii, LeGresley, Patrick, Li, Hui, Liu, Jiwei, Liu, Zihan, Long, Eileen, Mahabaleshwarkar, Ameya Sunil, Majumdar, Somshubra, Maki, James, Martinez, Miguel, de Melo, Maer Rodrigues, Moshkov, Ivan, Narayanan, Deepak, Narenthiran, Sean, Navarro, Jesus, Nguyen, Phong, Nitski, Osvald, Noroozi, Vahid, Nutheti, Guruprasad, Parisien, Christopher, Parmar, Jupinder, Patwary, Mostofa, Pawelec, Krzysztof, Ping, Wei, Prabhumoye, Shrimai, Roy, Rajarshi, Saar, Trisha, Sabavat, Vasanth Rao Naik, Satheesh, Sanjeev, Scowcroft, Jane Polak, Sewall, Jason, Shamis, Pavel, Shen, Gerald, Shoeybi, Mohammad, Sizer, Dave, Smelyanskiy, Misha, Soares, Felipe, Sreedhar, Makesh Narsimhan, Su, Dan, Subramanian, Sandeep, Sun, Shengyang, Toshniwal, Shubham, Wang, Hao, Wang, Zhilin, You, Jiaxuan, Zeng, Jiaqi, Zhang, Jimmy, Zhang, Jing, Zhang, Vivienne, Zhang, Yian, Zhu, Chen

We release the Nemotron-4 340B model family, including Nemotron-4-340B-Base, Nemotron-4-340B-Instruct, and Nemotron-4-340B-Reward. Our models are open access under the NVIDIA Open Model License Agreement, a permissive model license that allows distri

Externí odkaz: http://arxiv.org/abs/2406.11704

Zobrazit plný text záznamu

Report

OpenMathInstruct-1: A 1.8 Million Math Instruction Tuning Dataset

Autor: Toshniwal, Shubham, Moshkov, Ivan, Narenthiran, Sean, Gitman, Daria, Jia, Fei, Gitman, Igor

Recent work has shown the immense potential of synthetically generated datasets for training large language models (LLMs), especially for acquiring targeted skills. Current large-scale math instruction tuning datasets such as MetaMathQA (Yu et al., 2

Externí odkaz: http://arxiv.org/abs/2402.10176

Zobrazit plný text záznamu

Report

Lower Bounds on Cardinality of Reducts for Decision Tables from Closed Classes

Autor: Ostonov, Azimkhon, Moshkov, Mikhail

In this paper, we consider classes of decision tables closed under removal of attributes (columns) and changing of decisions attached to rows. For decision tables from closed classes, we study lower bounds on the minimum cardinality of reducts, which

Externí odkaz: http://arxiv.org/abs/2401.01324

Zobrazit plný text záznamu

Report

Comparison of Deterministic and Nondeterministic Decision Trees for Decision Tables with Many-valued Decisions from Closed Classes

Autor: Ostonov, Azimkhon, Moshkov, Mikhail

In this paper, we consider classes of decision tables with many-valued decisions closed relative to removal of attributes (columns) and changing sets of decisions assigned to rows. For tables from an arbitrary closed class, we study a function $\math

Externí odkaz: http://arxiv.org/abs/2312.01116

Zobrazit plný text záznamu

Report

CHAMMI: A benchmark for channel-adaptive models in microscopy imaging

Autor: Chen, Zitong, Pham, Chau, Wang, Siqi, Doron, Michael, Moshkov, Nikita, Plummer, Bryan A., Caicedo, Juan C.

Most neural networks assume that input images have a fixed number of channels (three for RGB images). However, there are many settings where the number of channels may vary, such as microscopy images where the number of channels changes depending on

Externí odkaz: http://arxiv.org/abs/2310.19224

Zobrazit plný text záznamu

Report

Deterministic and Strongly Nondeterministic Decision Trees for Decision Tables from Closed Classes

Autor: Ostonov, Azimkhon, Moshkov, Mikhail

Publikováno v: IEEE Access 12, 164979-164988 (2024)

In this paper, we consider classes of decision tables with 0-1-decisions closed relative to removal of attributes (columns) and changing decisions assigned to rows. For tables from an arbitrary closed class, we study the dependence of the minimum com

Externí odkaz: http://arxiv.org/abs/2305.06093

Zobrazit plný text záznamu

Report

Construction of Decision Trees and Acyclic Decision Graphs from Decision Rule Systems

Autor: Durdymyradov, Kerven, Moshkov, Mikhail

Decision trees and systems of decision rules are widely used as classifiers, as a means for knowledge representation, and as algorithms. They are among the most interpretable models for data analysis. The study of the relationships between these two

Externí odkaz: http://arxiv.org/abs/2305.01721

Zobrazit plný text záznamu

Report

Comparative Analysis of Deterministic and Nondeterministic Decision Trees for Decision Tables from Closed Classes

Autor: Ostonov, Azimkhon, Moshkov, Mikhail

In this paper, we consider classes of decision tables with many-valued decisions closed under operations of removal of columns, changing of decisions, permutation of columns, and duplication of columns. We study relationships among three parameters o

Externí odkaz: http://arxiv.org/abs/2304.10594

Zobrazit plný text záznamu

Report

Bounds on Depth of Decision Trees Derived from Decision Rule Systems

Autor: Durdymyradov, Kerven, Moshkov, Mikhail

Systems of decision rules and decision trees are widely used as a means for knowledge representation, as classifiers, and as algorithms. They are among the most interpretable models for classifying and representing knowledge. The study of relationshi

Externí odkaz: http://arxiv.org/abs/2302.07063

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání