Showing 1 - 10 of 1,381 for search: '"Vassilieva, A."'
Author:
Tao, Tianhua, Li, Junbo, Tan, Bowen, Wang, Hongyi, Marshall, William, Kanakiya, Bhargav M, Hestness, Joel, Vassilieva, Natalia, Shen, Zhiqiang, Xing, Eric P., Liu, Zhengzhong
Large Language Models (LLMs) specializing in code generation (often also referred to as code LLMs), e.g., StarCoder and Code Llama, play increasingly critical roles in various software development scenarios. It is also crucial for code LLMs …
External link:
http://arxiv.org/abs/2411.04156
Author:
Gosal, Gurpreet, Xu, Yishi, Ramakrishnan, Gokul, Joshi, Rituraj, Sheinin, Avraham, Zhiming, Chen, Mishra, Biswajit, Vassilieva, Natalia, Hestness, Joel, Sengupta, Neha, Sahu, Sunil Kumar, Jia, Bokang, Pandit, Onkar, Katipomu, Satheesh, Kamboj, Samta, Ghosh, Samujjwal, Pal, Rahul, Mullah, Parvez, Doraiswamy, Soundar, Chami, Mohamed El Karim, Nakov, Preslav
We present an efficient method for adapting a monolingual Large Language Model (LLM) to another language, addressing challenges of catastrophic forgetting and tokenizer limitations. We focus this study on adapting Llama 2 to Arabic. Our two-stage approach …
External link:
http://arxiv.org/abs/2407.12869
Published in:
Séminaire Lotharingien de Combinatoire 91B (2024) (Proceedings of the 36th FPSAC), Article #86, 12 pp.
In our previous works we introduced a $q$-deformation of the generating functions for enriched $P$-partitions. We call the evaluation of these generating functions on labelled chains the $q$-fundamental quasisymmetric functions. These functions inter…
External link:
http://arxiv.org/abs/2406.01166
Author:
Christophe, Clément, Kanithi, Praveen K, Munjal, Prateek, Raha, Tathagata, Hayat, Nasir, Rajan, Ronnie, Al-Mahrooqi, Ahmed, Gupta, Avani, Salman, Muhammad Umar, Gosal, Gurpreet, Kanakiya, Bhargav, Chen, Charles, Vassilieva, Natalia, Amor, Boulbaba Ben, Pimentel, Marco AF, Khan, Shadab
This study presents a comprehensive analysis and comparison of two predominant fine-tuning methodologies - full-parameter fine-tuning and parameter-efficient tuning - within the context of medical Large Language Models (LLMs). We developed and refine…
External link:
http://arxiv.org/abs/2404.14779
Author:
Dey, Nolan, Soboleva, Daria, Al-Khateeb, Faisal, Yang, Bowen, Pathria, Ribhu, Khachane, Hemant, Muhammad, Shaheer, Zhiming, Chen, Myers, Robert, Steeves, Jacob Robert, Vassilieva, Natalia, Tom, Marvin, Hestness, Joel
We introduce the Bittensor Language Model, called "BTLM-3B-8K", a new state-of-the-art 3 billion parameter open-source language model. BTLM-3B-8K was trained on 627B tokens from the SlimPajama dataset with a mixture of 2,048 and 8,192 context lengths …
External link:
http://arxiv.org/abs/2309.11568
Author:
Shen, Zhiqiang, Tao, Tianhua, Ma, Liqun, Neiswanger, Willie, Liu, Zhengzhong, Wang, Hongyi, Tan, Bowen, Hestness, Joel, Vassilieva, Natalia, Soboleva, Daria, Xing, Eric
This paper aims to understand the impacts of various data combinations (e.g., web text, Wikipedia, GitHub, books) on the pretraining of large language models using SlimPajama. SlimPajama is a rigorously deduplicated, multi-source dataset, which has been …
External link:
http://arxiv.org/abs/2309.10818
We construct a new family $(\eta_{\alpha}^{(q)})_{\alpha \in \operatorname{Comp}}$ of quasisymmetric functions for each element $q$ of the base ring. We call them the "enriched $q$-monomial quasisymmetric functions". When $r$ …
External link:
http://arxiv.org/abs/2309.01118
Author:
Sengupta, Neha, Sahu, Sunil Kumar, Jia, Bokang, Katipomu, Satheesh, Li, Haonan, Koto, Fajri, Marshall, William, Gosal, Gurpreet, Liu, Cynthia, Chen, Zhiming, Afzal, Osama Mohammed, Kamboj, Samta, Pandit, Onkar, Pal, Rahul, Pradhan, Lalit, Mujahid, Zain Muhammad, Baali, Massa, Han, Xudong, Bsharat, Sondos Mahmoud, Aji, Alham Fikri, Shen, Zhiqiang, Liu, Zhengzhong, Vassilieva, Natalia, Hestness, Joel, Hock, Andy, Feldman, Andrew, Lee, Jonathan, Jackson, Andrew, Ren, Hector Xuguang, Nakov, Preslav, Baldwin, Timothy, Xing, Eric
We introduce Jais and Jais-chat, new state-of-the-art Arabic-centric foundation and instruction-tuned open generative large language models (LLMs). The models are based on the GPT-3 decoder-only architecture and are pretrained on a mixture of Arabic …
External link:
http://arxiv.org/abs/2308.16149
Published in:
Séminaire Lotharingien de Combinatoire 89B (2023), Proceedings of the 35th Conference on Formal Power Series and Algebraic Combinatorics (FPSAC 2023), Article #46
Building on our previous works regarding $q$-deformed $P$-partitions, we introduce a new family of subalgebras of the ring of quasisymmetric functions. Each of these subalgebras admits as a basis a $q$-analogue of Gessel's fundamental quasisymmetric functions …
External link:
http://arxiv.org/abs/2301.00309
Author:
Alexandra Vassilieva, Markus Harboe Olsen, Jane Skjøth-Rasmussen, Kirsten Møller, Martin Kryspin Sørensen
Published in:
Physiological Reports, Vol 12, Iss 20, Pp n/a-n/a (2024)
Abstract: Hyperlactatemia is common during tumor craniotomy, but the underlying pathophysiology is unclear. This study measured simultaneous arterial and jugular-bulb lactate concentrations in patients undergoing brain tumor craniotomy to investigate…
External link:
https://doaj.org/article/736d1c1ccb89471689ca451386cafaed