Showing 1 - 10 of 873 for search: '"P. Bielikova"'
Prompt tuning is a modular and efficient solution for training large language models (LLMs). One of its main advantages is task modularity, making it suitable for multi-task problems. However, current soft-prompt-based methods often sacrifice multi-task…
External link: http://arxiv.org/abs/2408.01119
While fine-tuning of pre-trained language models generally helps to overcome the lack of labelled training samples, it also displays model performance instability. This instability mainly originates from randomness in initialisation or data shuffling…
External link: http://arxiv.org/abs/2406.12471
While learning with limited labelled data can improve performance when the labels are lacking, it is also sensitive to the effects of uncontrolled randomness introduced by so-called randomness factors (e.g., varying order of data). We propose a method…
External link: http://arxiv.org/abs/2402.12817
When solving NLP tasks with limited labelled data, researchers can either use a general large language model without further update, or use a small number of labelled examples to tune a specialised smaller model. In this work, we address the research…
External link: http://arxiv.org/abs/2402.12819
In few-shot learning, such as meta-learning, few-shot fine-tuning or in-context learning, the limited number of samples used to train a model has a significant impact on the overall success. Although a large number of sample selection strategies exist…
External link: http://arxiv.org/abs/2402.03038
Authors: Macko, Dominik, Moro, Robert, Uchendu, Adaku, Srba, Ivan, Lucas, Jason Samuel, Yamashita, Michiharu, Tripto, Nafis Irtiza, Lee, Dongwon, Simko, Jakub, Bielikova, Maria
High-quality text generation capability of recent Large Language Models (LLMs) causes concerns about their misuse (e.g., in massive generation/spread of disinformation). Machine-generated text (MGT) detection is important to cope with such threats. However…
External link: http://arxiv.org/abs/2401.07867
Authors: Cegin, Jan, Pecher, Branislav, Simko, Jakub, Srba, Ivan, Bielikova, Maria, Brusilovsky, Peter
The latest generative large language models (LLMs) have found their application in data augmentation tasks, where small numbers of text samples are LLM-paraphrased and then used to fine-tune downstream models. However, more research is needed to assess…
External link: http://arxiv.org/abs/2401.06643
Learning with limited labelled data, such as prompting, in-context learning, fine-tuning, meta-learning or few-shot learning, aims to effectively train a model using only a small number of labelled samples. However, these approaches have been observed…
External link: http://arxiv.org/abs/2312.01082
Automated disinformation generation is often listed as an important risk associated with large language models (LLMs). The theoretical ability to flood the information space with disinformation content might have dramatic consequences for societies…
External link: http://arxiv.org/abs/2311.08838
Authors: Macko, Dominik, Moro, Robert, Uchendu, Adaku, Lucas, Jason Samuel, Yamashita, Michiharu, Pikuliak, Matúš, Srba, Ivan, Le, Thai, Lee, Dongwon, Simko, Jakub, Bielikova, Maria
Published in: Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing
There is a lack of research into the capabilities of recent LLMs to generate convincing text in languages other than English and into the performance of detectors of machine-generated text in multilingual settings. This is also reflected in the available benchmarks…
External link: http://arxiv.org/abs/2310.13606