Search results

Report

Self-training Large Language Models through Knowledge Detection

Author: Yeo, Wei Jie; Ferdinan, Teddy; Kazienko, Przemyslaw; Satapathy, Ranjan; Cambria, Erik

Large language models (LLMs) often necessitate extensive labeled datasets and training compute to achieve impressive performance across downstream tasks. This paper explores a self-training paradigm, where the LLM autonomously curates its own labels

External link: http://arxiv.org/abs/2406.11275

Report

Eagle and Finch: RWKV with Matrix-Valued States and Dynamic Recurrence

We present Eagle (RWKV-5) and Finch (RWKV-6), sequence models improving upon the RWKV (RWKV-4) architecture. Our architectural design advancements include multi-headed matrix-valued states and a dynamic recurrence mechanism that improve expressivity

External link: http://arxiv.org/abs/2404.05892

Report

Personalized Large Language Models

Author: Woźniak, Stanisław; Koptyra, Bartłomiej; Janz, Arkadiusz; Kazienko, Przemysław; Kocoń, Jan

Large language models (LLMs) have significantly advanced Natural Language Processing (NLP) tasks in recent years. However, their universal nature poses limitations in scenarios requiring personalized responses, such as recommendation systems and chat

External link: http://arxiv.org/abs/2402.09269

Report

Into the Unknown: Self-Learning Large Language Models

Author: Ferdinan, Teddy; Kocoń, Jan; Kazienko, Przemysław

We address the main problem of self-learning LLM: the question of what to learn. We propose a self-learning LLM framework that enables an LLM to independently learn previously unknown knowledge through self-assessment of their own hallucinations. We

External link: http://arxiv.org/abs/2402.09147

Report

From Generalized Laughter to Personalized Chuckles: Unleashing the Power of Data Fusion in Subjective Humor Detection

Author: Bielaniewicz, Julita; Kazienko, Przemysław

The vast area of subjectivity in Natural Language Processing (NLP) poses a challenge to the solutions typically used in generalized tasks. As exploration in the scope of generalized NLP is much more advanced, it implies the tremendous gap that is sti

External link: http://arxiv.org/abs/2312.11296

Report

Towards Model-Based Data Acquisition for Subjective Multi-Task NLP Problems

Author: Kanclerz, Kamil; Bielaniewicz, Julita; Gruza, Marcin; Kocon, Jan; Woźniak, Stanisław; Kazienko, Przemysław

Data annotated by humans is a source of knowledge by describing the peculiarities of the problem and therefore fueling the decision process of the trained model. Unfortunately, the annotation process for subjective natural language processing (NLP) p

External link: http://arxiv.org/abs/2312.08198

Report

Modeling Uncertainty in Personalized Emotion Prediction with Normalizing Flows

Author: Miłkowski, Piotr; Karanowski, Konrad; Wielopolski, Patryk; Kocoń, Jan; Kazienko, Przemysław; Zięba, Maciej

Designing predictive models for subjective problems in natural language processing (NLP) remains challenging. This is mainly due to its non-deterministic nature and different perceptions of the content by different humans. It may be solved by Persona

External link: http://arxiv.org/abs/2312.06034

Report

Scaling Representation Learning from Ubiquitous ECG with State-Space Models

Author: Avramidis, Kleanthis; Kunc, Dominika; Perz, Bartosz; Adsul, Kranti; Feng, Tiantian; Kazienko, Przemysław; Saganowski, Stanisław; Narayanan, Shrikanth

Ubiquitous sensing from wearable devices in the wild holds promise for enhancing human well-being, from diagnosing clinical conditions and measuring stress to building adaptive health promoting scaffolds. But the large volumes of data therein across

External link: http://arxiv.org/abs/2309.15292
