Showing 1 - 10 of 39 for search: '"memorisation"'
Author:
Dankers, Verna, Titov, Ivan
Memorisation is a natural part of learning from real-world data: neural models pick up on atypical input-output combinations and store those training examples in their parameter space. That this happens is well-known, but how and where are questions …
External link:
http://arxiv.org/abs/2408.04965
Author:
Speicher, Till, Khan, Mohammad Aflah, Wu, Qinyuan, Nanda, Vedant, Das, Soumi, Ghosh, Bishwamittra, Gummadi, Krishna P., Terzi, Evimaria
Understanding whether and to what extent large language models (LLMs) have memorised training data has important implications for the reliability of their output and the privacy of their training data. In order to cleanly measure and disentangle memorisation … [a hedged extraction-style measurement sketch follows this entry]
External link:
http://arxiv.org/abs/2407.19262
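A minimal sketch of one common extraction-style memorisation test, not necessarily the protocol of the paper above: prompt the model with a prefix taken from a training document and check whether greedy decoding reproduces the true continuation verbatim. The Hugging Face model name "gpt2" and the 50/50 token split are illustrative assumptions.

# Hedged sketch of an extraction-style memorisation check (not the method of
# arXiv:2407.19262): prompt with a training prefix and test whether greedy
# decoding reproduces the original continuation verbatim.
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")    # illustrative model choice
model = AutoModelForCausalLM.from_pretrained("gpt2")

def is_memorised(document: str, prefix_tokens: int = 50, suffix_tokens: int = 50) -> bool:
    """True if greedy decoding of the prefix reproduces the true suffix."""
    ids = tokenizer(document, return_tensors="pt").input_ids[0]
    if len(ids) < prefix_tokens + suffix_tokens:
        return False
    prefix = ids[:prefix_tokens].unsqueeze(0)
    true_suffix = ids[prefix_tokens:prefix_tokens + suffix_tokens]
    generated = model.generate(
        prefix,
        max_new_tokens=suffix_tokens,
        do_sample=False,                      # greedy decoding
        pad_token_id=tokenizer.eos_token_id,
    )[0][prefix_tokens:]
    return generated.tolist() == true_suffix.tolist()

Counting the fraction of training documents for which is_memorised returns True gives one crude memorisation rate; it does not by itself disentangle memorisation from other phenomena, which is the concern of the paper above.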
Published in:
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) (2024)
Understanding memorisation in language models has practical and societal implications, e.g., studying models' training dynamics or preventing copyright infringements. Prior work defines memorisation as the causal effect of training with an instance on … [a hedged formalisation follows this entry]
External link:
http://arxiv.org/abs/2406.04327
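One common way to formalise the "causal effect of training with an instance" mentioned above is counterfactual memorisation: the change in the model's expected performance on an instance caused by including that instance in the training set. This is a hedged illustration; the exact definition and estimator used in the paper above may differ.

% Counterfactual-style memorisation of an instance x (illustrative notation):
\[
  \operatorname{mem}(x) \;=\;
  \mathbb{E}_{\theta \sim \mathcal{A}(D)}\!\left[ f(x;\theta) \right]
  \;-\;
  \mathbb{E}_{\theta \sim \mathcal{A}(D \setminus \{x\})}\!\left[ f(x;\theta) \right],
\]
where $\mathcal{A}$ is the (randomised) training algorithm, $D$ the training set,
and $f(x;\theta)$ a performance measure on $x$, e.g. the log-likelihood the
trained model assigns to $x$.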
When training a neural network, it will quickly memorise some source-target mappings from your dataset but never learn some others. Yet, memorisation is not easily expressed as a binary feature that is good or bad: individual datapoints lie on a memorisation …
External link:
http://arxiv.org/abs/2311.05379
Large language models have gained significant popularity because of their ability to generate human-like text and potential applications in various fields, such as Software Engineering. Large language models for code are commonly trained on large uns…
External link:
http://arxiv.org/abs/2312.11658
Quantifying the impact of individual data samples on machine learning models is an open research problem. This is particularly relevant when complex and high-dimensional relationships have to be learned from a limited sample of the data generating distribution … [a hedged leave-one-out sketch follows this entry]
External link:
http://arxiv.org/abs/2311.03075
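The most direct, if expensive, way to quantify the impact of an individual sample is leave-one-out retraining: compare test loss with and without the sample in the training set. The sketch below is a generic illustration with a toy scikit-learn model, not the estimator proposed in the paper above.

# Hedged leave-one-out sketch: the influence of training point i is the change
# in test loss when the model is retrained without it. Generic illustration
# only, not the estimator of arXiv:2311.03075.
import numpy as np
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import log_loss

def loo_influence(X_train, y_train, X_test, y_test, i):
    """Test-loss change caused by removing training sample i."""
    full = LogisticRegression(max_iter=1000).fit(X_train, y_train)
    mask = np.arange(len(y_train)) != i
    ablated = LogisticRegression(max_iter=1000).fit(X_train[mask], y_train[mask])
    loss_full = log_loss(y_test, full.predict_proba(X_test), labels=full.classes_)
    loss_ablated = log_loss(y_test, ablated.predict_proba(X_test), labels=ablated.classes_)
    return loss_ablated - loss_full   # positive: removing sample i hurts test performance

Retraining once per sample scales poorly, which is precisely why approximate influence estimators are an active research topic.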
A distinction is often drawn between a model's ability to predict a label for an evaluation sample that is directly memorised from highly similar training samples versus an ability to predict the label via some method of generalisation. In the context …
External link:
http://arxiv.org/abs/2311.12337
Author:
Wang, Canglong (canglongwang6@gmail.com), Wang, Shuo (shuo.wang@beds.ac.uk)
Published in:
China Perspectives. 2023, Issue 135, p61-70. 10p.
Author:
Li, Yucheng
Data contamination in model evaluation is getting increasingly prevalent as the massive training corpora of large language models often unintentionally include benchmark samples. Therefore, contamination analysis has become an inevitable part of reliable … [a hedged contamination-check sketch follows this entry]
External link:
http://arxiv.org/abs/2309.10677
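A hedged sketch of the simplest kind of contamination check motivated above: flag a benchmark sample as potentially contaminated if a large fraction of its word n-grams also occurs in the training corpus. This n-gram overlap heuristic is a generic illustration, not necessarily the analysis method of arXiv:2309.10677.

# Hedged contamination heuristic (generic n-gram overlap, not necessarily the
# method of arXiv:2309.10677): a benchmark sample whose n-grams largely appear
# in the training corpus may have leaked into the training data.
def ngrams(text: str, n: int = 8) -> set:
    words = text.lower().split()
    return {tuple(words[i:i + n]) for i in range(len(words) - n + 1)}

def contamination_score(benchmark_sample: str, training_corpus: str, n: int = 8) -> float:
    """Fraction of the sample's word n-grams that also occur in the corpus."""
    sample_grams = ngrams(benchmark_sample, n)
    if not sample_grams:
        return 0.0
    corpus_grams = ngrams(training_corpus, n)
    return len(sample_grams & corpus_grams) / len(sample_grams)

Scores above a chosen threshold (e.g. 0.5) mark samples worth excluding from, or at least flagging in, the evaluation.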