Showing 1 - 1 of 1 for search: '"Bongni, Sascha"'
Recent efforts in fine-tuning language models often rely on automatic data selection, commonly using Nearest Neighbors retrieval from large datasets. However, we theoretically show that this approach tends to select redundant data, limiting its effec
External link:
http://arxiv.org/abs/2410.08020