Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Andriiainen, Andrei"'
Autor:
Tulchinskii, Eduard, Kushnareva, Laida, Kuznetsov, Kristian, Voznyuk, Anastasia, Andriiainen, Andrei, Piontkovskaya, Irina, Burnaev, Evgeny, Barannikov, Serguei
A standard way to evaluate the abilities of LLM involves presenting a multiple-choice question and selecting the option with the highest logit as the model's predicted answer. However, such a format for evaluating LLMs has limitations, since even if
Externí odkaz:
http://arxiv.org/abs/2410.02343