Showing 1 - 1 of 1 for search: '"Grosch, Sharon"'
This paper systematically compares different methods of deriving item-level predictions of language models for multiple-choice tasks. It compares scoring methods for answer options based on free generation of responses, various probability-based scor
External link:
http://arxiv.org/abs/2403.00998