Authors:
Thakur, Aman Singh; Choudhary, Kartik; Ramayapally, Venkat Srinik; Vaidyanathan, Sankaran; Hupkes, Dieuwke
Offering a promising solution to the scalability challenges associated with human evaluation, the LLM-as-a-judge paradigm is rapidly gaining traction as an approach to evaluating large language models (LLMs). However, there are still many open questi
External link:
http://arxiv.org/abs/2406.12624