Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Low, Hao Han"'
Autor:
Gupta, Prannaya, Yau, Le Qi, Low, Hao Han, Lee, I-Shiang, Lim, Hugo Maximus, Teoh, Yu Xin, Koh, Jia Hng, Liew, Dar Win, Bhardwaj, Rishabh, Bhardwaj, Rajat, Poria, Soujanya
WalledEval is a comprehensive AI safety testing toolkit designed to evaluate large language models (LLMs). It accommodates a diverse range of models, including both open-weight and API-based ones, and features over 35 safety benchmarks covering areas
Externí odkaz:
http://arxiv.org/abs/2408.03837