Showing 1 - 1 of 1 for search: '"Harper, Jackson Bailey"'
We introduce a novel and extensible benchmark for large language models (LLMs) through grid-based games such as Tic-Tac-Toe, Connect Four, and Gomoku. The open-source game simulation code, available on GitHub, allows LLMs to compete and generates det
External link:
http://arxiv.org/abs/2407.07796