Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Schloss, Benjamin J"'
Recent work has shown the promise of learning with human feedback paradigms to produce human-determined high-quality text. Existing works use human feedback to train large language models (LLMs) in general domain abstractive summarization and have ob
Externí odkaz:
http://arxiv.org/abs/2310.05857