Zobrazeno 1 - 2
of 2
pro vyhledávání: '"Seth, Agastya"'
Autor:
Uddin, Md Nayem, Saeidi, Amir, Handa, Divij, Seth, Agastya, Son, Tran Cao, Blanco, Eduardo, Corman, Steven R., Baral, Chitta
This paper introduces UnSeenTimeQA, a novel data contamination-free time-sensitive question-answering (TSQA) benchmark. It differs from existing TSQA benchmarks by avoiding web-searchable queries grounded in the real-world. We present a series of tim
Externí odkaz:
http://arxiv.org/abs/2407.03525
As Large Language Models (LLMs) play an increasingly pivotal role in natural language processing applications, their safety concerns become critical areas of NLP research. This paper presents Safety and Over-Defensiveness Evaluation (SODE) benchmark:
Externí odkaz:
http://arxiv.org/abs/2401.00287