Výsledky vyhledávání - "Zahraei, Pardis Sadat"

Report

Detecting Bias and Enhancing Diagnostic Accuracy in Large Language Models for Healthcare

Autor: Zahraei, Pardis Sadat, Shakeri, Zahra

Biased AI-generated medical advice and misdiagnoses can jeopardize patient safety, making the integrity of AI in healthcare more critical than ever. As Large Language Models (LLMs) take on a growing role in medical decision-making, addressing their b

Externí odkaz: http://arxiv.org/abs/2410.06566

Zobrazit plný text záznamu

Report

TuringQ: Benchmarking AI Comprehension in Theory of Computation

Autor: Zahraei, Pardis Sadat, Asgari, Ehsaneddin

We present TuringQ, the first benchmark designed to evaluate the reasoning capabilities of large language models (LLMs) in the theory of computation. TuringQ consists of 4,006 undergraduate and graduate-level question-answer pairs, categorized into f

Externí odkaz: http://arxiv.org/abs/2410.06547

Zobrazit plný text záznamu

Report

WSC+: Enhancing The Winograd Schema Challenge Using Tree-of-Experts

Autor: Zahraei, Pardis Sadat, Emami, Ali

The Winograd Schema Challenge (WSC) serves as a prominent benchmark for evaluating machine understanding. While Large Language Models (LLMs) excel at answering WSC questions, their ability to generate such questions remains less explored. In this wor

Externí odkaz: http://arxiv.org/abs/2401.17703

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání