Výsledky vyhledávání - "Agrawal, Srishti Shekhar"

Report

Value Imprint: A Technique for Auditing the Human Values Embedded in RLHF Datasets

Autor: Obi, Ike, Pant, Rohan, Agrawal, Srishti Shekhar, Ghazanfar, Maham, Basiletti, Aaron

LLMs are increasingly fine-tuned using RLHF datasets to align them with human preferences and values. However, very limited research has investigated which specific human values are operationalized through these datasets. In this paper, we introduce

Externí odkaz: http://arxiv.org/abs/2411.11937

Zobrazit plný text záznamu

Conference

Understanding the Experiences, Challenges, and Needs of Dementia Caregivers in the Indian Subcontinent.

Autor: Agrawal, Srishti Shekhar, Panchal, Shrey, He, Liang

Publikováno v: ACM SIGACCESS Conference on Computers & Accessibility; 2023, p1-5, 5p

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání