Showing 1 - 10 of 1,147 for search: '"Bianchi, Federico"'
Author:
Suzgun, Mirac, Gur, Tayfun, Bianchi, Federico, Ho, Daniel E., Icard, Thomas, Jurafsky, Dan, Zou, James
As language models (LMs) become integral to fields like healthcare, law, and journalism, their ability to differentiate between fact, belief, and knowledge is essential for reliable decision-making. Failure to grasp these distinctions can lead to significant…
External link:
http://arxiv.org/abs/2410.21195
Author:
Doumbouya, Moussa Koulako Bala, Nandi, Ananjan, Poesia, Gabriel, Ghilardi, Davide, Goldie, Anna, Bianchi, Federico, Jurafsky, Dan, Manning, Christopher D.
The safety of Large Language Models (LLMs) remains a critical concern due to a lack of adequate benchmarks for systematically evaluating their ability to resist generating harmful content. Previous efforts towards automated red teaming involve static…
External link:
http://arxiv.org/abs/2408.04811
Author:
Yuksekgonul, Mert, Bianchi, Federico, Boen, Joseph, Liu, Sheng, Huang, Zhi, Guestrin, Carlos, Zou, James
AI is undergoing a paradigm shift, with breakthroughs achieved by systems orchestrating multiple large language models (LLMs) and other complex components. As a result, developing principled and automated optimization methods for compound AI systems…
External link:
http://arxiv.org/abs/2406.07496
Author:
Bianchi, Federico, Zou, James
The risks derived from large language models (LLMs) generating deceptive and damaging content have been the subject of considerable research, but even safe generations can lead to problematic downstream impacts. In our study, we shift the focus to how…
External link:
http://arxiv.org/abs/2402.13926
Author:
Bianchi, Federico, Chia, Patrick John, Yuksekgonul, Mert, Tagliabue, Jacopo, Jurafsky, Dan, Zou, James
Negotiation is the basis of social interactions; humans negotiate everything from the price of cars to how to share common resources. With rapidly growing interest in using large language models (LLMs) to act as agents on behalf of human users, such…
External link:
http://arxiv.org/abs/2402.05863
The rapid and massive diffusion of electric vehicles poses new challenges to the electric system, which must be able to supply these new loads, but at the same time opens up new opportunities thanks to the possible provision of ancillary services. In…
External link:
http://arxiv.org/abs/2309.11118
Author:
Bianchi, Federico, Suzgun, Mirac, Attanasio, Giuseppe, Röttger, Paul, Jurafsky, Dan, Hashimoto, Tatsunori, Zou, James
Training large language models to follow instructions makes them perform better on a wide range of tasks and generally become more helpful. However, a perfectly helpful model will follow even the most malicious instructions and readily generate harmful…
External link:
http://arxiv.org/abs/2309.07875
Author:
Röttger, Paul, Kirk, Hannah Rose, Vidgen, Bertie, Attanasio, Giuseppe, Bianchi, Federico, Hovy, Dirk
Without proper safeguards, large language models will readily follow malicious instructions and generate toxic content. This risk motivates safety efforts such as red-teaming and large-scale feedback learning, which aim to make models both helpful and…
External link:
http://arxiv.org/abs/2308.01263
Author:
Chia, Patrick John, Attanasio, Giuseppe, Tagliabue, Jacopo, Bianchi, Federico, Greco, Ciro, Moreira, Gabriel de Souza P., Eynard, Davide, Husain, Fahd
Recommender Systems today are still mostly evaluated in terms of accuracy, with other aspects beyond the immediate relevance of recommendations, such as diversity, long-term user retention and fairness, often taking a back seat. Moreover, reconciling…
External link:
http://arxiv.org/abs/2304.10621
Author:
Bianchi, Federico, Chia, Patrick John, Greco, Ciro, Pomo, Claudio, Moreira, Gabriel, Eynard, Davide, Husain, Fahd, Tagliabue, Jacopo
EvalRS aims to bring together practitioners from industry and academia to foster a debate on rounded evaluation of recommender systems, with a focus on real-world impact across a multitude of deployment scenarios. Recommender systems are often evaluated…
External link:
http://arxiv.org/abs/2304.07145