Výsledky vyhledávání - "Zemour, Eliott"

Report

PrimeGuard: Safe and Helpful LLMs through Tuning-Free Routing

Autor: Manczak, Blazej, Zemour, Eliott, Lin, Eric, Mugunthan, Vaikkunth

Deploying language models (LMs) necessitates outputs to be both high-quality and compliant with safety guidelines. Although Inference-Time Guardrails (ITG) offer solutions that shift model output distributions towards compliance, we find that current

Externí odkaz: http://arxiv.org/abs/2407.16318

Zobrazit plný text záznamu

Report

Does fine-tuning GPT-3 with the OpenAI API leak personally-identifiable information?

Autor: Sun, Albert Yu, Zemour, Eliott, Saxena, Arushi, Vaidyanathan, Udith, Lin, Eric, Lau, Christian, Mugunthan, Vaikkunth

Machine learning practitioners often fine-tune generative pre-trained models like GPT-3 to improve model performance at specific tasks. Previous works, however, suggest that fine-tuned machine learning models memorize and emit sensitive information f

Externí odkaz: http://arxiv.org/abs/2307.16382

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání