Výsledky vyhledávání

Report

Testing the Limits of Jailbreaking Defenses with the Purple Problem

Autor: Kim, Taeyoun, Kotha, Suhas, Raghunathan, Aditi

The rise of "jailbreak" attacks on language models has led to a flurry of defenses aimed at preventing undesirable responses. We critically examine the two stages of the defense pipeline: (i) defining what constitutes unsafe outputs, and (ii) enforci

Externí odkaz: http://arxiv.org/abs/2403.14725

Zobrazit plný text záznamu

Report

Introducing Adaptive Continuous Adversarial Training (ACAT) to Enhance ML Robustness

Autor: elShehaby, Mohamed, Kotha, Aditya, Matrawy, Ashraf

Adversarial training enhances the robustness of Machine Learning (ML) models against adversarial attacks. However, obtaining labeled training and adversarial training data in network/cybersecurity domains is challenging and costly. Therefore, this le

Externí odkaz: http://arxiv.org/abs/2403.10461

Zobrazit plný text záznamu

Report

A Safe Harbor for AI Evaluation and Red Teaming

Independent evaluation and red teaming are critical for identifying the risks posed by generative AI systems. However, the terms of service and enforcement strategies used by prominent AI companies to deter model misuse have disincentives on good fai

Externí odkaz: http://arxiv.org/abs/2403.04893

Zobrazit plný text záznamu

Report

Repetition Improves Language Model Embeddings

Autor: Springer, Jacob Mitchell, Kotha, Suhas, Fried, Daniel, Neubig, Graham, Raghunathan, Aditi

Recent approaches to improving the extraction of text embeddings from autoregressive large language models (LLMs) have largely focused on improvements to data, backbone pretrained language models, or improving task-differentiation via instructions. I

Externí odkaz: http://arxiv.org/abs/2402.15449

Zobrazit plný text záznamu

Report

Understanding Catastrophic Forgetting in Language Models via Implicit Inference

Autor: Kotha, Suhas, Springer, Jacob Mitchell, Raghunathan, Aditi

We lack a systematic understanding of the effects of fine-tuning (via methods such as instruction-tuning or reinforcement learning from human feedback), particularly on tasks outside the narrow fine-tuning distribution. In a simplified scenario, we d

Externí odkaz: http://arxiv.org/abs/2309.10105

Zobrazit plný text záznamu

Report

ARTEMIS: AI-driven Robotic Triage Labeling and Emergency Medical Information System

Autor: Senthilkumaran, Revanth Krishna, Prashanth, Mridu, Viswanath, Hrishikesh, Kotha, Sathvika, Tiwari, Kshitij, Bera, Aniket

Mass casualty incidents (MCIs) pose a significant challenge to emergency medical services by overwhelming available resources and personnel. Effective victim assessment is the key to minimizing casualties during such a crisis. We introduce ARTEMIS, a

Externí odkaz: http://arxiv.org/abs/2309.08865

Zobrazit plný text záznamu

Report

A Comparative Study of the Perceptual Sensitivity of Topological Visualizations to Feature Variations

Autor: Athawale, Tushar M., Triana, Bryan, Kotha, Tanmay, Pugmire, Dave, Rosen, Paul

Color maps are a commonly used visualization technique in which data are mapped to optical properties, e.g., color or opacity. Color maps, however, do not explicitly convey structures (e.g., positions and scale of features) within data. Topology-base

Externí odkaz: http://arxiv.org/abs/2307.08795

Zobrazit plný text záznamu

Report

Provably Bounding Neural Network Preimages

Autor: Kotha, Suhas, Brix, Christopher, Kolter, Zico, Dvijotham, Krishnamurthy, Zhang, Huan

Most work on the formal verification of neural networks has focused on bounding the set of outputs that correspond to a given set of inputs (for example, bounded perturbations of a nominal input). However, many use cases of neural network verificatio

Externí odkaz: http://arxiv.org/abs/2302.01404

Zobrazit plný text záznamu

Akademický článek

Major Perioperative Cardiac Risk Assessment: A Review for Cardio-Oncologists and Perioperative Physicians

Autor: Emily P. Johnson, Robert Monsour, Osama Hafez, Rohini Kotha, Robert S. Ackerman

Publikováno v: Clinics and Practice, Vol 14, Iss 3, Pp 906-914 (2024)

The Revised Cardiac Risk Index (RCRI) and the American College of Surgeons (ACS) National Surgical Quality Improvement Program (NSQIP) preoperative risk assessment tools are the most widely used methods for quantifying the risk of major negative peri

Externí odkaz: https://doaj.org/article/217c4c59b6f4420285ad7e8b2c173788

Zobrazit plný text záznamu

Plný text ve formátu HTML

Akademický článek

Modelling seismic ground motion and its uncertainty in different tectonic contexts: challenges and application to the 2020 European Seismic Hazard Model (ESHM20)

Autor: G. Weatherill, S. R. Kotha, L. Danciu, S. Vilanova, F. Cotton

Publikováno v: Natural Hazards and Earth System Sciences, Vol 24, Pp 1795-1834 (2024)

Current practice in strong ground motion modelling for probabilistic seismic hazard analysis (PSHA) requires the identification and calibration of empirical models appropriate to the tectonic regimes within the region of application, along with quant

Externí odkaz: https://doaj.org/article/0a3db59c2e0643b98fe1a584db48f1d0

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání