Výsledky vyhledávání

Report

Anytime Acceleration of Gradient Descent

Autor: Zhang, Zihan, Lee, Jason D., Du, Simon S., Chen, Yuxin

This work investigates stepsize-based acceleration of gradient descent with {\em anytime} convergence guarantees. For smooth (non-strongly) convex optimization, we propose a stepsize schedule that allows gradient descent to achieve convergence guaran

Externí odkaz: http://arxiv.org/abs/2411.17668

Zobrazit plný text záznamu

Report

Learning to Cooperate with Humans using Generative Agents

Autor: Liang, Yancheng, Chen, Daphne, Gupta, Abhishek, Du, Simon S., Jaques, Natasha

Training agents that can coordinate zero-shot with humans is a key mission in multi-agent reinforcement learning (MARL). Current algorithms focus on training simulated human partner policies which are then used to train a Cooperator agent. The simula

Externí odkaz: http://arxiv.org/abs/2411.13934

Zobrazit plný text záznamu

Report

MIXAD: Memory-Induced Explainable Time Series Anomaly Detection

Autor: Kim, Minha, Bhaumik, Kishor Kumar, Ali, Amin Ahsan, Woo, Simon S.

For modern industrial applications, accurately detecting and diagnosing anomalies in multivariate time series data is essential. Despite such need, most state-of-the-art methods often prioritize detection performance over model interpretability. Addr

Externí odkaz: http://arxiv.org/abs/2410.22735

Zobrazit plný text záznamu

Report

SSMT: Few-Shot Traffic Forecasting with Single Source Meta-Transfer

Autor: Bhaumik, Kishor Kumar, Kim, Minha, Niloy, Fahim Faisal, Ali, Amin Ahsan, Woo, Simon S.

Traffic forecasting in Intelligent Transportation Systems (ITS) is vital for intelligent traffic prediction. Yet, ITS often relies on data from traffic sensors or vehicle devices, where certain cities might not have all those smart devices or enablin

Externí odkaz: http://arxiv.org/abs/2410.15589

Zobrazit plný text záznamu

Report

LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond

Autor: Islam, Md Tanvir, Alam, Inzamamul, Woo, Simon S., Anwar, Saeed, Lee, IK Hyun, Muhammad, Khan

Low-light image enhancement (LLIE) is essential for numerous computer vision tasks, including object detection, tracking, segmentation, and scene understanding. Despite substantial research on improving low-quality images captured in underexposed con

Externí odkaz: http://arxiv.org/abs/2410.09831

Zobrazit plný text záznamu

Report

Preserving Old Memories in Vivid Detail: Human-Interactive Photo Restoration Framework

Autor: Back, Seung-Yeon, Son, Geonho, Jeong, Dahye, Park, Eunil, Woo, Simon S.

Photo restoration technology enables preserving visual memories in photographs. However, physical prints are vulnerable to various forms of deterioration, ranging from physical damage to loss of image quality, etc. While restoration by human experts

Externí odkaz: http://arxiv.org/abs/2410.09529

Zobrazit plný text záznamu

Report

The Crucial Role of Samplers in Online Direct Preference Optimization

Autor: Shi, Ruizhe, Zhou, Runlong, Du, Simon S.

Direct Preference Optimization (DPO) has emerged as a stable, scalable, and efficient solution for language model alignment. Despite its empirical success, the $\textit{optimization}$ properties, particularly the impact of samplers on its convergence

Externí odkaz: http://arxiv.org/abs/2409.19605

Zobrazit plný text záznamu

Report

UGAD: Universal Generative AI Detector utilizing Frequency Fingerprints

Autor: Alam, Inzamamul, Muneer, Muhammad Shahid, Woo, Simon S.

In the wake of a fabricated explosion image at the Pentagon, an ability to discern real images from fake counterparts has never been more critical. Our study introduces a novel multi-modal approach to detect AI-generated images amidst the proliferati

Externí odkaz: http://arxiv.org/abs/2409.07913

Zobrazit plný text záznamu

Report

GDFlow: Anomaly Detection with NCDE-based Normalizing Flow for Advanced Driver Assistance System

Autor: Lee, Kangjun, Kim, Minha, Jun, Youngho, Woo, Simon S.

For electric vehicles, the Adaptive Cruise Control (ACC) in Advanced Driver Assistance Systems (ADAS) is designed to assist braking based on driving conditions, road inclines, predefined deceleration strengths, and user braking patterns. However, the

Externí odkaz: http://arxiv.org/abs/2409.05346

Zobrazit plný text záznamu

Report

Multi-Agent Reinforcement Learning from Human Feedback: Data Coverage and Algorithmic Techniques

Autor: Zhang, Natalia, Wang, Xinqi, Cui, Qiwen, Zhou, Runlong, Kakade, Sham M., Du, Simon S.

We initiate the study of Multi-Agent Reinforcement Learning from Human Feedback (MARLHF), exploring both theoretical foundations and empirical validations. We define the task as identifying Nash equilibrium from a preference-only offline dataset in g

Externí odkaz: http://arxiv.org/abs/2409.00717

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání