Showing 1 - 10 of 179
for search: '"Li, Yingzhen"'
Estimating mutual information (MI) is a fundamental yet challenging task in data science and machine learning. This work proposes a new estimator for mutual information. Our main discovery is that a preliminary estimate of the data distribution can d…
External link:
http://arxiv.org/abs/2408.09377
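For context, the "preliminary estimate of the data distribution" idea contrasts with the simplest plug-in baseline: estimate the joint density first (here with a 2-D histogram) and read MI off it directly. A minimal generic sketch of that baseline, not the paper's estimator:

```python
import numpy as np

def mi_histogram(x, y, bins=16):
    """Plug-in MI estimate from a 2-D histogram of paired samples (x_i, y_i):
    estimate the joint distribution, then compute MI from the table."""
    joint, _, _ = np.histogram2d(x, y, bins=bins)
    pxy = joint / joint.sum()                 # joint probability table
    px = pxy.sum(axis=1, keepdims=True)       # marginal of x
    py = pxy.sum(axis=0, keepdims=True)       # marginal of y
    nz = pxy > 0                              # skip empty cells (log 0)
    return float(np.sum(pxy[nz] * np.log(pxy[nz] / (px * py)[nz])))

rng = np.random.default_rng(0)
x = rng.normal(size=50_000)
y = x + 0.5 * rng.normal(size=50_000)   # correlated pair: MI well above 0
z = rng.normal(size=50_000)             # independent pair: MI near 0
```

Such plug-in estimators are known to be biased in high dimensions, which is what motivates the more careful estimators studied in work like the above.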
Causal discovery in time series is a rapidly evolving field with a wide variety of applications in other areas such as climate science and neuroscience. Traditional approaches assume a stationary causal graph, which can be adapted to nonstationary ti…
External link:
http://arxiv.org/abs/2406.17698
The probabilistic diffusion model has become highly effective across various domains. Typically, sampling from a diffusion model involves using a denoising distribution characterized by a Gaussian with a learned mean and either fixed or learned covar…
External link:
http://arxiv.org/abs/2406.10808
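The Gaussian denoising distribution mentioned in the snippet is the standard DDPM-style reverse step: sample x_{t-1} from a Gaussian whose mean comes from the network and whose variance is fixed by the noise schedule. A minimal sketch, assuming a hypothetical noise-prediction function `predict_eps` (not any real model):

```python
import numpy as np

T = 1000
betas = np.linspace(1e-4, 0.02, T)   # fixed noise schedule
alphas = 1.0 - betas
alpha_bars = np.cumprod(alphas)

def denoise_step(x_t, t, predict_eps, rng):
    """One reverse step: x_{t-1} ~ N(mu_theta(x_t, t), sigma_t^2 I)."""
    eps_hat = predict_eps(x_t, t)    # network's estimate of the added noise
    # learned mean of the Gaussian denoising distribution
    mean = (x_t - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps_hat) / np.sqrt(alphas[t])
    if t == 0:
        return mean                  # final step is taken deterministically
    sigma = np.sqrt(betas[t])        # fixed (not learned) covariance
    return mean + sigma * rng.normal(size=x_t.shape)
```

Using a learned covariance instead would replace the fixed `sigma` with a second network output, which is one of the design choices the snippet refers to.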
Author:
Yoon, Hee Suk, Yoon, Eunseop, Tee, Joshua Tian Jin, Hasegawa-Johnson, Mark, Li, Yingzhen, Yoo, Chang D.
In deep learning, test-time adaptation has gained attention as a method for model fine-tuning without the need for labeled data. A prime exemplification is the recently proposed test-time prompt tuning for large-scale vision-language models such as C…
External link:
http://arxiv.org/abs/2403.14119
Author:
Manduchi, Laura, Pandey, Kushagra, Bamler, Robert, Cotterell, Ryan, Däubener, Sina, Fellenz, Sophie, Fischer, Asja, Gärtner, Thomas, Kirchler, Matthias, Kloft, Marius, Li, Yingzhen, Lippert, Christoph, de Melo, Gerard, Nalisnick, Eric, Ommer, Björn, Ranganath, Rajesh, Rudolph, Maja, Ullrich, Karen, Broeck, Guy Van den, Vogt, Julia E, Wang, Yixin, Wenzel, Florian, Wood, Frank, Mandt, Stephan, Fortuin, Vincent
The field of deep generative modeling has grown rapidly and consistently over the years. With the availability of massive amounts of training data coupled with advances in scalable unsupervised learning paradigms, recent large-scale generative models…
External link:
http://arxiv.org/abs/2403.00025
Author:
Papamarkou, Theodore, Skoularidou, Maria, Palla, Konstantina, Aitchison, Laurence, Arbel, Julyan, Dunson, David, Filippone, Maurizio, Fortuin, Vincent, Hennig, Philipp, Hernández-Lobato, José Miguel, Hubin, Aliaksandr, Immer, Alexander, Karaletsos, Theofanis, Khan, Mohammad Emtiyaz, Kristiadi, Agustinus, Li, Yingzhen, Mandt, Stephan, Nemeth, Christopher, Osborne, Michael A., Rudner, Tim G. J., Rügamer, David, Teh, Yee Whye, Welling, Max, Wilson, Andrew Gordon, Zhang, Ruqi
In the current landscape of deep learning research, there is a predominant emphasis on achieving high predictive accuracy in supervised tasks involving large image and language datasets. However, a broader perspective reveals a multitude of overlooke…
External link:
http://arxiv.org/abs/2402.00809
Training energy-based models (EBMs) on discrete spaces is challenging because sampling over such spaces can be difficult. We propose to train discrete EBMs with energy discrepancy (ED), a novel type of contrastive loss functional which only requires…
External link:
http://arxiv.org/abs/2307.07595
Author:
Schröder, Tobias, Ou, Zijing, Lim, Jen Ning, Li, Yingzhen, Vollmer, Sebastian J., Duncan, Andrew B.
Energy-based models are a simple yet powerful class of probabilistic models, but their widespread adoption has been limited by the computational burden of training them. We propose a novel loss function called Energy Discrepancy (ED) which does not r…
External link:
http://arxiv.org/abs/2307.06431
The identifiability of latent variable models has received increasing attention due to its relevance in interpretability and out-of-distribution generalisation. In this work, we study the identifiability of Switching Dynamical Systems, taking an init…
External link:
http://arxiv.org/abs/2305.15925
Author:
Yoon, Hee Suk, Tee, Joshua Tian Jin, Yoon, Eunseop, Yoon, Sunjae, Kim, Gwangsu, Li, Yingzhen, Yoo, Chang D.
Studies have shown that modern neural networks tend to be poorly calibrated due to over-confident predictions. Traditionally, post-processing methods have been used to calibrate the model after training. In recent years, various trainable calibration…
External link:
http://arxiv.org/abs/2303.02472
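The "post-processing methods" the snippet mentions are exemplified by temperature scaling: after training, fit a single scalar T on held-out logits so that softmax(logits / T) minimises validation NLL. A minimal dependency-free sketch of that baseline (grid search instead of a gradient optimiser), not the paper's trainable method:

```python
import numpy as np

def softmax(z):
    z = z - z.max(axis=1, keepdims=True)   # stabilise the exponentials
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def nll(logits, labels, T):
    """Negative log-likelihood of temperature-scaled predictions."""
    p = softmax(logits / T)
    return -np.mean(np.log(p[np.arange(len(labels)), labels] + 1e-12))

def fit_temperature(logits, labels, grid=np.linspace(0.5, 5.0, 91)):
    """Post-hoc calibration: pick the scalar T minimising validation NLL.
    T > 1 softens over-confident predictions without changing accuracy."""
    return float(min(grid, key=lambda T: nll(logits, labels, T)))
```

Because dividing logits by a positive scalar preserves their ranking, this recalibration never changes which class is predicted, only how confident the model claims to be.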