Search results - "Yang, Wenhao"

Report

Robust Channel Learning for Large-Scale Radio Speaker Verification

Authors: Yang, Wenhao; Wei, Jianguo; Lu, Wenhuan; Li, Lei; Lu, Xugang

Recent research in speaker verification has increasingly focused on achieving robust and reliable recognition under challenging channel conditions and noisy environments. Identifying speakers in radio communications is particularly difficult due to i

External link: http://arxiv.org/abs/2406.10956


Report

Projection-Free Variance Reduction Methods for Stochastic Constrained Multi-Level Compositional Optimization

Authors: Jiang, Wei; Yang, Sifan; Yang, Wenhao; Wang, Yibo; Wan, Yuanyu; Zhang, Lijun

This paper investigates projection-free algorithms for stochastic constrained multi-level optimization. In this context, the objective function is a nested composition of several smooth functions, and the decision set is closed and convex. Existing p

External link: http://arxiv.org/abs/2406.03787


Report

Efficient Sign-Based Optimization: Accelerating Convergence via Variance Reduction

Authors: Jiang, Wei; Yang, Sifan; Yang, Wenhao; Zhang, Lijun

Sign stochastic gradient descent (signSGD) is a communication-efficient method that transmits only the sign of stochastic gradients for parameter updating. Existing literature has demonstrated that signSGD can achieve a convergence rate of $\mathcal{

External link: http://arxiv.org/abs/2406.00489


Report

Universal Online Convex Optimization with $1$ Projection per Round

Authors: Yang, Wenhao; Wang, Yibo; Zhao, Peng; Zhang, Lijun

To address the uncertainty in function types, recent progress in online convex optimization (OCO) has spurred the development of universal algorithms that simultaneously attain minimax rates for multiple types of convex functions. However, for a $T$-

External link: http://arxiv.org/abs/2405.19705


Report

PyRadar: Towards Automatically Retrieving and Validating Source Code Repository Information for PyPI Packages

Authors: Gao, Kai; Xu, Weiwei; Yang, Wenhao; Zhou, Minghui

A package's source code repository records the development history of the package, providing indispensable information for the use and risk monitoring of the package. However, a package release often misses its source code repository due to the separ

External link: http://arxiv.org/abs/2404.16565


Report

Estimation and Inference in Distributional Reinforcement Learning

Authors: Zhang, Liangyu; Peng, Yang; Liang, Jiadong; Yang, Wenhao; Zhang, Zhihua

In this paper, we study distributional reinforcement learning from the perspective of statistical efficiency. We investigate distributional policy evaluation, aiming to estimate the complete distribution of the random return (denoted $\eta^\pi$) atta

External link: http://arxiv.org/abs/2309.17262


Report

Liouville theorems for ancient solutions of subexponential growth to the heat equation on graphs

Authors: Hua, Bobo; Yang, Wenhao

Mosconi proved Liouville theorems for ancient solutions of subexponential growth to the heat equation on a manifold with Ricci curvature bounded below. We extend these results to graphs with bounded geometry: for a graph with bounded geometry, any no

External link: http://arxiv.org/abs/2309.17250


Report

Regularization and Variance-Weighted Regression Achieves Minimax Optimality in Linear MDPs: Theory and Practice

Authors: Kitamura, Toshinori; Kozuno, Tadashi; Tang, Yunhao; Vieillard, Nino; Valko, Michal; Yang, Wenhao; Mei, Jincheng; Ménard, Pierre; Azar, Mohammad Gheshlaghi; Munos, Rémi; Pietquin, Olivier; Geist, Matthieu; Szepesvári, Csaba; Kumagai, Wataru; Matsuo, Yutaka

Mirror descent value iteration (MDVI), an abstraction of Kullback-Leibler (KL) and entropy-regularized reinforcement learning (RL), has served as the basis for recent high-performing practical RL algorithms. However, despite the use of function appro

External link: http://arxiv.org/abs/2305.13185


Report

Non-stationary Projection-free Online Learning with Dynamic and Adaptive Regret Guarantees

Authors: Wang, Yibo; Yang, Wenhao; Jiang, Wei; Lu, Shiyin; Wang, Bing; Tang, Haihong; Wan, Yuanyu; Zhang, Lijun

Projection-free online learning has drawn increasing interest due to its efficiency in solving high-dimensional problems with complicated constraints. However, most existing projection-free online methods focus on minimizing the static regret, which

External link: http://arxiv.org/abs/2305.11726


Report

Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement Learning

Authors: Zhang, Liangyu; Peng, Yang; Yang, Wenhao; Zhang, Zhihua

We propose a novel generalization of constrained Markov decision processes (CMDPs) that we call the \emph{semi-infinitely constrained Markov decision process} (SICMDP). Particularly, we consider a continuum of constraints instead of a finite number o

External link: http://arxiv.org/abs/2305.00254

