Výsledky vyhledávání

Report

Enhancing Diversity in Bayesian Deep Learning via Hyperspherical Energy Minimization of CKA

Autor: Smerkous, David, Bai, Qinxun, Li, Fuxin

Particle-based Bayesian deep learning often requires a similarity metric to compare two networks. However, naive similarity metrics lack permutation invariance and are inappropriate for comparing networks. Centered Kernel Alignment (CKA) on feature k

Externí odkaz: http://arxiv.org/abs/2411.00259

Zobrazit plný text záznamu

Report

Large Legislative Models: Towards Efficient AI Policymaking in Economic Simulations

Autor: Gasztowtt, Henry, Smith, Benjamin, Zhu, Vincent, Bai, Qinxun, Zhang, Edwin

The improvement of economic policymaking presents an opportunity for broad societal benefit, a notion that has inspired research towards AI-driven policymaking tools. AI policymaking holds the potential to surpass human performance through the abilit

Externí odkaz: http://arxiv.org/abs/2410.08345

Zobrazit plný text záznamu

Report

Offline Reinforcement Learning with Closed-Form Policy Improvement Operators

Autor: Li, Jiachen, Zhang, Edwin, Yin, Ming, Bai, Qinxun, Wang, Yu-Xiang, Wang, William Yang

Behavior constrained policy optimization has been demonstrated to be a successful paradigm for tackling Offline Reinforcement Learning. By exploiting historical transitions, a policy is trained to maximize a learned value function while constrained b

Externí odkaz: http://arxiv.org/abs/2211.15956

Zobrazit plný text záznamu

Report

A Geometric Understanding of Natural Gradient

Autor: Bai, Qinxun, Rosenberg, Steven, Xu, Wei

While natural gradients have been widely studied from both theoretical and empirical perspectives, we argue that some fundamental theoretical issues regarding the existence of gradients in infinite dimensional function spaces remain underexplored. We

Externí odkaz: http://arxiv.org/abs/2202.06232

Zobrazit plný text záznamu

Report

Off-policy Reinforcement Learning with Optimistic Exploration and Distribution Correction

Autor: Li, Jiachen, Cheng, Shuo, Liao, Zhenyu, Wang, Huayan, Wang, William Yang, Bai, Qinxun

Improving the sample efficiency of reinforcement learning algorithms requires effective exploration. Following the principle of $\textit{optimism in the face of uncertainty}$ (OFU), we train a separate exploration policy to maximize the approximate u

Externí odkaz: http://arxiv.org/abs/2110.12081

Zobrazit plný text záznamu

Report

Generative Particle Variational Inference via Estimation of Functional Gradients

Autor: Ratzlaff, Neale, Bai, Qinxun, Fuxin, Li, Xu, Wei

Recently, particle-based variational inference (ParVI) methods have gained interest because they can avoid arbitrary parametric assumptions that are common in variational inference. However, many ParVI approaches do not allow arbitrary sampling from

Externí odkaz: http://arxiv.org/abs/2103.01291

Zobrazit plný text záznamu

Report

Siamese Natural Language Tracker: Tracking by Natural Language Descriptions with Siamese Trackers

Autor: Feng, Qi, Ablavsky, Vitaly, Bai, Qinxun, Sclaroff, Stan

We propose a novel Siamese Natural Language Tracker (SNLT), which brings the advancements in visual tracking to the tracking by natural language (NL) descriptions task. The proposed SNLT is applicable to a wide range of Siamese trackers, providing a

Externí odkaz: http://arxiv.org/abs/1912.02048

Zobrazit plný text záznamu

Report

Implicit Generative Modeling for Efficient Exploration

Autor: Ratzlaff, Neale, Bai, Qinxun, Fuxin, Li, Xu, Wei

Efficient exploration remains a challenging problem in reinforcement learning, especially for those tasks where rewards from environments are sparse. A commonly used approach for exploring such environments is to introduce some "intrinsic" reward. In

Externí odkaz: http://arxiv.org/abs/1911.08017

Zobrazit plný text záznamu

Report

Real-time Visual Object Tracking with Natural Language Description

Autor: Feng, Qi, Ablavsky, Vitaly, Bai, Qinxun, Li, Guorong, Sclaroff, Stan

In recent years, deep-learning-based visual object trackers have been studied thoroughly, but handling occlusions and/or rapid motion of the target remains challenging. In this work, we argue that conditioning on the natural language (NL) description

Externí odkaz: http://arxiv.org/abs/1907.11751

Zobrazit plný text záznamu

Dissertation/ Thesis

The differential geometric structure in supervised learning of classifiers

Autor: Bai, Qinxun

In this thesis, we study the overfitting problem in supervised learning of classifiers from a geometric perspective. As with many inverse problems, learning a classification function from a given set of example-label pairs is an ill-posed problem, i.

Externí odkaz: https://hdl.handle.net/2144/22449

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání