Výsledky vyhledávání

Report

A Peaceman-Rachford Splitting Approach with Deep Equilibrium Network for Channel Estimation

Autor: Yuan, Dingli, Wu, Shitong, Tang, Haoran, Yang, Lu, Peng, Chenghui

Multiple-input multiple-output (MIMO) is pivotal for wireless systems, yet its high-dimensional, stochastic channel poses significant challenges for accurate estimation, highlighting the critical need for robust estimation techniques. In this paper,

Externí odkaz: http://arxiv.org/abs/2410.23752

Zobrazit plný text záznamu

Report

Can Models Learn Skill Composition from Examples?

Autor: Zhao, Haoyu, Kaur, Simran, Yu, Dingli, Goyal, Anirudh, Arora, Sanjeev

As large language models (LLMs) become increasingly advanced, their ability to exhibit compositional generalization -- the capacity to combine learned skills in novel ways not encountered during training -- has garnered significant attention. This ty

Externí odkaz: http://arxiv.org/abs/2409.19808

Zobrazit plný text záznamu

Report

ConceptMix: A Compositional Image Generation Benchmark with Controllable Difficulty

Autor: Wu, Xindi, Yu, Dingli, Huang, Yangsibo, Russakovsky, Olga, Arora, Sanjeev

Compositionality is a critical capability in Text-to-Image (T2I) models, as it reflects their ability to understand and combine multiple concepts from text descriptions. Existing evaluations of compositional capability rely heavily on human-designed

Externí odkaz: http://arxiv.org/abs/2408.14339

Zobrazit plný text záznamu

Report

AI-Assisted Generation of Difficult Math Questions

Autor: Shah, Vedant, Yu, Dingli, Lyu, Kaifeng, Park, Simon, Yu, Jiatong, He, Yinghui, Ke, Nan Rosemary, Mozer, Michael, Bengio, Yoshua, Arora, Sanjeev, Goyal, Anirudh

Current LLM training positions mathematical reasoning as a core capability. With publicly available sources fully tapped, there is unmet demand for diverse and challenging math questions. Relying solely on human experts is both time-consuming and cos

Externí odkaz: http://arxiv.org/abs/2407.21009

Zobrazit plný text záznamu

Report

Effective Reinforcement Learning Based on Structural Information Principles

Autor: Zeng, Xianghua, Peng, Hao, Su, Dingli, Li, Angsheng

Although Reinforcement Learning (RL) algorithms acquire sequential behavioral patterns through interactions with the environment, their effectiveness in noisy and high-dimensional scenarios typically relies on specific structural priors. In this pape

Externí odkaz: http://arxiv.org/abs/2404.09760

Zobrazit plný text záznamu

Report

Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates

Autor: Lyu, Kaifeng, Zhao, Haoyu, Gu, Xinran, Yu, Dingli, Goyal, Anirudh, Arora, Sanjeev

Public LLMs such as the Llama 2-Chat have driven huge activity in LLM research. These models underwent alignment training and were considered safe. Recently Qi et al. (2023) reported that even benign fine-tuning (e.g., on seemingly safe datasets) can

Externí odkaz: http://arxiv.org/abs/2402.18540

Zobrazit plný text záznamu

Report

On the Coherency of Completed Group Algebra

Autor: Burns, David, Kuang, Yu, Liang, Dingli

We investigate coherency properties of certain completed integral group rings, precisely for compact $p$-adic Lie groups.
Comment: 16 pages. Submitted

Externí odkaz: http://arxiv.org/abs/2401.05506

Zobrazit plný text záznamu

Report

On Non-Noetherian Iwasawa Theory

Autor: Burns, David, Daoud, Alexandre, Liang, Dingli

We prove a general structure theorem for finitely presented torsion modules over a class of commutative rings that need not be Noetherian. As a first application, we then use this result to study the Weil- \'etale cohomology groups of $\mathbb{G}_m$

Externí odkaz: http://arxiv.org/abs/2401.02946

Zobrazit plný text záznamu

Report

Gradient estimate for Fisher-KPP equation on Finsler metric measure spaces

Autor: Shen, Bin, Xia, Dingli

In this manuscript, we study the positive solutions of the Finslerian Fisher-KPP equation $$ u_t=\Delta^{\nabla u} u+cu(1-u). $$ The Fisher-KPP equation is widely applied and connected to many mathematical branches. We establish the global gradient e

Externí odkaz: http://arxiv.org/abs/2403.00002

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání