Showing 1 - 10 of 7,843
for search: '"SONG Lei"'
Multi-objective reinforcement learning (MORL) excels at handling rapidly changing preferences in tasks that involve multiple criteria, even for unseen preferences. However, previous dominating MORL methods typically generate a fixed policy set or pre…
External link: http://arxiv.org/abs/2410.02236
Recently, the pre-training of decision transformers (DT) using a different domain, such as natural language text, has generated significant attention in offline reinforcement learning (Offline RL). Although this cross-domain pre-training approach ach…
External link: http://arxiv.org/abs/2409.06985
Author:
Khallaghi, Sam, Abedi, Rahebe, Ali, Hanan Abou, Alemohammad, Hamed, Asipunu, Mary Dziedzorm, Alatise, Ismail, Ha, Nguyen, Luo, Boka, Mai, Cat, Song, Lei, Wussah, Amos, Xiong, Sitian, Yao, Yao-Ting, Zhang, Qi, Estes, Lyndon D.
The accuracy of mapping agricultural fields across large areas is steadily improving with high-resolution satellite imagery and deep learning (DL) models, even in regions where fields are small and geometrically irregular. However, developing effecti…
External link: http://arxiv.org/abs/2408.06467
Author:
Choi, Yunseon, Bae, Sangmin, Ban, Seonghyun, Jeong, Minchan, Zhang, Chuheng, Song, Lei, Zhao, Li, Bian, Jiang, Kim, Kee-Eung
With the advent of foundation models, prompt tuning has positioned itself as an important technique for directing model behaviors and eliciting desired responses. Prompt tuning involves selecting appropriate keywords to include in the input, thereby a…
External link: http://arxiv.org/abs/2407.14733
Retrieval augmented generation has revolutionized large language model (LLM) outputs by providing factual support. Nevertheless, it struggles to capture all the necessary knowledge for complex reasoning questions. Existing retrieval methods typicall…
External link: http://arxiv.org/abs/2406.06572
Recent advancements in solving large-scale traveling salesman problems (TSP) utilize the heatmap-guided Monte Carlo tree search (MCTS) paradigm, where machine learning (ML) models generate heatmaps, indicating the probability distribution of each edg…
External link: http://arxiv.org/abs/2406.03503
Author:
Liu, Zhihao, Yang, Xianliang, Liu, Zichuan, Xia, Yifan, Jiang, Wei, Zhang, Yuanyu, Li, Lijuan, Fan, Guoliang, Song, Lei, Bian, Jiang
Multi-agent reinforcement learning (MARL) is employed to develop autonomous agents that can learn to adopt cooperative or competitive strategies within complex environments. However, the linear increase in the number of agents leads to a combinatoria…
External link: http://arxiv.org/abs/2405.16854
Author:
Liu, Zichuan, Wang, Tianchun, Shi, Jimeng, Zheng, Xu, Chen, Zhuomin, Song, Lei, Dong, Wenqian, Obeysekera, Jayantha, Shirani, Farhad, Luo, Dongsheng
Explaining deep learning models operating on time series data is crucial in various applications that require interpretable and transparent insights from time series signals. In this work, we investigate this problem from an information…
External link: http://arxiv.org/abs/2405.09308
Author:
Liu, Zichuan, Wang, Zefan, Xu, Linjie, Wang, Jinyu, Song, Lei, Wang, Tianchun, Chen, Chunlin, Cheng, Wei, Bian, Jiang
The advent of large language models (LLMs) has revolutionized the field of natural language processing, yet they can be attacked into producing harmful content. Despite efforts to ethically align LLMs, these alignments are often fragile and can be circumvented by…
External link: http://arxiv.org/abs/2404.13968
Author:
Xu, Linjie, Liu, Zichuan, Dockhorn, Alexander, Perez-Liebana, Diego, Wang, Jinyu, Song, Lei, Bian, Jiang
One of the notorious issues for Reinforcement Learning (RL) is poor sample efficiency. Compared to single-agent RL, sample efficiency for Multi-Agent Reinforcement Learning (MARL) is more challenging because of its inherent partial observability,…
External link: http://arxiv.org/abs/2404.09715