Showing 1 - 10 of 1,569 for search: '"Zhao, Chenyang"'
Author:
Chen, Weize, You, Ziming, Li, Ran, Guan, Yitong, Qian, Chen, Zhao, Chenyang, Yang, Cheng, Xie, Ruobing, Liu, Zhiyuan, Sun, Maosong
The rapid advancement of large language models (LLMs) has paved the way for the development of highly capable autonomous agents. However, existing multi-agent frameworks often struggle with integrating diverse, capable third-party agents due to reliance…
External link:
http://arxiv.org/abs/2407.07061
Author:
Hu, Shengding, Tu, Yuge, Han, Xu, He, Chaoqun, Cui, Ganqu, Long, Xiang, Zheng, Zhi, Fang, Yewei, Huang, Yuxiang, Zhao, Weilin, Zhang, Xinrong, Thai, Zheng Leng, Zhang, Kaihuo, Wang, Chongyi, Yao, Yuan, Zhao, Chenyang, Zhou, Jie, Cai, Jie, Zhai, Zhongwu, Ding, Ning, Jia, Chao, Zeng, Guoyang, Li, Dahai, Liu, Zhiyuan, Sun, Maosong
The burgeoning interest in developing Large Language Models (LLMs) with up to trillion parameters has been met with concerns regarding resource efficiency and practical expense, particularly given the immense cost of experimentation. This scenario un…
External link:
http://arxiv.org/abs/2404.06395
Author:
Chen, Xiaoqing, Zhang, Yanyan, Ji, Yingke, Zhang, Yu, Wang, Jianguo, Wu, Xianghu, Zhao, Chenyang, Fang, Liang, Jiang, Biqiang, Zhao, Jianlin, Gan, Xuetao
We demonstrate the post-induction of a high-quality microcavity on a silicon photonic crystal (PC) waveguide by integrating few-layer GaSe crystal, which promises highly efficient on-chip optical frequency conversions. The integration of GaSe shifts the…
External link:
http://arxiv.org/abs/2403.01434
Punishment is a common tactic to sustain cooperation and has been studied extensively for a long time. While most previous game-theoretic work adopts imitation learning, where players imitate the strategies of those who are better off, the learning logic…
External link:
http://arxiv.org/abs/2401.16073
Author:
Luo, Yin, Kong, Qingchao, Xu, Nan, Cao, Jia, Hao, Bao, Qu, Baoyu, Chen, Bo, Zhu, Chao, Zhao, Chenyang, Zhang, Donglei, Feng, Fan, Zhao, Feifei, Sun, Hailong, Yang, Hanxuan, Pan, Haojun, Liu, Hongyu, Guo, Jianbin, Du, Jiangtao, Wang, Jingyi, Li, Junfeng, Sun, Lei, Liu, Liduo, Dong, Lifeng, Liu, Lili, Wang, Lin, Zhang, Liwen, Wang, Minzheng, Wang, Pin, Yu, Ping, Li, Qingxiao, Yan, Rui, Zou, Rui, Li, Ruiqun, Huang, Taiwen, Wang, Xiaodong, Wu, Xiaofei, Peng, Xin, Zhang, Xina, Fang, Xing, Xiao, Xinglin, Hao, Yanni, Dong, Yao, Wang, Yigang, Liu, Ying, Jiang, Yongyu, Wang, Yungan, Wang, Yuqi, Wang, Zhangsheng, Yu, Zhaoxin, Luo, Zhen, Mao, Wenji, Wang, Lei, Zeng, Dajun
As the latest advancements in natural language processing, large language models (LLMs) have achieved human-level language understanding and generation abilities in many real-world tasks, and have even been regarded as a potential path to artificial…
External link:
http://arxiv.org/abs/2312.14862
Recent studies have uncovered the potential of Large Language Models (LLMs) in addressing complex sequential decision-making tasks through the provision of high-level instructions. However, LLM-based agents lack specialization in tackling specific tasks…
External link:
http://arxiv.org/abs/2311.13373
Author:
He, Nan, Lai, Hanyu, Zhao, Chenyang, Cheng, Zirui, Pan, Junting, Qin, Ruoyu, Lu, Ruofan, Lu, Rui, Zhang, Yunchen, Zhao, Gangming, Hou, Zhaohui, Huang, Zhiyuan, Lu, Shaoqing, Liang, Ding, Zhan, Mingjie
Large Language Models (LLMs) exhibit impressive reasoning and data augmentation capabilities in various NLP tasks. However, what about small models? In this work, we propose TeacherLM-7.1B, capable of annotating relevant fundamentals, chain of thought…
External link:
http://arxiv.org/abs/2310.19019
Large language models (LLMs) enable system builders today to create competent NLP systems through prompting, where they only need to describe the task in natural language and provide a few examples. However, in other ways, LLMs are a step backward from…
External link:
http://arxiv.org/abs/2308.12261
Large language models (LLMs) encode a vast amount of world knowledge acquired from massive text datasets. Recent studies have demonstrated that LLMs can assist an embodied agent in solving complex sequential decision-making tasks by providing high-level…
External link:
http://arxiv.org/abs/2306.03604
Policy learning (PL) is a module of a task-oriented dialogue system that trains an agent to take actions at each dialogue turn. Imitating human action is a fundamental problem of PL. However, both supervised learning (SL) and reinforcement learning (RL)…
External link:
http://arxiv.org/abs/2305.03987