Search results - "ZHANG Youzhi"

Academic article

Distributed optical fiber measurement of floor heave evolution in mining roadway

Authors: CHAI Jing, HAN Zhicheng, LEI Wulin, ZHANG Dingding, MA Chenyang, SUN Kai, WENG Mingyue, ZHANG Youzhi, DING Guoli, ZHENG Zhongyou, ZHANG Yin, HAN Gang

Published in: Meitan kexue jishu, Vol 51, Iss 1, Pp 146-156 (2023)

With the deepening of coal mining and the upsizing of mining equipment, the floor heave of the mining roadway has become an important problem that restricts the efficient and safe mining of the working face. It is of great significance to reveal the

External link: https://doaj.org/article/6536aa1dda6b455b87bfe101808fb466

Report

Computing Ex Ante Equilibrium in Heterogeneous Zero-Sum Team Games

Authors: Liu, Naming, Wang, Mingzhi, Wang, Xihuai, Zhang, Weinan, Yang, Yaodong, Zhang, Youzhi, An, Bo, Wen, Ying

The ex ante equilibrium for two-team zero-sum games, where agents within each team collaborate to compete against the opposing team, is known to be the best a team can do for coordination. Many existing works on ex ante equilibrium solutions are aimi

External link: http://arxiv.org/abs/2410.01575

Report

Tailed Low-Rank Matrix Factorization for Similarity Matrix Completion

Authors: Ma, Changyi, Yu, Runsheng, Chen, Xiao, Zhang, Youzhi

The similarity matrix serves as a fundamental tool at the core of numerous downstream machine-learning tasks. However, missing data is inevitable and often results in an inaccurate similarity matrix. To address this issue, Similarity Matrix Completion (S

External link: http://arxiv.org/abs/2409.19550

Report

In-Context Exploiter for Extensive-Form Games

Authors: Li, Shuxin, Yang, Chang, Zhang, Youzhi, Li, Pengdeng, Wang, Xinrun, Huang, Xiao, Chan, Hau, An, Bo

Nash equilibrium (NE) is a widely adopted solution concept in game theory due to its stability property. However, we observe that the NE strategy might not always yield the best results, especially against opponents who do not adhere to NE strategies

External link: http://arxiv.org/abs/2408.05575

Academic article

An Experimental Study on Strength Characteristics and Hydration Mechanism of Cemented Ultra-Fine Tailings Backfill

Authors: Gan Deqing, Li Hongbao, Chen Chao, Lu Hongjian, Zhang Youzhi

Published in: Frontiers in Materials, Vol 8 (2021)

In order to study the strength characteristics and hydration mechanism of the cemented ultra-fine tailings backfill (CUTB), the uniaxial compressive strength (UCS) tests of CUTB and cemented classified tailings backfill (CCTB) with cement-tailing rat

External link: https://doaj.org/article/b598b4700aa84fc99bfe51b7a1d43d78

Report

Direct Alignment of Language Models via Quality-Aware Self-Refinement

Authors: Yu, Runsheng, Wang, Yong, Jiao, Xiaoqi, Zhang, Youzhi, Kwok, James T.

Reinforcement Learning from Human Feedback (RLHF) has been commonly used to align the behaviors of Large Language Models (LLMs) with human preferences. Recently, a popular alternative is Direct Policy Optimization (DPO), which replaces an LLM-based r

External link: http://arxiv.org/abs/2405.21040

Report

Grasper: A Generalist Pursuer for Pursuit-Evasion Problems

Authors: Li, Pengdeng, Li, Shuxin, Wang, Xinrun, Cerny, Jakub, Zhang, Youzhi, McAleer, Stephen, Chan, Hau, An, Bo

Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm i

External link: http://arxiv.org/abs/2404.12626

Report

Leveraging Team Correlation for Approximating Equilibrium in Two-Team Zero-Sum Games

Authors: Liu, Naming, Wang, Mingzhi, Zhang, Youzhi, Yang, Yaodong, An, Bo, Wen, Ying

Two-team zero-sum games are one of the most important paradigms in game theory. In this paper, we focus on finding an unexploitable equilibrium in large team games. An unexploitable equilibrium is a worst-case policy, where members in the opponent te

External link: http://arxiv.org/abs/2403.00255

Report

Offline Equilibrium Finding

Authors: Li, Shuxin, Wang, Xinrun, Zhang, Youzhi, Cerny, Jakub, Li, Pengdeng, Chan, Hau, An, Bo

Offline reinforcement learning (offline RL) is an emerging field that has recently begun gaining attention across various application domains due to its ability to learn strategies from earlier collected datasets. Offline RL proved very successful, p

External link: http://arxiv.org/abs/2207.05285

Academic article

A large-area less-wires stretchable robot electronic skin

Authors: Chen, Jinmiao, Chen, Xiao, Li, Hangze, Ma, Chaolin, Yu, Ping, Zhang, Youzhi

Published in: Sensors and Actuators: A. Physical, Vol 376 (1 October 2024)
