Výsledky vyhledávání

Report

Deep reinforcement learning for tracking a moving target in jellyfish-like swimming

Autor: Chen, Yihao, Yang, Yue

We develop a deep reinforcement learning method for training a jellyfish-like swimmer to effectively track a moving target in a two-dimensional flow. This swimmer is a flexible object equipped with a muscle model based on torsional springs. We employ

Externí odkaz: http://arxiv.org/abs/2409.08815

Zobrazit plný text záznamu

Report

RoomDiffusion: A Specialized Diffusion Model in the Interior Design Industry

Autor: Wang, Zhaowei, Hao, Ying, Wei, Hao, Xiao, Qing, Chen, Lulu, Li, Yulong, Yang, Yue, Li, Tianyi

Recent advancements in text-to-image diffusion models have significantly transformed visual content generation, yet their application in specialized fields such as interior design remains underexplored. In this paper, we present RoomDiffusion, a pion

Externí odkaz: http://arxiv.org/abs/2409.03198

Zobrazit plný text záznamu

Report

Optimizing Automated Picking Systems in Warehouse Robots Using Machine Learning

Autor: Li, Keqin, Wang, Jin, Wu, Xubo, Peng, Xirui, Chang, Runmian, Deng, Xiaoyu, Kang, Yiwen, Yang, Yue, Ni, Fanghao, Hong, Bo

With the rapid growth of global e-commerce, the demand for automation in the logistics industry is increasing. This study focuses on automated picking systems in warehouses, utilizing deep learning and reinforcement learning technologies to enhance p

Externí odkaz: http://arxiv.org/abs/2408.16633

Zobrazit plný text záznamu

Report

Degrade to Function: Towards Eco-friendly Morphing Devices that Function Through Programmed Sequential Degradation

Autor: Lu, Qiuyu, Yi, Semina, Gan, Mentian, Huang, Jihong, Zhang, Xiao, Yang, Yue, Shen, Chenyi, Yao, Lining

While it seems counterintuitive to think of degradation within an operating device as beneficial, one may argue that when rationally designed, the controlled breakdown of materials can be harnessed for specific functions. To apply this principle to t

Externí odkaz: http://arxiv.org/abs/2408.01660

Zobrazit plný text záznamu

Report

Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research

Autor: Xu, Boyan, Wen, Liang, Li, Zihao, Yang, Yuxing, Wu, Guanlan, Tang, Xiongpeng, Li, Yu, Wu, Zihao, Su, Qingxian, Shi, Xueqing, Yang, Yue, Tong, Rui, Ng, How Yong

Recent advancements in Large Language Models (LLMs) have sparked interest in their potential applications across various fields. This paper embarked on a pivotal inquiry: Can existing LLMs effectively serve as "water expert models" for water engineer

Externí odkaz: http://arxiv.org/abs/2407.21045

Zobrazit plný text záznamu

Report

Data Adaptive Traceback for Vision-Language Foundation Models in Image Classification

Autor: Peng, Wenshuo, Zhang, Kaipeng, Yang, Yue, Zhang, Hao, Qiao, Yu

Vision-language foundation models have been incredibly successful in a wide range of downstream computer vision tasks using adaptation methods. However, due to the high cost of obtaining pre-training datasets, pairs with weak image-text correlation i

Externí odkaz: http://arxiv.org/abs/2407.08787

Zobrazit plný text záznamu

Report

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

Autor: Yang, Zhantao, Feng, Ruili, Yan, Keyu, Wang, Huangji, Wang, Zhicai, Zhu, Shangwen, Zhang, Han, Xiao, Jie, Wu, Pingyu, Zhu, Kai, Chen, Jixuan, Xie, Chen-Wei, Mao, Chaojie, Yang, Yue, Zhang, Hongyang, Liu, Yu, Cheng, Fan

This paper presents Bag-of-Concept Graph (BACON) to gift models with limited linguistic abilities to taste the privilege of Vision Language Models (VLMs) and boost downstream tasks such as detection, visual question answering (VQA), and image generat

Externí odkaz: http://arxiv.org/abs/2407.03314

Zobrazit plný text záznamu

Report

Generative prediction of flow field based on the diffusion model

Autor: Hu, Jiajun, Lu, Zhen, Yang, Yue

We propose a geometry-to-flow diffusion model that utilizes the input of obstacle shape to predict a flow field past the obstacle. The model is based on a learnable Markov transition kernel to recover the data distribution from the Gaussian distribut

Externí odkaz: http://arxiv.org/abs/2407.00735

Zobrazit plný text záznamu

Report

PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models

Autor: Meng, Fanqing, Shao, Wenqi, Luo, Lixin, Wang, Yahong, Chen, Yiran, Lu, Quanfeng, Yang, Yue, Yang, Tianshuo, Zhang, Kaipeng, Qiao, Yu, Luo, Ping

Text-to-image (T2I) models have made substantial progress in generating images from textual prompts. However, they frequently fail to produce images consistent with physical commonsense, a vital capability for applications in world simulation and eve

Externí odkaz: http://arxiv.org/abs/2406.11802

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání