Zobrazeno 1 - 10
of 39 240
pro vyhledávání: '"Yang, Yi-An"'
We introduce the Multi-Instance Generation (MIG) task, which focuses on generating multiple instances within a single image, each accurately placed at predefined positions with attributes such as category, color, and shape, strictly following user sp
Externí odkaz:
http://arxiv.org/abs/2407.02329
Autor:
Li, Cheng-Yi, Chang, Kao-Jung, Yang, Cheng-Fu, Wu, Hsin-Yu, Chen, Wenting, Bansal, Hritik, Chen, Ling, Yang, Yi-Ping, Chen, Yu-Chun, Chen, Shih-Pin, Lirng, Jiing-Feng, Chang, Kai-Wei, Chiou, Shih-Hwa
Multi-modal large language models (MLLMs) have been given free rein to explore exciting medical applications with a primary focus on radiology report generation. Nevertheless, the preliminary success in 2D radiology captioning is incompetent to refle
Externí odkaz:
http://arxiv.org/abs/2407.02235
Large Language Models (LLMs) are widely used for writing economic analysis reports or providing financial advice, but their ability to understand economic knowledge and reason about potential results of specific economic events lacks systematic evalu
Externí odkaz:
http://arxiv.org/abs/2407.01212
Scene flow estimation predicts the 3D motion at each point in successive LiDAR scans. This detailed, point-level, information can help autonomous vehicles to accurately predict and understand dynamic changes in their surroundings. Current state-of-th
Externí odkaz:
http://arxiv.org/abs/2407.01702
Large language models (LLMs) are now rapidly advancing and surpassing human abilities on many natural language tasks. However, aligning these super-human LLMs with human knowledge remains challenging because the supervision signals from human annotat
Externí odkaz:
http://arxiv.org/abs/2406.19032
Autor:
Zhang, Ze, Song, Kewei, Zhuang, Rongyi, He, Jianxian, Yang, Yi, Pan, Yifan, Mino, Takeshi, Hirose, Kayo, Umezu, Shinjiro
Polyetheretherketone (PEEK), as a semi-crystalline high-performance engineering plastic, has demonstrated good application prospects since its introduction. The ability of PEEK to be fabricated in complex architecture is a major limitation due to the
Externí odkaz:
http://arxiv.org/abs/2406.18157
Point cloud registration is a fundamental task in the fields of computer vision and robotics. Recent developments in transformer-based methods have demonstrated enhanced performance in this domain. However, the standard attention mechanism utilized i
Externí odkaz:
http://arxiv.org/abs/2406.17530
Autor:
Yang, Yi, Holvoet, Tom
Developing autonomous decision-making requires safety assurance. Agent programming languages like AgentSpeak and Gwendolen provide tools for programming autonomous decision-making. However, despite numerous efforts to apply model checking to these la
Externí odkaz:
http://arxiv.org/abs/2406.17206
Autor:
Chen, Yu-Hua, Choi, Woosung, Liao, Wei-Hsiang, Martínez-Ramírez, Marco, Cheuk, Kin Wai, Mitsufuji, Yuki, Jang, Jyh-Shing Roger, Yang, Yi-Hsuan
Recent years have seen increasing interest in applying deep learning methods to the modeling of guitar amplifiers or effect pedals. Existing methods are mainly based on the supervised approach, requiring temporally-aligned data pairs of unprocessed a
Externí odkaz:
http://arxiv.org/abs/2406.15751
In this article, we investigate the notion of model-based deep learning in the realm of music information research (MIR). Loosely speaking, we refer to the term model-based deep learning for approaches that combine traditional knowledge-based methods
Externí odkaz:
http://arxiv.org/abs/2406.11540