Zobrazeno 1 - 10
of 14 348
pro vyhledávání: '"Liang,Yan"'
Current large multimodal models (LMMs) face challenges in grounding, which requires the model to relate language components to visual entities. Contrary to the common practice that fine-tunes LMMs with additional grounding supervision, we find that t
Externí odkaz:
http://arxiv.org/abs/2410.08209
Decoding neurophysiological signals into language is of great research interest within brain-computer interface (BCI) applications. Electroencephalography (EEG), known for its non-invasiveness, ease of use, and cost-effectiveness, has been a popular
Externí odkaz:
http://arxiv.org/abs/2409.16312
Complex 3D scene understanding has gained increasing attention, with scene encoding strategies playing a crucial role in this success. However, the optimal scene encoding strategies for various scenarios remain unclear, particularly compared to their
Externí odkaz:
http://arxiv.org/abs/2409.03757
Based on our previous study [S. Wang $\textit{et al}$. J. Chem. Phys. $\textbf{153}$, 184102 (2020)], we generalize the theory of molecular emission power spectra (EPS) from one molecule to multichromophoric systems in the framework of macroscopic qu
Externí odkaz:
http://arxiv.org/abs/2408.14569
Recent advancements in 3D object reconstruction from single images have primarily focused on improving the accuracy of object shapes. Yet, these techniques often fail to accurately capture the inter-relation between the object, ground, and camera. As
Externí odkaz:
http://arxiv.org/abs/2407.18914
The unsourced random access (URA) has emerged as a viable scheme for supporting the massive machine-type communications (mMTC) in the sixth generation (6G) wireless networks. Notably, the tensor-based URA (TURA), with its inherent tensor structure, s
Externí odkaz:
http://arxiv.org/abs/2406.16381
Through integrating the evolutionary correlations across global states in the bidirectional recursion, an explainable Bayesian recurrent neural smoother (EBRNS) is proposed for offline data-assisted fixed-interval state smoothing. At first, the propo
Externí odkaz:
http://arxiv.org/abs/2406.11163
Being able to carry out complicated vision language reasoning tasks in 3D space represents a significant milestone in developing household robots and human-centered embodied AI. In this work, we demonstrate that a critical and distinct challenge in 3
Externí odkaz:
http://arxiv.org/abs/2406.07544
Autor:
Cao, Shengcao, Gu, Jiuxiang, Kuen, Jason, Tan, Hao, Zhang, Ruiyi, Zhao, Handong, Nenkova, Ani, Gui, Liang-Yan, Sun, Tong, Wang, Yu-Xiong
Open-world entity segmentation, as an emerging computer vision task, aims at segmenting entities in images without being restricted by pre-defined classes, offering impressive generalization capabilities on unseen images and concepts. Despite its pro
Externí odkaz:
http://arxiv.org/abs/2404.12386
Autor:
Wang, Jingyue, Huang, Junwei, Kaplan, Daniel, Zhou, Xuehan, Tan, Congwei, Zhang, Jing, Jin, Gangjian, Cong, Xuzhong, Zhu, Yongchao, Gao, Xiaoyin, Liang, Yan, Zuo, Huakun, Zhu, Zengwei, Zhu, Ruixue, Stern, Ady, Liu, Hongtao, Gao, Peng, Yan, Binghai, Yuan, Hongtao, Peng, Hailin
In the presence of high magnetic field, quantum Hall systems usually host both even- and odd-integer quantized states because of lifted band degeneracies. Selective control of these quantized states is challenging but essential to understand the exot
Externí odkaz:
http://arxiv.org/abs/2404.00695