Showing 1 - 10 of 11,523 for search: '"Xiao-bin An"'
Benefiting from large-scale pre-training of text-video pairs, current text-to-video (T2V) diffusion models can generate high-quality videos from the text description. Besides, given some reference images or videos, the parameter-efficient fine-tuning
External link:
http://arxiv.org/abs/2412.15646
Author:
Liu, Shanshan, Burgos, Rhonald, Zhang, Enze, Wang, Naizhou, Qiang, Xiao-Bin, Li, Chuanzhao, Zhang, Qihan, Du, Z. Z., Zheng, Rui, Chen, Jingsheng, Xu, Qing-Hua, Leng, Kai, Gao, Weibo, Xiu, Faxian, Culcer, Dimitrie, Loh, Kian Ping
Published in:
Commun Phys 7, 413 (2024)
The discovery of the nonlinear Hall effect provides an avenue for studying the interplay among symmetry, topology, and phase transitions, with potential applications in signal doubling and high-frequency rectification. However, practical applications
External link:
http://arxiv.org/abs/2412.15591
Author:
Hu, Jinwu, Wang, Yufeng, Zhang, Shuhai, Zhou, Kai, Chen, Guohao, Hu, Yu, Xiao, Bin, Tan, Mingkui
Ensemble reasoning for the strengths of different LLM experts is critical to achieving consistent and satisfactory performance on diverse inputs across a wide range of tasks. However, existing LLM ensemble methods are either computationally intensive
External link:
http://arxiv.org/abs/2412.07448
We investigate density fluctuations and scalar-induced gravitational waves (GWs) arising from the production of long-lived solitons and oscillons, which can dominate the early Universe and drive reheating prior to the standard radiation-dominated era
External link:
http://arxiv.org/abs/2412.08057
Remote Sensing (RS) image deblurring and Super-Resolution (SR) are common tasks in computer vision that aim at restoring RS image detail and spatial scale, respectively. However, real-world RS images often suffer from a complex combination of global
External link:
http://arxiv.org/abs/2412.05696
We present Florence-VL, a new family of multimodal large language models (MLLMs) with enriched visual representations produced by Florence-2, a generative vision foundation model. Unlike the widely used CLIP-style vision transformer trained by contra
External link:
http://arxiv.org/abs/2412.04424
The recent Segment Anything Model (SAM) represents a significant breakthrough in scaling segmentation models, delivering strong performance across various downstream applications in the RGB modality. However, directly applying SAM to emerging visual
External link:
http://arxiv.org/abs/2412.04220
Author:
Chen, Xiao-Bin, Wang, Kai, Huang, Yi-Yun, Zhang, Hai-Ming, Xi, Shao-Qiang, Liu, Ruo-Yu, Wang, Xiang-Yu
The supersonic flow motions associated with infall of baryonic gas toward sheets and filaments, as well as cluster mergers, produce large-scale shock waves. The shocks associated with galaxy clusters can be classified mainly into two categories: int
External link:
http://arxiv.org/abs/2412.02436
Author:
Liu, Fang, Ji, Xiao-Bin, Sun, Sheng-Sen, Liu, Huai-Min, Fang, Shuang-Shi, Li, Xiao-Ling, Chen, Tong, Wang, Xin-Nan, Li, Ming-Run, Wang, Liang-Liang, Wu, Ling-Hui, Yuan, Ye, Zhang, Yao, Zhu, Wen-Jing
Using $(10087 \pm 44) \times 10^6$ $J/\psi$ events collected with the BESIII detector in 2009, 2012, 2018 and 2019, the tracking efficiency of charged pions is studied using the decay $J/\psi \rightarrow \pi^+ \pi^- \pi^0$. The systematic uncertainty
External link:
http://arxiv.org/abs/2412.00469
Deep neural networks exhibit vulnerability to adversarial examples that can transfer across different models. A particularly challenging problem is developing transferable targeted attacks that can mislead models into predicting specific target class
External link:
http://arxiv.org/abs/2411.15553