Showing 1 - 10
of 34,617
for search: '"An, Xiaomeng"'
Large Language Models (LLMs) are increasingly being integrated into services such as ChatGPT to provide responses to user queries. To mitigate potential harm and prevent misuse, there have been concerted efforts to align the LLMs with human values an
External link:
http://arxiv.org/abs/2412.18171
The unusually warm sea surface temperature events known as marine heatwaves (MHWs) have a profound impact on marine ecosystems. Accurate prediction of extreme MHWs has significant scientific and financial worth. However, existing methods still have c
External link:
http://arxiv.org/abs/2412.15532
Compositional generalization is the capability of a model to understand novel compositions composed of seen concepts. There are multiple levels of novel compositions including phrase-phrase level, phrase-word level, and word-word level. Existing meth
External link:
http://arxiv.org/abs/2412.13636
We propose Radar-Camera fusion transformer (RaCFormer) to boost the accuracy of 3D object detection by the following insight. The Radar-Camera fusion in outdoor 3D scene perception is capped by the image-to-BEV transformation--if the depth of pixels
External link:
http://arxiv.org/abs/2412.12725
Author:
Ren, Sucheng, Li, Xiaomeng
Vision Transformer shows great superiority in medical image segmentation due to the ability in learning long-range dependency. For medical image segmentation from 3D data, such as computed tomography (CT), existing methods can be broadly classified i
External link:
http://arxiv.org/abs/2412.11458
Existing multi-modal learning methods on fundus and OCT images mostly require both modalities to be available and strictly paired for training and testing, which appears less practical in clinical scenarios. To expand the scope of clinical applicatio
External link:
http://arxiv.org/abs/2412.09402
Author:
Ouyang, Linke, Qu, Yuan, Zhou, Hongbin, Zhu, Jiawei, Zhang, Rui, Lin, Qunshu, Wang, Bin, Zhao, Zhiyuan, Jiang, Man, Zhao, Xiaomeng, Shi, Jin, Wu, Fan, Chu, Pei, Liu, Minghao, Li, Zhenxiang, Xu, Chao, Zhang, Bo, Shi, Botian, Tu, Zhongying, He, Conghui
Document content extraction is crucial in computer vision, especially for meeting the high-quality data needs of large language models (LLMs) and retrieval-augmented generation (RAG) technologies. However, current document parsing methods suffer from
External link:
http://arxiv.org/abs/2412.07626
Author:
Wang, Cunshi, Hu, Xinjie, Zhang, Yu, Chen, Xunhao, Du, Pengliang, Mao, Yiming, Wang, Rui, Li, Yuyang, Wu, Ying, Yang, Hang, Li, Yansong, Wang, Beichuan, Mu, Haiyang, Wang, Zheng, Tian, Jianfeng, Ge, Liang, Mao, Yongna, Li, Shengming, Lu, Xiaomeng, Zou, Jinhang, Huang, Yang, Sun, Ningchen, Zheng, Jie, He, Min, Bai, Yu, Jin, Junjie, Wu, Hong, Shang, Chaohui, Liu, Jifeng
With the rapid advancements in Large Language Models (LLMs), LLM-based agents have introduced convenient and user-friendly methods for leveraging tools across various domains. In the field of astronomical observation, the construction of new telescop
External link:
http://arxiv.org/abs/2412.06412
Recent advancements in text-to-video (T2V) generative models have shown impressive capabilities. However, these models are still inadequate in aligning synthesized videos with human preferences (e.g., accurately reflecting text descriptions), which i
External link:
http://arxiv.org/abs/2412.04814
Author:
Gui, Ke, Zhang, Xutao, Che, Huizheng, Li, Lei, Zheng, Yu, An, Linchang, Miao, Yucong, Zhao, Hujia, Dubovik, Oleg, Holben, Brent, Wang, Jun, Gupta, Pawan, Lind, Elena S., Toledano, Carlos, Wang, Hong, Wang, Zhili, Wang, Yaqiang, Huang, Xiaomeng, Dai, Kan, Xia, Xiangao, Xu, Xiaofeng, Zhang, Xiaoye
Aerosol forecasting is essential for air quality warnings, health risk assessment, and climate change mitigation. However, it is more complex than weather forecasting due to the intricate interactions between aerosol physicochemical processes and atm
External link:
http://arxiv.org/abs/2412.02498