Zobrazeno 1 - 10
of 9 092
pro vyhledávání: '"ZHANG, YIFAN"'
The ever-increasing sizes of large language models necessitate distributed solutions for fast inference that exploit multi-dimensional parallelism, where computational loads are split across various accelerators such as GPU clusters. However, this ap
Externí odkaz:
http://arxiv.org/abs/2412.04964
Autor:
Zheng, Longtao, Zhang, Yifan, Guo, Hanzhong, Pan, Jiachun, Tan, Zhenxiong, Lu, Jiahao, Tang, Chuanxin, An, Bo, Yan, Shuicheng
Recent advances in video diffusion models have unlocked new potential for realistic audio-driven talking video generation. However, achieving seamless audio-lip synchronization, maintaining long-term identity consistency, and producing natural, audio
Externí odkaz:
http://arxiv.org/abs/2412.04448
This work introduces RARE (Retrieval-Augmented Reasoning Enhancement), a versatile extension to the mutual reasoning framework (rStar), aimed at enhancing reasoning accuracy and factual integrity across large language models (LLMs) for complex, knowl
Externí odkaz:
http://arxiv.org/abs/2412.02830
Autor:
Wang, Jiangtao, Qin, Zhen, Zhang, Yifan, Hu, Vincent Tao, Ommer, Björn, Briq, Rania, Kesselheim, Stefan
Vision tokenizers have gained a lot of attraction due to their scalability and compactness; previous works depend on old-school GAN-based hyperparameters, biased comparisons, and a lack of comprehensive analysis of the scaling behaviours. To tackle t
Externí odkaz:
http://arxiv.org/abs/2412.02632
Autor:
Pan, Zhewen, Zhang, Yifan
Existing identification and estimation methods for semiparametric sample selection models rely heavily on exclusion restrictions. However, it is difficult in practice to find a credible excluded variable that has a correlation with selection but no c
Externí odkaz:
http://arxiv.org/abs/2412.01208
Autor:
Zhang, Yifan
The rapid advancement of large language models (LLMs) such as GPT-3, PaLM, and Llama has significantly transformed natural language processing, showcasing remarkable capabilities in understanding and generating language. However, these models often s
Externí odkaz:
http://arxiv.org/abs/2411.18104
Autor:
Zhou, Chen, Cheng, Peng, Fang, Junfeng, Zhang, Yifan, Yan, Yibo, Jia, Xiaojun, Xu, Yanyan, Wang, Kun, Cao, Xiaochun
Multispectral object detection, utilizing RGB and TIR (thermal infrared) modalities, is widely recognized as a challenging task. It requires not only the effective extraction of features from both modalities and robust fusion strategies, but also the
Externí odkaz:
http://arxiv.org/abs/2411.18288
Autor:
Wang, Xin, Zhang, Yifan, Zhang, Xiaojing, Yu, Longhui, Lin, Xinna, Jiang, Jindong, Ma, Bin, Yu, Kaicheng
Pharmaceutical patents play a vital role in biochemical industries, especially in drug discovery, providing researchers with unique early access to data, experimental results, and research insights. With the advancement of machine learning, patent an
Externí odkaz:
http://arxiv.org/abs/2410.21312
Autor:
Huang, Yiming, Xiao, Jingyu, Tao, Lian, Zhang, Shuang-Nan, Yin, Qian-Qing, Wang, Yusa, Zhao, Zijian, Zhang, Chen, Zhao, Qingchang, Ma, Xiang, Zhao, Shujie, Zhou, Heng, Wen, Xiangyang, Li, Zhengwei, Xiong, Shaolin, Zhang, Juan, Bu, Qingcui, Cang, Jirong, Cao, Dezhi, Chen, Wen, Ding, Siran, Dai, Yanfeng, Gao, Min, Gao, Yang, He, Huilin, Hou, Shujin, Hou, Dongjie, Hu, Tai, Huang, Guoli, Huang, Yue, Jia, Liping, Jin, Ge, Li, Dalin, Li, Jinsong, Li, Panping, Li, Yajun, Liu, Xiaojing, Ma, Ruican, Men, Lingling, Pan, Xingyu, Qi, Liqiang, Song, Liming, Sun, Xianfei, Tang, Qingwen, Xiong, Liyuan, Xu, Yibo, Yang, Sheng, Yang, Yanji, Yang, Yong, Zhang, Aimei, Zhang, Wei, Zhang, Yifan, Zhang, Yueting, Zhao, Donghua, Zhao, Kang, Zhu, Yuxuan
The Chasing All Transients Constellation Hunters (CATCH) space mission is focused on exploring the dynamic universe via X-ray follow-up observations of various transients. The first pathfinder of the CATCH mission, CATCH-1, was launched on June 22, 2
Externí odkaz:
http://arxiv.org/abs/2410.17833
Autor:
Zhang, Xinjie, Liu, Zhening, Zhang, Yifan, Ge, Xingtong, He, Dailan, Xu, Tongda, Wang, Yan, Lin, Zehong, Yan, Shuicheng, Zhang, Jun
4D Gaussian Splatting (4DGS) has recently emerged as a promising technique for capturing complex dynamic 3D scenes with high fidelity. It utilizes a 4D Gaussian representation and a GPU-friendly rasterizer, enabling rapid rendering speeds. Despite it
Externí odkaz:
http://arxiv.org/abs/2410.13613