Zobrazeno 1 - 10
of 23 713
pro vyhledávání: '"LI, Zhong"'
Autor:
Deng, Chao, Yuan, Jiale, Bu, Pi, Wang, Peijie, Li, Zhong-Zhi, Xu, Jian, Li, Xiao-Hui, Gao, Yuan, Song, Jun, Zheng, Bo, Liu, Cheng-Lin
Large vision language models (LVLMs) have improved the document understanding capabilities remarkably, enabling the handling of complex document elements, longer contexts, and a wider range of tasks. However, existing document understanding benchmark
Externí odkaz:
http://arxiv.org/abs/2412.18424
Autor:
Shen, Zhuowen, Liu, Yuan, Chen, Zhang, Li, Zhong, Wang, Jiepeng, Liang, Yongqing, Yu, Zhengming, Zhang, Jingdong, Xu, Yi, Schaefer, Scott, Li, Xin, Wang, Wenping
Gaussian splatting has achieved impressive improvements for both novel-view synthesis and surface reconstruction from multi-view images. However, current methods still struggle to reconstruct high-quality surfaces from only sparse view input images u
Externí odkaz:
http://arxiv.org/abs/2412.15400
Recent studies have proposed integrating Chain-of-Thought (CoT) reasoning to further enhance the reliability of Code Language Models (CLMs) in generating code, a step-by-step approach that breaks down complex programming tasks into manageable sub-pro
Externí odkaz:
http://arxiv.org/abs/2412.05829
Recently, several studies have combined Gaussian Splatting to obtain scene representations with language embeddings for open-vocabulary 3D scene understanding. While these methods perform well, they essentially require very dense multi-view inputs, l
Externí odkaz:
http://arxiv.org/abs/2412.02245
Vision representation learning, especially self-supervised learning, is pivotal for various vision applications. Ensemble learning has also succeeded in enhancing the performance and robustness of the vision models. However, traditional ensemble stra
Externí odkaz:
http://arxiv.org/abs/2411.15787
Masked image modeling has achieved great success in learning representations but is limited by the huge computational costs. One cost-saving strategy makes the decoder reconstruct only a subset of masked tokens and throw the others, and we refer to t
Externí odkaz:
http://arxiv.org/abs/2411.15746
Existing object detection methods often consider sRGB input, which was compressed from RAW data using ISP originally designed for visualization. However, such compression might lose crucial information for detection, especially under complex light an
Externí odkaz:
http://arxiv.org/abs/2411.15678
Deep learning methods have significantly advanced medical image segmentation, yet their success hinges on large volumes of manually annotated data, which require specialized expertise for accurate labeling. Additionally, these methods often demand su
Externí odkaz:
http://arxiv.org/abs/2409.00884
Autor:
Xue, Kun, Cao, Yue, Wan, Feng, Li, Zhong-Peng, Zhao, Qian, Liu, Si-Man, Liu, Xin-Yu, Hu, Li-Xiang, Zhao, Yong-Tao, Xu, Zhong-Feng, Yu, Tong-Pu, Li, Jian-Xing
Relativistic spin-polarized electron beams are important for fundamental research and the industry, but their generation currently requires conventional accelerators or ultrastrong laser facilities, limiting their accessibility and broad applications
Externí odkaz:
http://arxiv.org/abs/2408.08563
Publikováno v:
Phys. Rev. A 110, 052207 (2024)
Investigating the interactions of vortex electrons with electromagnetic fields is crucial for advancing particle acceleration techniques, scattering theory in background fields, and developing novel electron beams for material diagnostics. In this wo
Externí odkaz:
http://arxiv.org/abs/2408.02390