Zobrazeno 1 - 10
of 37
pro vyhledávání: '"Di, Xinhan"'
The insufficient supervision limit the performance of the deep supervised models for brain disease diagnosis. It is important to develop a learning framework that can capture more information in limited data and insufficient supervision. To address t
Externí odkaz:
http://arxiv.org/abs/2410.05342
Autor:
Qiu, Wenmo, Di, Xinhan
There is a gap in the understanding of occluded objects in existing large-scale visual language multi-modal models. Current state-of-the-art multimodal models fail to provide satisfactory results in describing occluded objects for visual-language mul
Externí odkaz:
http://arxiv.org/abs/2410.01261
Autor:
Yang, Shuxin, Di, Xinhan
There is a gap in the understanding of occluded objects in existing large-scale visual language multi-modal models. Current state-of-the-art multi-modal models fail to provide satisfactory results in describing occluded objects through universal visu
Externí odkaz:
http://arxiv.org/abs/2410.01861
Adaptation methods are developed to adapt depth foundation models to endoscopic depth estimation recently. However, such approaches typically under-perform training since they limit the parameter search to a low-rank subspace and alter the training d
Externí odkaz:
http://arxiv.org/abs/2410.00979
Autor:
Yang, Huan, Chen, Jiahui, Ding, Chaofan, Shi, Runhua, Xiong, Siyu, Hong, Qingqi, Mo, Xiaoqi, Di, Xinhan
Gestures are pivotal in enhancing co-speech communication. While recent works have mostly focused on point-level motion transformation or fully supervised motion representations through data-driven approaches, we explore the representation of gesture
Externí odkaz:
http://arxiv.org/abs/2409.17674
Large-scale text-to-speech (TTS) models have made significant progress recently.However, they still fall short in the generation of Chinese dialectal speech. Toaddress this, we propose Bailing-TTS, a family of large-scale TTS models capable of genera
Externí odkaz:
http://arxiv.org/abs/2408.00284
Autor:
Di, Xinhan, Yu, Pengqian
In real life, the decoration of 3D indoor scenes through designing furniture layout provides a rich experience for people. In this paper, we explore the furniture layout task as a Markov decision process (MDP) in virtual reality, which is solved by h
Externí odkaz:
http://arxiv.org/abs/2210.10431
Autor:
Di, Xinhan, Yu, Pengqian
Recent years have witnessed great success for hand reconstruction in real-time applications such as visual reality and augmented reality while interacting with two-hand reconstruction through efficient transformers is left unexplored. In this paper,
Externí odkaz:
http://arxiv.org/abs/2208.09815
Autor:
Di, Xinhan, Yu, Pengqian
In the industrial interior design process, professional designers plan the furniture layout to achieve a satisfactory 3D design for selling. In this paper, we explore the interior graphics scenes design task as a Markov decision process (MDP) in 3D s
Externí odkaz:
http://arxiv.org/abs/2102.09137
Autor:
Di, Xinhan, Yu, Pengqian
In the industrial interior design process, professional designers plan the size and position of furniture in a room to achieve a satisfactory design for selling. In this paper, we explore the interior scene design task as a Markov decision process (M
Externí odkaz:
http://arxiv.org/abs/2101.07462