Zobrazeno 1 - 10
of 368
pro vyhledávání: '"Zhang, Chenxu"'
Audio-driven talking video generation has advanced significantly, but existing methods often depend on video-to-video translation techniques and traditional generative networks like GANs and they typically generate taking heads and co-speech gestures
Externí odkaz:
http://arxiv.org/abs/2409.07649
Urban waterlogging poses a major risk to public safety and infrastructure. Conventional methods using water-level sensors need high-maintenance to hardly achieve full coverage. Recent advances employ surveillance camera imagery and deep learning for
Externí odkaz:
http://arxiv.org/abs/2407.08109
State Space Model (SSM) is a mathematical model used to describe and analyze the behavior of dynamic systems. This model has witnessed numerous applications in several fields, including control theory, signal processing, economics and machine learnin
Externí odkaz:
http://arxiv.org/abs/2405.04404
Autor:
Yang, Fan, Zhang, Jianfeng, Shi, Yichun, Chen, Bowen, Zhang, Chenxu, Zhang, Huichao, Yang, Xiaofeng, Feng, Jiashi, Lin, Guosheng
Benefiting from the rapid development of 2D diffusion models, 3D content creation has made significant progress recently. One promising solution involves the fine-tuning of pre-trained 2D diffusion models to harness their capacity for producing multi
Externí odkaz:
http://arxiv.org/abs/2404.06429
This paper addresses the issue of active speaker detection (ASD) in noisy environments and formulates a robust active speaker detection (rASD) problem. Existing ASD approaches leverage both audio and visual modalities, but non-speech sounds in the su
Externí odkaz:
http://arxiv.org/abs/2403.19002
The recently developed Sora model [1] has exhibited remarkable capabilities in video generation, sparking intense discussions regarding its ability to simulate real-world phenomena. Despite its growing popularity, there is a lack of established metri
Externí odkaz:
http://arxiv.org/abs/2402.17403
Autor:
Zhang, Chenxu, Wang, Chao, Zhang, Jianfeng, Xu, Hongyi, Song, Guoxian, Xie, You, Luo, Linjie, Tian, Yapeng, Guo, Xiaohu, Feng, Jiashi
The generation of emotional talking faces from a single portrait image remains a significant challenge. The simultaneous achievement of expressive emotional talking and accurate lip-sync is particularly difficult, as expressiveness is often compromis
Externí odkaz:
http://arxiv.org/abs/2312.13578
Autor:
Teetaert, Spencer, Zhao, Wenda, Xinyuan, Niu, Zahir, Hashir, Leong, Huiyu, Hidalgo, Michel, Puga, Gerardo, Lorente, Tomas, Espinosa, Nahuel, Carrasco, John Alejandro Duarte, Zhang, Kaizheng, Di, Jian, Jin, Tao, Li, Xiaohan, Zhou, Yijia, Liang, Xiuhua, Zhang, Chenxu, Loquercio, Antonio, Zhou, Siqi, Brunke, Lukas, Greeff, Melissa, Hoenig, Wolfgang, Panerati, Jacopo, Schoellig, Angela P.
Shared benchmark problems have historically been a fundamental driver of progress for scientific communities. In the context of academic conferences, competitions offer the opportunity to researchers with different origins, backgrounds, and levels of
Externí odkaz:
http://arxiv.org/abs/2308.16743
Autor:
Zhang, Biao, Lang, Yihan, Zhang, Xuecheng, Zhang, Chenxu, Qiu, Yulou, Sun, Kai, Shentu, Xuping, Yu, Xiaoping, Lin, Xiaodong
Publikováno v:
In Chemical Engineering Journal 1 November 2024 499
Autor:
Zhang, Chenxu, Tan, Yiping, Yin, Fengxiang, Zhao, Jian, Gao, Zhiyong, Sun, Wei, McFadzean, Belinda, Cao, Jian
Publikováno v:
In Minerals Engineering October 2024 217