Zobrazeno 1 - 10
of 10 778
pro vyhledávání: '"An, Guocheng"'
Point cloud frame interpolation is a challenging task that involves accurate scene flow estimation across frames and maintaining the geometry structure. Prevailing techniques often rely on pre-trained motion estimators or intensive testing-time optim
Externí odkaz:
http://arxiv.org/abs/2410.19573
Autor:
Liu, Xiaoqian, Du, Yangfan, Wang, Jianjin, Ge, Yuan, Xu, Chen, Xiao, Tong, Chen, Guocheng, Zhu, Jingbo
Simultaneous Speech Translation (SimulST) involves generating target language text while continuously processing streaming speech input, presenting significant real-time challenges. Multi-task learning is often employed to enhance SimulST performance
Externí odkaz:
http://arxiv.org/abs/2409.15911
Autor:
Mai, Jinjie, Zhu, Wenxuan, Rojas, Sara, Zarzar, Jesus, Hamdi, Abdullah, Qian, Guocheng, Li, Bing, Giancola, Silvio, Ghanem, Bernard
Neural radiance fields (NeRFs) generally require many images with accurate poses for accurate novel view synthesis, which does not reflect realistic setups where views can be sparse and poses can be noisy. Previous solutions for learning NeRFs with s
Externí odkaz:
http://arxiv.org/abs/2408.10739
Autor:
Bahmani, Sherwin, Skorokhodov, Ivan, Siarohin, Aliaksandr, Menapace, Willi, Qian, Guocheng, Vasilkovsky, Michael, Lee, Hsin-Ying, Wang, Chaoyang, Zou, Jiaxu, Tagliasacchi, Andrea, Lindell, David B., Tulyakov, Sergey
Modern text-to-video synthesis models demonstrate coherent, photorealistic generation of complex videos from a text description. However, most existing models lack fine-grained control over camera movement, which is critical for downstream applicatio
Externí odkaz:
http://arxiv.org/abs/2407.12781
Collaborative Edge Computing (CEC) is an emerging paradigm that collaborates heterogeneous edge devices as a resource pool to compute DNN inference tasks in proximity such as edge video analytics. Nevertheless, as the key knob to improve network util
Externí odkaz:
http://arxiv.org/abs/2406.19613
The noisy permutation channel is a useful abstraction introduced by Makur for point-to-point communication networks and biological storage. While the asymptotic capacity results exist for this model, the characterization of the second-order asymptoti
Externí odkaz:
http://arxiv.org/abs/2406.15031
Autor:
Qiu, Yanqi, Zhen, Guocheng
We study the limiting spectral measure of large random Helson matrices and large random matrices of certain patterned structures. Given a real random variable $X \in L^{2+ \varepsilon}(\mathbb{P}) $ for some $\varepsilon > 0$ and $\mathrm{Var}(X) = 1
Externí odkaz:
http://arxiv.org/abs/2405.18796
Age of Information (AoI) is an emerging metric used to assess the timeliness of information, gaining research interest in real-time multicast applications such as video streaming and metaverse platforms. In this paper, we consider a dynamic multicast
Externí odkaz:
http://arxiv.org/abs/2404.18084
This paper addresses task planning problems for language-instructed robot teams. Tasks are expressed in natural language (NL), requiring the robots to apply their capabilities at various locations and semantic objects. Several recent works have addre
Externí odkaz:
http://arxiv.org/abs/2402.15368
Autor:
Hamdi, Abdullah, Melas-Kyriazi, Luke, Mai, Jinjie, Qian, Guocheng, Liu, Ruoshi, Vondrick, Carl, Ghanem, Bernard, Vedaldi, Andrea
Advancements in 3D Gaussian Splatting have significantly accelerated 3D reconstruction and generation. However, it may require a large number of Gaussians, which creates a substantial memory footprint. This paper introduces GES (Generalized Exponenti
Externí odkaz:
http://arxiv.org/abs/2402.10128