Showing 1 - 10 of 196 for search: '"Michael Bi"'
We embark on the age-old quest: unveiling the hidden dimensions of objects from mere glimpses of their visible parts. To address this, we present Vista3D, a framework that realizes swift and consistent 3D generation within a mere 5 minutes. At the he
External link:
http://arxiv.org/abs/2409.12193
Structured pruning reduces the computational overhead of deep neural networks by removing redundant sub-structures. However, assessing the relative importance of different sub-structures remains a significant challenge, particularly in advanced visio
External link:
http://arxiv.org/abs/2407.04616
Diffusion Transformers have recently demonstrated unprecedented generative capabilities for various tasks. The encouraging results, however, come with the cost of slow inference, since each denoising step requires inference on a transformer model wit
External link:
http://arxiv.org/abs/2406.01733
Controllable generation of 3D human motions becomes an important topic as the world embraces digital transformation. Existing works, though making promising progress with the advent of diffusion models, heavily rely on meticulously captured and annot
External link:
http://arxiv.org/abs/2401.11115
Semantic segmentation's performance is often compromised when applied to unlabeled adverse weather conditions. Unsupervised domain adaptation is a potential approach to enhancing the model's adaptability and robustness to adverse weather. However, ex
External link:
http://arxiv.org/abs/2401.07459
Unsupervised object discovery and localization aims to detect or segment objects in an image without any supervision. Recent efforts have demonstrated a notable potential to identify salient foreground objects by utilizing self-supervised transformer
External link:
http://arxiv.org/abs/2312.17492
We introduce DreamDrone, a novel zero-shot and training-free pipeline for generating unbounded flythrough scenes from textual prompts. Different from other methods that focus on warping images frame by frame, we advocate explicitly warping the interm
External link:
http://arxiv.org/abs/2312.08746
Authors:
Gu, Kerui, Li, Zhihao, Liu, Shiyong, Liu, Jianzhuang, Xu, Songcen, Yan, Youliang, Mi, Michael Bi, Kawaguchi, Kenji, Yao, Angela
Estimating 3D rotations is a common procedure for 3D computer vision. The accuracy depends heavily on the rotation representation. One form of representation -- rotation matrices -- is popular due to its continuity, especially for pose estimation tas
External link:
http://arxiv.org/abs/2312.00462
Text-to-motion generation is a formidable task, aiming to produce human motions that align with the input text while also adhering to human capabilities and physical laws. While there have been advancements in diffusion models, their application in d
External link:
http://arxiv.org/abs/2308.14480
Authors:
Nie, Ming, Xue, Yujing, Wang, Chunwei, Ye, Chaoqiang, Xu, Hang, Zhu, Xinge, Huang, Qingqiu, Mi, Michael Bi, Wang, Xinchao, Zhang, Li
Recently, polar-based representation has shown promising properties in perceptual tasks. In addition to Cartesian-based approaches, which separate point clouds unevenly, representing point clouds as polar grids has been recognized as an alternative d
External link:
http://arxiv.org/abs/2308.03982