Showing 1 - 10
of 27 932
for search: '"WANG, XI"'
Recent advances in text-conditioned video diffusion have greatly improved video quality. However, these methods give users limited or sometimes no control over camera aspects, including dynamic camera motion, zoom, lens distortion, and focus shifts.
External link:
http://arxiv.org/abs/2412.14158
Image rendering from line drawings is vital in design, and image generation technologies reduce costs, yet professional line drawings demand preserving complex details. Text prompts struggle with accuracy, and image translation struggles with consiste
External link:
http://arxiv.org/abs/2412.11519
Construction robots operate in unstructured construction sites, where effective visual perception is crucial for ensuring safe and seamless operations. However, construction robots often handle large elements and perform tasks across expansive areas,
External link:
http://arxiv.org/abs/2412.11275
Author:
Hassan, Mariam, Stapf, Sebastian, Rahimi, Ahmad, Rezende, Pedro M B, Haghighi, Yasaman, Brüggemann, David, Katircioglu, Isinsu, Zhang, Lin, Chen, Xiaoran, Saha, Suman, Cannici, Marco, Aljalbout, Elie, Ye, Botao, Wang, Xi, Davtyan, Aram, Salzmann, Mathieu, Scaramuzza, Davide, Pollefeys, Marc, Favaro, Paolo, Alahi, Alexandre
We present GEM, a Generalizable Ego-vision Multimodal world model that predicts future frames using a reference frame, sparse features, human poses, and ego-trajectories. Hence, our model has precise control over object dynamics, ego-agent motion and
External link:
http://arxiv.org/abs/2412.11198
In this paper, we develop a novel method for deriving a global optimal control strategy for stochastic attitude kinematics on the special orthogonal group SO(3). We first introduce a stochastic Lie-Hamilton-Jacobi-Bellman (SL-HJB) equation on SO(3),
External link:
http://arxiv.org/abs/2412.08124
Author:
Diao, Wenting, Wang, Xi, Di, Ke, Liu, Yu, Cheng, Anyu, Cai, Chunxiao, Yang, Wenhai, Du, Jiajia
We investigate the absorption and transmission properties of a weak probe field in an atom opto-magnomechanics system. The system comprises an assembly of two-level atoms and a magnon mode within a ferrimagnetic crystal, which directly interacts with
External link:
http://arxiv.org/abs/2412.06369
Author:
Halacheva, Anna-Maria, Miao, Yang, Zaech, Jan-Nico, Wang, Xi, Van Gool, Luc, Paudel, Danda Pani
3D scene understanding is a long-standing challenge in computer vision and a key component in enabling mixed reality, wearable computing, and embodied AI. Providing a solution to these applications requires a multifaceted approach that covers scene-c
External link:
http://arxiv.org/abs/2412.01398
Author:
Balauca, Ada-Astrid, Garai, Sanjana, Balauca, Stefan, Shetty, Rasesh Udayakumar, Agrawal, Naitik, Shah, Dhwanil Subhashbhai, Fu, Yuqian, Wang, Xi, Toutanova, Kristina, Paudel, Danda Pani, Van Gool, Luc
Museums serve as vital repositories of cultural heritage and historical artifacts spanning diverse epochs, civilizations, and regions, preserving well-documented collections. Data reveal key attributes such as age, origin, material, and cultural sign
External link:
http://arxiv.org/abs/2412.01370
Author:
Liu, Zuhao, Yanev, Aleksandar, Mahmood, Ahmad, Nikolov, Ivan, Motamed, Saman, Zheng, Wei-Shi, Wang, Xi, Van Gool, Luc, Paudel, Danda Pani
Advances in video generation have significantly improved the realism and quality of created scenes. This has fueled interest in developing intuitive tools that let users leverage video generation as world simulators. Text-to-video (T2V) generation is
External link:
http://arxiv.org/abs/2411.16804
Author:
Hu, Yuchen, Ye, Junhao, Xu, Ke, Sun, Jialin, Zhang, Shiyue, Jiao, Xinyao, Pan, Dingrong, Zhou, Jie, Wang, Ning, Shan, Weiwei, Fang, Xinwei, Wang, Xi, Guan, Nan, Jiang, Zhe
Verifying hardware designs in embedded systems is crucial but often labor-intensive and time-consuming. While existing solutions have improved automation, they frequently rely on unrealistic assumptions. To address these challenges, we introduce a no
External link:
http://arxiv.org/abs/2411.16238