Showing 1 - 10 of 6,569 for search: '"Zhu, Kai"'
A Variational Autoencoder (VAE) aims to compress pixel data into a low-dimensional latent space, playing an important role in OpenAI's Sora and other latent video diffusion generation models. While most existing video VAEs inflate a pretrained image V
External link:
http://arxiv.org/abs/2411.06449
Author:
Zhu, Kai, Guo, Chenkai, Yan, Kuihao, Jia, Xiaoqi, Du, Haichao, Huang, Qingjia, Xie, Yamin, Tang, Jing
Analyzing programs with loops is a challenging task, suffering from potential issues such as indeterminate number of iterations and exponential growth of control flow complexity. Loop summarization, as a static analysis method for concrete semantic i
External link:
http://arxiv.org/abs/2411.02863
An accurate description of the photon spectrum line shape is essential for extracting resonance parameters of the $\eta_c$ meson through the radiative transition $J/\psi \to \gamma \eta_{c}$. However, a persistent challenge remains in the form of a d
External link:
http://arxiv.org/abs/2411.01984
The standard $\rm \Lambda$CDM cosmological model predicts that a large amount of diffuse neutral hydrogen distributes in cosmic filaments, which could be mapped through Lyman-alpha (Ly$\alpha$) emission observations. We use the hydrodynamical simulat
External link:
http://arxiv.org/abs/2409.11088
Author:
Yang, Zhantao, Feng, Ruili, Yan, Keyu, Wang, Huangji, Wang, Zhicai, Zhu, Shangwen, Zhang, Han, Xiao, Jie, Wu, Pingyu, Zhu, Kai, Chen, Jixuan, Xie, Chen-Wei, Mao, Chaojie, Yang, Yue, Zhang, Hongyang, Liu, Yu, Cheng, Fan
This paper presents Bag-of-Concept Graph (BACON) to gift models with limited linguistic abilities to taste the privilege of Vision Language Models (VLMs) and boost downstream tasks such as detection, visual question answering (VQA), and image generat
External link:
http://arxiv.org/abs/2407.03314
Author:
Fang, Zixun, Zhai, Wei, Su, Aimin, Song, Hongliang, Zhu, Kai, Wang, Mao, Chen, Yu, Liu, Zhiheng, Cao, Yang, Zha, Zheng-Jun
Video virtual try-on aims to transfer a clothing item onto the video of a target person. Directly applying the technique of image-based try-on to the video domain in a frame-wise manner will cause temporal-inconsistent outcomes while previous video-b
External link:
http://arxiv.org/abs/2405.11794
Author:
Liu, Zhiheng, Ouyang, Hao, Wang, Qiuyu, Cheng, Ka Leong, Xiao, Jie, Zhu, Kai, Xue, Nan, Liu, Yu, Shen, Yujun, Cao, Yang
3D Gaussians have recently emerged as an efficient representation for novel view synthesis. This work studies its editability with a particular focus on the inpainting task, which aims to supplement an incomplete set of 3D Gaussians with additional p
External link:
http://arxiv.org/abs/2404.11613
Recent methods utilize graph contrastive learning within graph-structured user-item interaction data for collaborative filtering and have demonstrated their efficacy in recommendation tasks. However, they ignore that the difference relation density o
External link:
http://arxiv.org/abs/2403.15075
This paper introduces a novel Partial Wave Analysis Code Generator (PWACG) that automatically generates high-performance partial wave analysis codes. This is achieved by leveraging the JAX automatic differentiation library and the jinja2 template eng
External link:
http://arxiv.org/abs/2403.09225
Ego-to-exo video generation refers to generating the corresponding exocentric video according to the egocentric video, providing valuable applications in AR/VR and embodied AI. Benefiting from advancements in diffusion model techniques, notable progr
External link:
http://arxiv.org/abs/2403.09194