Výsledky vyhledávání

Report

Improved Video VAE for Latent Video Diffusion Model

Autor: Wu, Pingyu, Zhu, Kai, Liu, Yu, Zhao, Liming, Zhai, Wei, Cao, Yang, Zha, Zheng-Jun

Variational Autoencoder (VAE) aims to compress pixel data into low-dimensional latent space, playing an important role in OpenAI's Sora and other latent video diffusion generation models. While most of existing video VAEs inflate a pretrained image V

Externí odkaz: http://arxiv.org/abs/2411.06449

Zobrazit plný text záznamu

Report

LoopSCC: Towards Summarizing Multi-branch Loops within Determinate Cycles

Autor: Zhu, Kai, Guo, Chenkai, Yan, Kuihao, Jia, Xiaoqi, Du, Haichao, Huang, Qingjia, Xie, Yamin, Tang, Jing

Analyzing programs with loops is a challenging task, suffering from potential issues such as indeterminate number of iterations and exponential growth of control flow complexity. Loop summarization, as a static analysis method for concrete semantic i

Externí odkaz: http://arxiv.org/abs/2411.02863

Zobrazit plný text záznamu

Report

Line shape of the $J\psi \to \gamma \eta_{c}$ decay

Autor: Wang, Ting, Wang, Xiaolong, Liao, Guangrui, Zhu, Kai

An accurate description of the photon spectrum line shape is essential for extracting resonance parameters of the $\eta_c$ meson through the radiative transition $J/\psi \to \gamma \eta_{c}$. However, a persistent challenge remains in the form of a d

Externí odkaz: http://arxiv.org/abs/2411.01984

Zobrazit plný text záznamu

Report

Prospects for detecting cosmic filaments in Lyman-alpha emission across redshifts $z=2-5$

Autor: Liu, Yizhou, Gao, Liang, Liao, Shihong, Zhu, Kai

The standard $\rm \Lambda$CDM cosmological model predicts that a large amount of diffuse neutral hydrogen distributes in cosmic filaments, which could be mapped through Lyman-alpha (Ly$\alpha$) emission observations. We use the hydrodynamical simulat

Externí odkaz: http://arxiv.org/abs/2409.11088

Zobrazit plný text záznamu

Report

BACON: Supercharge Your VLM with Bag-of-Concept Graph to Mitigate Hallucinations

Autor: Yang, Zhantao, Feng, Ruili, Yan, Keyu, Wang, Huangji, Wang, Zhicai, Zhu, Shangwen, Zhang, Han, Xiao, Jie, Wu, Pingyu, Zhu, Kai, Chen, Jixuan, Xie, Chen-Wei, Mao, Chaojie, Yang, Yue, Zhang, Hongyang, Liu, Yu, Cheng, Fan

This paper presents Bag-of-Concept Graph (BACON) to gift models with limited linguistic abilities to taste the privilege of Vision Language Models (VLMs) and boost downstream tasks such as detection, visual question answering (VQA), and image generat

Externí odkaz: http://arxiv.org/abs/2407.03314

Zobrazit plný text záznamu

Report

ViViD: Video Virtual Try-on using Diffusion Models

Autor: Fang, Zixun, Zhai, Wei, Su, Aimin, Song, Hongliang, Zhu, Kai, Wang, Mao, Chen, Yu, Liu, Zhiheng, Cao, Yang, Zha, Zheng-Jun

Video virtual try-on aims to transfer a clothing item onto the video of a target person. Directly applying the technique of image-based try-on to the video domain in a frame-wise manner will cause temporal-inconsistent outcomes while previous video-b

Externí odkaz: http://arxiv.org/abs/2405.11794

Zobrazit plný text záznamu

Report

InFusion: Inpainting 3D Gaussians via Learning Depth Completion from Diffusion Prior

Autor: Liu, Zhiheng, Ouyang, Hao, Wang, Qiuyu, Cheng, Ka Leong, Xiao, Jie, Zhu, Kai, Xue, Nan, Liu, Yu, Shen, Yujun, Cao, Yang

3D Gaussians have recently emerged as an efficient representation for novel view synthesis. This work studies its editability with a particular focus on the inpainting task, which aims to supplement an incomplete set of 3D Gaussians with additional p

Externí odkaz: http://arxiv.org/abs/2404.11613

Zobrazit plný text záznamu

Report

Bilateral Unsymmetrical Graph Contrastive Learning for Recommendation

Autor: Yu, Jiaheng, Li, Jing, He, Yue, Zhu, Kai, Zhang, Shuyi, Hu, Wen

Recent methods utilize graph contrastive Learning within graph-structured user-item interaction data for collaborative filtering and have demonstrated their efficacy in recommendation tasks. However, they ignore that the difference relation density o

Externí odkaz: http://arxiv.org/abs/2403.15075

Zobrazit plný text záznamu

Report

PWACG: Partial Wave Analysis Code Generator supporting Newton-conjugate gradient method

Autor: Dong, Xiang, Sun, Yu-Chang, Pan, Chu-Cheng, Cheng, Ao-Yan, Wang, Ao-Bo, Cai, Hao, Zhu, Kai

This paper introduces a novel Partial Wave Analysis Code Generator (PWACG) that automatically generates high-performance partial wave analysis codes. This is achieved by leveraging the JAX automatic differentiation library and the jinja2 template eng

Externí odkaz: http://arxiv.org/abs/2403.09225

Zobrazit plný text záznamu

Report

Intention-driven Ego-to-Exo Video Generation

Autor: Luo, Hongchen, Zhu, Kai, Zhai, Wei, Cao, Yang

Ego-to-exo video generation refers to generating the corresponding exocentric video according to the egocentric video, providing valuable applications in AR/VR and embodied AI. Benefiting from advancements in diffusion model techniques, notable progr

Externí odkaz: http://arxiv.org/abs/2403.09194

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání