Showing 1 - 10 of 138 for search: '"Tan, Zhiyu"'
Author:
Gao, Ziyi, Chen, Kai, Wei, Zhipeng, Mou, Tingshu, Chen, Jingjing, Tan, Zhiyu, Li, Hao, Jiang, Yu-Gang
Recent diffusion-based unrestricted attacks generate imperceptible adversarial examples with high transferability compared to previous unrestricted attacks and restricted attacks. However, existing works on diffusion-based unrestricted attacks are mo
External link:
http://arxiv.org/abs/2408.05479
The quality of video-text pairs fundamentally determines the upper bound of text-to-video models. Currently, the datasets used for training these models suffer from significant shortcomings, including low temporal consistency, poor-quality captions,
External link:
http://arxiv.org/abs/2408.02629
The recent advancements in text-to-image generative models have been remarkable. Yet, the field suffers from a lack of evaluation metrics that accurately reflect the performance of these models, particularly lacking fine-grained metrics that can guid
External link:
http://arxiv.org/abs/2406.16562
Author:
Tan, Zhiyu, Yang, Mengping, Qin, Luozheng, Yang, Hao, Qian, Ye, Zhou, Qiang, Zhang, Cheng, Li, Hao
One critical prerequisite for faithful text-to-image generation is the accurate understanding of text inputs. Existing methods leverage the text encoder of the CLIP model to represent input prompts. However, the pre-trained CLIP model can merely enco
External link:
http://arxiv.org/abs/2405.12914
Author:
Wang, Junyan, Sun, Zhenhong, Tan, Zhiyu, Chen, Xuanbai, Chen, Weihua, Li, Hao, Zhang, Cheng, Song, Yang
Vanilla text-to-image diffusion models struggle with generating accurate human images, commonly resulting in imperfect anatomies such as unnatural postures or disproportionate limbs. Existing methods address this issue mostly by fine-tuning the model
External link:
http://arxiv.org/abs/2403.05239
Author:
Tan, Zhiyu
In this paper we develop a new decomposition framework to deal with Lagrange multipliers of the Karush-Kuhn-Tucker (KKT) system of constrained optimization problems and variational inequalities in Hilbert spaces. It is different from existing framewo
External link:
http://arxiv.org/abs/2306.03261
Author:
Gong, Wei, Tan, Zhiyu
In this paper we propose a new finite element method for solving elliptic optimal control problems with pointwise state constraints, including the distributed controls and the Dirichlet or Neumann boundary controls. The main idea is to use energy spa
External link:
http://arxiv.org/abs/2306.03246
Semantic occupancy prediction aims to infer dense geometry and semantics of surroundings for an autonomous agent to operate safely in the 3D environment. Existing occupancy prediction methods are almost entirely trained on human-annotated volumetric
External link:
http://arxiv.org/abs/2305.16133
We investigate discontinuous Galerkin methods for an elliptic optimal control problem with a general state equation and pointwise state constraints on general polygonal domains. We show that discontinuous Galerkin methods for general second-order ell
External link:
http://arxiv.org/abs/2303.04973
Published in:
International Conference on Learning Representations (2022)
One critical component in lossy deep image compression is the entropy model, which predicts the probability distribution of the quantized latent representation in the encoding and decoding modules. Previous works build entropy models upon convolution
External link:
http://arxiv.org/abs/2202.05492