Showing 1 - 10 of 755 for search: '"Dai, Tao"'
Fine-tuning large-scale text-to-image diffusion models for various downstream tasks has yielded impressive results. However, the heavy computational burdens of tuning large models prevent personal customization. Recent advances have attempted to empl
External link:
http://arxiv.org/abs/2410.21759
Adaptation of pretrained vision-language models such as CLIP to various downstream tasks has raised great interest in recent research. Previous works have proposed a variety of test-time adaptation (TTA) methods to achieve strong generalization wi
External link:
http://arxiv.org/abs/2410.15430
Authors:
Zha, Yaohua, Dai, Tao, Wang, Yanzi, Guo, Hang, Zhang, Taolin, Ouyang, Zhihao, Fan, Chunlin, Chen, Bin, Chen, Ke, Xia, Shu-Tao
Point clouds, as a primary representation of 3D data, can be categorized into scene domain point clouds and object domain point clouds based on the modeled content. Masked autoencoders (MAE) have become the mainstream paradigm in point clouds self-su
External link:
http://arxiv.org/abs/2410.09886
Authors:
Zhang, Taolin, Pan, Junwei, Wang, Jinpeng, Zha, Yaohua, Dai, Tao, Chen, Bin, Luo, Ruisheng, Deng, Xiaoxiang, Wang, Yuan, Yue, Ming, Jiang, Jie, Xia, Shu-Tao
With recent advances in large language models (LLMs), a growing body of research has focused on developing Semantic IDs based on LLMs to enhance the performance of recommendation systems. However, the dimension of these embeddings needs to match
External link:
http://arxiv.org/abs/2410.09560
Recent advances in diffusion-based Large Restoration Models (LRMs) have significantly improved photo-realistic image restoration by leveraging the internal knowledge embedded within model weights. However, existing LRMs often suffer from the hallucin
External link:
http://arxiv.org/abs/2410.05601
Non-stationarity poses significant challenges for multivariate time series forecasting due to the inherent short-term fluctuations and long-term trends that can lead to spurious regressions or obscure essential long-term relationships. Most existing
External link:
http://arxiv.org/abs/2410.04442
Recently, image-to-3D approaches have significantly advanced the generation quality and speed of 3D assets based on large reconstruction models, particularly 3D Gaussian reconstruction models. Existing large 3D Gaussian models directly map 2D image t
External link:
http://arxiv.org/abs/2408.10935
High-resolution point cloud (HRPCD) anomaly detection (AD) plays a critical role in precision machining and high-end equipment manufacturing. Although a considerable number of 3D-AD methods have been proposed recently, they still cannot meet the requirement
External link:
http://arxiv.org/abs/2408.04604
Transferable targeted adversarial attacks aim to mislead models into outputting adversary-specified predictions in black-box scenarios. Recent studies have introduced single-target generative attacks that train a generator for each target cl
External link:
http://arxiv.org/abs/2407.10179
Pre-trained point cloud models based on Masked Point Modeling (MPM) have exhibited substantial improvements across various tasks. However, two drawbacks hinder their practical application. Firstly, the positional embedding of masked patches in the
External link:
http://arxiv.org/abs/2407.09344