Výsledky vyhledávání

Report

3D-GP-LMVIC: Learning-based Multi-View Image Coding with 3D Gaussian Geometric Priors

Autor: Huang, Yujun, Chen, Bin, Lian, Niu, An, Baoyi, Xia, Shu-Tao

Multi-view image compression is vital for 3D-related applications. To effectively model correlations between views, existing methods typically predict disparity between two views on a 2D plane, which works well for small disparities, such as in stere

Externí odkaz: http://arxiv.org/abs/2409.04013

Zobrazit plný text záznamu

Report

Anno-incomplete Multi-dataset Detection

Autor: Xu, Yiran, Zhong, Haoxiang, Wu, Kai, Li, Jialin, Liu, Yong, Wang, Chengjie, Xia, Shu-Tao, Liao, Hongen

Object detectors have shown outstanding performance on various public datasets. However, annotating a new dataset for a new task is usually unavoidable in real, since 1) a single existing dataset usually does not contain all object categories needed;

Externí odkaz: http://arxiv.org/abs/2408.16247

Zobrazit plný text záznamu

Report

Large Point-to-Gaussian Model for Image-to-3D Generation

Autor: Lu, Longfei, Gao, Huachen, Dai, Tao, Zha, Yaohua, Hou, Zhi, Wu, Junta, Xia, Shu-Tao

Recently, image-to-3D approaches have significantly advanced the generation quality and speed of 3D assets based on large reconstruction models, particularly 3D Gaussian reconstruction models. Existing large 3D Gaussian models directly map 2D image t

Externí odkaz: http://arxiv.org/abs/2408.10935

Zobrazit plný text záznamu

Report

A Closer Look at GAN Priors: Exploiting Intermediate Features for Enhanced Model Inversion Attacks

Autor: Qiu, Yixiang, Fang, Hao, Yu, Hongyao, Chen, Bin, Qiu, MeiKang, Xia, Shu-Tao

Model Inversion (MI) attacks aim to reconstruct privacy-sensitive training data from released models by utilizing output information, raising extensive concerns about the security of Deep Neural Networks (DNNs). Recent advances in generative adversar

Externí odkaz: http://arxiv.org/abs/2407.13863

Zobrazit plný text záznamu

Report

CLIP-Guided Generative Networks for Transferable Targeted Adversarial Attacks

Autor: Fang, Hao, Kong, Jiawei, Chen, Bin, Dai, Tao, Wu, Hao, Xia, Shu-Tao

Transferable targeted adversarial attacks aim to mislead models into outputting adversary-specified predictions in black-box scenarios. Recent studies have introduced \textit{single-target} generative attacks that train a generator for each target cl

Externí odkaz: http://arxiv.org/abs/2407.10179

Zobrazit plný text záznamu

Report

Pre-training Point Cloud Compact Model with Partial-aware Reconstruction

Autor: Zha, Yaohua, Wang, Yanzi, Dai, Tao, Xia, Shu-Tao

The pre-trained point cloud model based on Masked Point Modeling (MPM) has exhibited substantial improvements across various tasks. However, two drawbacks hinder their practical application. Firstly, the positional embedding of masked patches in the

Externí odkaz: http://arxiv.org/abs/2407.09344

Zobrazit plný text záznamu

Report

Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach

Autor: Zhang, Taolin, Bai, Jiawang, Lu, Zhihe, Lian, Dongze, Wang, Genping, Wang, Xinchao, Xia, Shu-Tao

Recent works on parameter-efficient transfer learning (PETL) show the potential to adapt a pre-trained Vision Transformer to downstream recognition tasks with only a few learnable parameters. However, since they usually insert new structures into the

Externí odkaz: http://arxiv.org/abs/2407.06964

Zobrazit plný text záznamu

Report

Video Watermarking: Safeguarding Your Video from (Unauthorized) Annotations by Video-based LLMs

Autor: Li, Jinmin, Gao, Kuofeng, Bai, Yang, Zhang, Jingyun, Xia, Shu-Tao

The advent of video-based Large Language Models (LLMs) has significantly enhanced video understanding. However, it has also raised some safety concerns regarding data protection, as videos can be more easily annotated, even without authorization. Thi

Externí odkaz: http://arxiv.org/abs/2407.02411

Zobrazit plný text záznamu

Report

DreamBench++: A Human-Aligned Benchmark for Personalized Image Generation

Autor: Peng, Yuang, Cui, Yuxin, Tang, Haomiao, Qi, Zekun, Dong, Runpei, Bai, Jing, Han, Chunrui, Ge, Zheng, Zhang, Xiangyu, Xia, Shu-Tao

Personalized image generation holds great promise in assisting humans in everyday work and life due to its impressive function in creatively generating personalized content. However, current evaluations either are automated but misalign with humans o

Externí odkaz: http://arxiv.org/abs/2406.16855

Zobrazit plný text záznamu

Report

Hierarchical Features Matter: A Deep Exploration of GAN Priors for Improved Dataset Distillation

Autor: Zhong, Xinhao, Fang, Hao, Chen, Bin, Gu, Xulin, Dai, Tao, Qiu, Meikang, Xia, Shu-Tao

Dataset distillation is an emerging dataset reduction method, which condenses large-scale datasets while maintaining task accuracy. Current methods have integrated parameterization techniques to boost synthetic dataset performance by shifting the opt

Externí odkaz: http://arxiv.org/abs/2406.05704

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání