Showing 1 - 10
of 696
for the search: '"Han, Shumin"'
Small CNN-based models usually require transferring knowledge from a large model before they are deployed in computationally resource-limited edge devices. Masked image modeling (MIM) methods achieve great success in various visual tasks but remain …
External link:
http://arxiv.org/abs/2309.09571
Recently, large-scale language-image models (e.g., text-guided diffusion models) have considerably improved image generation capabilities, producing photorealistic images in various domains. Based on this success, current image editing methods …
External link:
http://arxiv.org/abs/2305.04441
Pretraining on large-scale datasets can boost the performance of object detectors, while annotated datasets for object detection are hard to scale up due to the high labor cost. What we possess are numerous isolated field-specific datasets; thus, …
External link:
http://arxiv.org/abs/2304.03580
Author:
Duan, Xiaoyue, Kang, Guoliang, Wang, Runqi, Han, Shumin, Xue, Song, Wang, Tian, Zhang, Baochang
Robust Model-Agnostic Meta-Learning (MAML) is usually adopted to train a meta-model which may fast adapt to novel classes with only a few exemplars and meanwhile remain robust to adversarial attacks. The conventional solution for robust MAML is to …
External link:
http://arxiv.org/abs/2211.15180
Author:
Zhang, Xinyu, Chen, Jiahui, Yuan, Junkun, Chen, Qiang, Wang, Jian, Wang, Xiaodi, Han, Shumin, Chen, Xiaokang, Pi, Jimin, Yao, Kun, Han, Junyu, Ding, Errui, Wang, Jingdong
Masked image modeling (MIM) learns visual representation by masking and reconstructing image patches. Applying the reconstruction supervision on the CLIP representation has been proven effective for MIM. However, it is still under-explored how CLIP …
External link:
http://arxiv.org/abs/2211.09799
Author:
Han, Shumin1 (AUTHOR) hanshumin@lnpu.edu.cn, Shen, Kuixing1 (AUTHOR) wangchuang@lnpu.edu.cn, Shen, Derong2 (AUTHOR) shenderong@ise.neu.edu.cn, Wang, Chuang1 (AUTHOR)
Published in:
Mathematics (ISSN 2227-7390), Aug 2024, Vol. 12, Issue 15, p. 2337, 19 pp.
Author:
Wang, Yunhao, Sun, Huixin, Wang, Xiaodi, Zhang, Bin, Li, Chao, Xin, Ying, Zhang, Baochang, Ding, Errui, Han, Shumin
Vision Transformer and its variants have demonstrated great potential in various computer vision tasks. But conventional vision transformers often focus on global dependency at a coarse level, and they suffer from a learning challenge on global relation…
External link:
http://arxiv.org/abs/2209.01620
Author:
Chen, Xiaokang, Ding, Mingyu, Wang, Xiaodi, Xin, Ying, Mo, Shentong, Wang, Yunhao, Han, Shumin, Luo, Ping, Zeng, Gang, Wang, Jingdong
We present a novel masked image modeling (MIM) approach, context autoencoder (CAE), for self-supervised representation pretraining. We pretrain an encoder by making predictions in the encoded representation space. The pretraining tasks include two …
External link:
http://arxiv.org/abs/2202.03026
Author:
Zhang, Anyi, Pan, Xiangyu, Zhang, Ning, Jia, Qiuyue, Wu, Guanjiu, Wang, Wenfeng, Han, Shumin, Li, Yuan, Zhang, Lu
Published in:
In International Journal of Hydrogen Energy 11 October 2024 86:228-235