Showing 1 - 9 of 9
for search: '"Zhai, Zhonghua"'
Author:
Ju, Chen, Wang, Haicheng, Cheng, Haozhe, Chen, Xu, Zhai, Zhonghua, Huang, Weilin, Lan, Jinsong, Xiao, Shuai, Zheng, Bo
Vision-Language Large Models (VLMs) have recently become the primary backbone of AI due to their impressive performance. However, their expensive computation costs, i.e., throughput and delay, impede their potential in real-world scenarios. To achieve accelerat…
External link:
http://arxiv.org/abs/2407.11717
Author:
Xu, Zhengze, Chen, Mengting, Wang, Zhao, Xing, Linyu, Zhai, Zhonghua, Sang, Nong, Lan, Jinsong, Xiao, Shuai, Gao, Changxin
Video try-on is a challenging task that has not been well tackled in previous works. The main obstacle lies in preserving the details of the clothing and modeling coherent motions simultaneously. Faced with these difficulties, we address video try…
External link:
http://arxiv.org/abs/2404.17571
In this work, we propose the Cell Variational Information Bottleneck Network (cellVIB), a convolutional neural network using the information bottleneck mechanism, which can be combined with the latest feedforward network architecture in an end-to-end trainin…
External link:
http://arxiv.org/abs/2403.15082
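Note: the cellVIB snippet above names the information bottleneck mechanism but the truncated abstract does not show the paper's own formulation. As a general reminder only, the standard variational information bottleneck objective (with latent z, trade-off weight \beta, encoder p_\theta, decoder q_\phi, and prior r(z), all of which are standard notation and not taken from the paper) trades prediction against compression:

\[
\mathcal{L}_{\mathrm{IB}} \;=\; \mathbb{E}_{p(x,y)}\,\mathbb{E}_{p_\theta(z \mid x)}\big[-\log q_\phi(y \mid z)\big] \;+\; \beta\,\mathrm{KL}\big(p_\theta(z \mid x)\,\|\,r(z)\big)
\]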
This paper introduces a novel framework for virtual try-on, termed Wear-Any-Way. Different from previous methods, Wear-Any-Way is a customizable solution: besides generating high-fidelity results, our method allows users to precisely manipulate the…
External link:
http://arxiv.org/abs/2403.12965
Vision-Language Large Models (VLMs) have become the primary backbone of AI due to their impressive performance. However, their expensive computation costs, i.e., throughput and delay, impede their potential in real-world scenarios. To achieve acceleration for…
External link:
http://arxiv.org/abs/2312.07408
Author:
Cheng, Zida, Ju, Chen, Xiao, Shuai, Chen, Xu, Zhai, Zhonghua, Zeng, Xiaoyi, Huang, Weilin, Yan, Junchi
The rise of multi-modal search requests from users has highlighted the importance of multi-modal retrieval (i.e., image-to-text or text-to-image retrieval), yet the more complex task of image-to-multi-modal retrieval, crucial for many industry applica…
External link:
http://arxiv.org/abs/2305.03972
Academic article
Cross-modal retrieval, where the query is an image and the document is an item with both an image and a text description, is ubiquitous in e-commerce platforms and content-sharing social media. However, little research attention has been paid to this important…
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2b43673822b22e93ac48e8edda816a3e
http://arxiv.org/abs/2305.03972
Published in:
Journal of Intelligent & Fuzzy Systems, 2018, Vol. 35, Issue 3, pp. 3043-3049.