Search results - "Zhuge, Mingcheng"

Report

Masked Vision-Language Transformer in Fashion

Authors: Ji, Ge-Peng, Zhuge, Mingcheng, Gao, Dehong, Fan, Deng-Ping, Sakaridis, Christos, Van Gool, Luc

Published in: Machine Intelligence Research. 20, 421-434 (2023)

We present a masked vision-language transformer (MVLT) for fashion-specific multi-modal representation. Technically, we simply utilize vision transformer architecture for replacing the BERT in the pre-training model, making MVLT the first end-to-end

External link: http://arxiv.org/abs/2210.15110
