Zobrazeno 1 - 1
of 1
pro vyhledávání: '"Zhuge, Mingcheng"'
Autor:
Ji, Ge-Peng, Zhuge, Mingcheng, Gao, Dehong, Fan, Deng-Ping, Sakaridis, Christos, Van Gool, Luc
Publikováno v:
Machine Intelligence Research. 20, 421-434 (2023)
We present a masked vision-language transformer (MVLT) for fashion-specific multi-modal representation. Technically, we simply utilize vision transformer architecture for replacing the BERT in the pre-training model, making MVLT the first end-to-end
Externí odkaz:
http://arxiv.org/abs/2210.15110