Výsledky vyhledávání - "Zhou, Yuanen"

Report

Embedded Heterogeneous Attention Transformer for Cross-lingual Image Captioning

Autor: Song, Zijie, Hu, Zhenzhen, Zhou, Yuanen, Zhao, Ye, Hong, Richang, Wang, Meng

Cross-lingual image captioning is a challenging task that requires addressing both cross-lingual and cross-modal obstacles in multimedia analysis. The crucial issue in this task is to model the global and the local matching between the image and diff

Externí odkaz: http://arxiv.org/abs/2307.09915

Zobrazit plný text záznamu

Report

Compact Bidirectional Transformer for Image Captioning

Autor: Zhou, Yuanen, Hu, Zhenzhen, Liu, Daqing, Ben, Huixia, Wang, Meng

Most current image captioning models typically generate captions from left to right. This unidirectional property makes them can only leverage past context but not future context. Though recent refinement-based models can exploit both past and future

Externí odkaz: http://arxiv.org/abs/2201.01984

Zobrazit plný text záznamu

Report

Semi-Autoregressive Transformer for Image Captioning

Autor: Zhou, Yuanen, Zhang, Yong, Hu, Zhenzhen, Wang, Meng

Current state-of-the-art image captioning models adopt autoregressive decoders, \ie they generate each word by conditioning on previously generated words, which leads to heavy latency during inference. To tackle this issue, non-autoregressive image c

Externí odkaz: http://arxiv.org/abs/2106.09436

Zobrazit plný text záznamu

Report

More Grounded Image Captioning by Distilling Image-Text Matching Model

Autor: Zhou, Yuanen, Wang, Meng, Liu, Daqing, Hu, Zhenzhen, Zhang, Hanwang

Visual attention not only improves the performance of image captioners, but also serves as a visual interpretation to qualitatively measure the caption rationality and model transparency. Specifically, we expect that a captioner can fix its attentive

Externí odkaz: http://arxiv.org/abs/2004.00390

Zobrazit plný text záznamu

Akademický článek

Efficiently Gluing Pre-Trained Language and Vision Models for Image Captioning.

Autor: Song, Peipei, Zhou, Yuanen, Yang, Xun, Liu, Daqing, Hu, Zhenzhen, Wang, Depeng, Wang, Meng

Publikováno v: ACM Transactions on Intelligent Systems & Technology; Dec2024, Vol. 15 Issue 6, p1-16, 16p

Zobrazit plný text záznamu

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.

Akademický článek

Sequential image encoding for vision-to-language problems.

Autor: Wang, Jicheng, Zhou, Yuanen, Hu, Zhenzhen, Zhang, Xu, Wang, Meng

Publikováno v: Multimedia Tools & Applications; May2021, Vol. 80 Issue 11, p16141-16152, 12p

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání