Zobrazeno 1 - 4
of 4
pro vyhledávání: '"Chen, Yunkuo"'
Autor:
Xu, Jiaqi, Zou, Xinyi, Huang, Kunzhe, Chen, Yunkuo, Liu, Bo, Cheng, MengLi, Shi, Xing, Huang, Jun
This paper presents EasyAnimate, an advanced method for video generation that leverages the power of transformer architecture for high-performance outcomes. We have expanded the DiT framework originally designed for 2D image synthesis to accommodate
Externí odkaz:
http://arxiv.org/abs/2405.18991
Video-and-language understanding has a variety of applications in the industry, such as video question answering, text-video retrieval, and multi-label classification. Existing video-and-language understanding methods generally adopt heavy multi-moda
Externí odkaz:
http://arxiv.org/abs/2303.05707
MuLTI: Efficient Video-and-Language Understanding with MultiWay-Sampler and Multiple Choice Modeling
Video-and-language understanding has a variety of applications in the industry, such as video question answering, text-video retrieval and multi-label classification. Existing video-and-language understanding methods generally adopt heavy multi-modal
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::59eee4bbc681178b4d1e06789b38f0cb
http://arxiv.org/abs/2303.05707
http://arxiv.org/abs/2303.05707