Zobrazeno 1 - 10
of 350
pro vyhledávání: '"Zhu Yukun"'
Publikováno v:
E3S Web of Conferences, Vol 561, p 02008 (2024)
Under the background of increasingly serious energy crisis, promoting the development of building energy conservation can effectively alleviate the problem of an energy shortage, ensure stable economic development, and energy-saving design has gradua
Externí odkaz:
https://doaj.org/article/12ba26f9ca1d4a6ba8e1c98f12a76328
Autor:
Huang, Hsin-Ping, Su, Yu-Chuan, Sun, Deqing, Jiang, Lu, Jia, Xuhui, Zhu, Yukun, Yang, Ming-Hsuan
Text-to-video generation has shown promising results. However, by taking only natural languages as input, users often face difficulties in providing detailed information to precisely control the model's output. In this work, we propose fine-grained c
Externí odkaz:
http://arxiv.org/abs/2312.02919
Autor:
Ayyubi, Hammad A., Liu, Tianqi, Nagrani, Arsha, Lin, Xudong, Zhang, Mingda, Arnab, Anurag, Han, Feng, Zhu, Yukun, Liu, Jialu, Chang, Shih-Fu
Existing popular video captioning benchmarks and models deal with generic captions devoid of specific person, place or organization named entities. In contrast, news videos present a challenging setting where the caption requires such named entities
Externí odkaz:
http://arxiv.org/abs/2312.02188
Autor:
Zhu, Alex Zihao, Mei, Jieru, Qiao, Siyuan, Yan, Hang, Zhu, Yukun, Chen, Liang-Chieh, Kretzschmar, Henrik
Semantic segmentation, which aims to classify every pixel in an image, is a key task in machine perception, with many applications across robotics and autonomous driving. Due to the high dimensionality of this task, most existing approaches use local
Externí odkaz:
http://arxiv.org/abs/2309.16889
Autor:
Yang, Chenglin, Qiao, Siyuan, Yu, Qihang, Yuan, Xiaoding, Zhu, Yukun, Yuille, Alan, Adam, Hartwig, Chen, Liang-Chieh
This paper presents MOAT, a family of neural networks that build on top of MObile convolution (i.e., inverted residual blocks) and ATtention. Unlike the current works that stack separate mobile convolution and transformer blocks, we effectively merge
Externí odkaz:
http://arxiv.org/abs/2210.01820
Autor:
Yu, Qihang, Wang, Huiyu, Qiao, Siyuan, Collins, Maxwell, Zhu, Yukun, Adam, Hartwig, Yuille, Alan, Chen, Liang-Chieh
The rise of transformers in vision tasks not only advances network backbone designs, but also starts a brand-new page to achieve end-to-end image recognition (e.g., object detection and panoptic segmentation). Originated from Natural Language Process
Externí odkaz:
http://arxiv.org/abs/2207.04044
Autor:
Yu, Qihang, Wang, Huiyu, Kim, Dahun, Qiao, Siyuan, Collins, Maxwell, Zhu, Yukun, Adam, Hartwig, Yuille, Alan, Chen, Liang-Chieh
We propose Clustering Mask Transformer (CMT-DeepLab), a transformer-based framework for panoptic segmentation designed around clustering. It rethinks the existing transformer architectures used in segmentation and detection; CMT-DeepLab considers the
Externí odkaz:
http://arxiv.org/abs/2206.08948
Autor:
Mei, Jieru, Zhu, Alex Zihao, Yan, Xinchen, Yan, Hang, Qiao, Siyuan, Zhu, Yukun, Chen, Liang-Chieh, Kretzschmar, Henrik, Anguelov, Dragomir
Panoptic image segmentation is the computer vision task of finding groups of pixels in an image and assigning semantic classes and object instance identifiers to them. Research in image segmentation has become increasingly popular due to its critical
Externí odkaz:
http://arxiv.org/abs/2206.07704
Autor:
Zhu, Yukun, Ji, Cheng, Chen, Yunbo, Hu, Huiqin, He, Ning, Zhang, Jinfeng, Chen, Youhua, Liu, Wenjie, Kuang, Cuifang
Publikováno v:
In Optics and Laser Technology October 2024 177
Autor:
Lu, Ping, Du, Baoyin, Liu, Ke, Luo, Ze, Sikandaier, Abiduweili, Diao, Lipeng, Sun, Jin, Jiang, Luhua, Zhu, Yukun
Publikováno v:
In Chinese Journal of Structural Chemistry August 2024 43(8)