Showing 1 - 10 of 61 for search: '"Liu, Runtao"'
Author:
He, Yingqing, Liu, Zhaoyang, Chen, Jingye, Tian, Zeyue, Liu, Hongyu, Chi, Xiaowei, Liu, Runtao, Yuan, Ruibin, Xing, Yazhou, Wang, Wenhai, Dai, Jifeng, Zhang, Yong, Xue, Wei, Liu, Qifeng, Guo, Yike, Chen, Qifeng
With the recent advancement in large language models (LLMs), there is a growing interest in combining LLMs with multimodal learning. Previous surveys of multimodal large language models (MLLMs) mainly focus on multimodal understanding. This survey el
External link:
http://arxiv.org/abs/2405.19334
With the ability to generate high-quality images, text-to-image (T2I) models can be exploited for creating inappropriate content. To prevent misuse, existing safety measures are either based on text blacklists, which can be easily circumvented, or ha
External link:
http://arxiv.org/abs/2404.08031
Multimodal Large Language Models (MLLMs) excel in generating responses based on visual inputs. However, they often suffer from a bias towards generating responses similar to their pretraining corpus, overshadowing the importance of visual information
External link:
http://arxiv.org/abs/2403.08730
Humans can easily segment moving objects without knowing what they are. That objectness could emerge from continuous visual observations motivates us to model grouping and movement concurrently from unlabeled videos. Our premise is that a video has d
External link:
http://arxiv.org/abs/2111.06394
Author:
Bao, Hongfei, Chen, Diancheng, Cao, Jiaqi, Jiang, Pengfeng, Li, Kaili, Liu, Runtao, Zhao, Yuling, Zheng, Yichun, Liao, Beiqi, Zhang, Yaming, Lu, Xia, Sun, Yang
Published in:
In Journal of Energy Chemistry August 2024 95:511-518
Sketches are the most abstract 2D representations of real-world objects. Although a sketch usually has geometrical distortion and lacks visual cues, humans can effortlessly envision a 3D object from it. This suggests that sketches encode the informat
External link:
http://arxiv.org/abs/2006.09694
Humans can envision a realistic photo given a free-hand sketch that is not only spatially imprecise and geometrically distorted but also without colors and visual details. We study unsupervised sketch-to-photo synthesis for the first time, learning f
External link:
http://arxiv.org/abs/1909.08313
Published in:
In Journal of Manufacturing Processes 8 September 2023 101:171-180
Referring object detection and referring image segmentation are important tasks that require joint understanding of visual information and natural language. Yet there has been evidence that current benchmark datasets suffer from bias, and current sta
External link:
http://arxiv.org/abs/1901.00850