Search results

Report

LLMs Meet Multimodal Generation and Editing: A Survey

Authors: He, Yingqing; Liu, Zhaoyang; Chen, Jingye; Tian, Zeyue; Liu, Hongyu; Chi, Xiaowei; Liu, Runtao; Yuan, Ruibin; Xing, Yazhou; Wang, Wenhai; Dai, Jifeng; Zhang, Yong; Xue, Wei; Liu, Qifeng; Guo, Yike; Chen, Qifeng

With the recent advancement in large language models (LLMs), there is a growing interest in combining LLMs with multimodal learning. Previous surveys of multimodal large language models (MLLMs) mainly focus on multimodal understanding. This survey el

External link: http://arxiv.org/abs/2405.19334

Report

Latent Guard: a Safety Framework for Text-to-image Generation

Authors: Liu, Runtao; Khakzar, Ashkan; Gu, Jindong; Chen, Qifeng; Torr, Philip; Pizzati, Fabio

With the ability to generate high-quality images, text-to-image (T2I) models can be exploited for creating inappropriate content. To prevent misuse, existing safety measures are either based on text blacklists, which can be easily circumvented, or ha

External link: http://arxiv.org/abs/2404.08031

Report

Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization

Authors: Pi, Renjie; Han, Tianyang; Xiong, Wei; Zhang, Jipeng; Liu, Runtao; Pan, Rui; Zhang, Tong

Multimodal Large Language Models (MLLMs) excel in generating responses based on visual inputs. However, they often suffer from a bias towards generating responses similar to their pretraining corpus, overshadowing the importance of visual information

External link: http://arxiv.org/abs/2403.08730

Report

The Emergence of Objectness: Learning Zero-Shot Segmentation from Videos

Authors: Liu, Runtao; Wu, Zhirong; Yu, Stella X.; Lin, Stephen

Humans can easily segment moving objects without knowing what they are. That objectness could emerge from continuous visual observations motivates us to model grouping and movement concurrently from unlabeled videos. Our premise is that a video has d

External link: http://arxiv.org/abs/2111.06394

Academic article

Boosting the cycling stability of all-solid-state lithium metal batteries through MOF-based polymeric protective layers

Authors: Bao, Hongfei; Chen, Diancheng; Cao, Jiaqi; Jiang, Pengfeng; Li, Kaili; Liu, Runtao; Zhao, Yuling; Zheng, Yichun; Liao, Beiqi; Zhang, Yaming; Lu, Xia; Sun, Yang

Published in: Journal of Energy Chemistry, August 2024, 95:511-518

Report

3D Shape Reconstruction from Free-Hand Sketches

Authors: Wang, Jiayun; Lin, Jierui; Yu, Qian; Liu, Runtao; Chen, Yubei; Yu, Stella X.

Sketches are the most abstract 2D representations of real-world objects. Although a sketch usually has geometrical distortion and lacks visual cues, humans can effortlessly envision a 3D object from it. This suggests that sketches encode the informat

External link: http://arxiv.org/abs/2006.09694

Report

Unsupervised Sketch-to-Photo Synthesis

Authors: Liu, Runtao; Yu, Qian; Yu, Stella

Humans can envision a realistic photo given a free-hand sketch that is not only spatially imprecise and geometrically distorted but also without colors and visual details. We study unsupervised sketch-to-photo synthesis for the first time, learning f

External link: http://arxiv.org/abs/1909.08313

Academic article

Study of twin tungsten electrode – wire electrode indirect arc welding assisted by alternating magnetic field

Authors: Liu, Liming; Zhu, Yanli; Liu, Runtao

Published in: Journal of Manufacturing Processes, 8 September 2023, 101:171-180

Report

CLEVR-Ref+: Diagnosing Visual Reasoning with Referring Expressions

Authors: Liu, Runtao; Liu, Chenxi; Bai, Yutong; Yuille, Alan

Referring object detection and referring image segmentation are important tasks that require joint understanding of visual information and natural language. Yet there has been evidence that current benchmark datasets suffer from bias, and current sta

External link: http://arxiv.org/abs/1901.00850
