Showing 1 - 4 of 4
for search: '"Qin, Luozheng"'
The quality of video-text pairs fundamentally determines the upper bound of text-to-video models. Currently, the datasets used for training these models suffer from significant shortcomings, including low temporal consistency, poor-quality captions, …
External link:
http://arxiv.org/abs/2408.02629
The recent advancements in text-to-image generative models have been remarkable. Yet, the field suffers from a lack of evaluation metrics that accurately reflect the performance of these models, particularly lacking fine-grained metrics that can guide …
External link:
http://arxiv.org/abs/2406.16562
ChatGPT is instruction-tuned through Reinforcement Learning from Human Feedback (RLHF) to generate general, human-expected content aligned with human preferences, which can leave its responses insufficiently salient. Therefore, in this case, …
External link:
http://arxiv.org/abs/2406.01070
Author:
Tan, Zhiyu, Yang, Mengping, Qin, Luozheng, Yang, Hao, Qian, Ye, Zhou, Qiang, Zhang, Cheng, Li, Hao
One critical prerequisite for faithful text-to-image generation is the accurate understanding of text inputs. Existing methods leverage the text encoder of the CLIP model to represent input prompts. However, the pre-trained CLIP model can merely encode …
External link:
http://arxiv.org/abs/2405.12914