Výsledky vyhledávání

Akademický článek

Study on cause mechanism of corner oxygen deficiency in mining face of extra-thick spontaneous combustion coal seam

Autor: GAO Ruiqing, WANG Fei, LIU Hongwei, ZHANG Xiaolong, HE Zhihong, LIU Zhenming, WANG Zibang, GONG Biao

Publikováno v: Meikuang Anquan, Vol 53, Iss 10, Pp 160-167 (2022)

Taking the 8105 working face of Madaotou Mine as the engineering background, the numerical calculation model of the flow field of the mining face in the extra-thick and spontaneous coal seam was established to analyze the oxygen consumption rate in t

Externí odkaz: https://doaj.org/article/82c53b70d0594220ab14108ef1dda4d3

Zobrazit plný text záznamu

Report

StyleTokenizer: Defining Image Style by a Single Instance for Controlling Diffusion Models

Autor: Li, Wen, Fang, Muyuan, Zou, Cheng, Gong, Biao, Zheng, Ruobing, Wang, Meng, Chen, Jingdong, Yang, Ming

Despite the burst of innovative methods for controlling the diffusion process, effectively controlling image styles in text-to-image generation remains a challenging task. Many adapter-based methods impose image representation conditions on the denoi

Externí odkaz: http://arxiv.org/abs/2409.02543

Zobrazit plný text záznamu

Report

Focus-Consistent Multi-Level Aggregation for Compositional Zero-Shot Learning

Autor: Dai, Fengyuan, Huang, Siteng, Zhang, Min, Gong, Biao, Wang, Donglin

To transfer knowledge from seen attribute-object compositions to recognize unseen ones, recent compositional zero-shot learning (CZSL) methods mainly discuss the optimal classification branches to identify the elements, leading to the popularity of e

Externí odkaz: http://arxiv.org/abs/2408.17083

Zobrazit plný text záznamu

Report

CURE4Rec: A Benchmark for Recommendation Unlearning with Deeper Influence

Autor: Chen, Chaochao, Zhang, Jiaming, Zhang, Yizhao, Zhang, Li, Lyu, Lingjuan, Li, Yuyuan, Gong, Biao, Yan, Chenggang

With increasing privacy concerns in artificial intelligence, regulations have mandated the right to be forgotten, granting individuals the right to withdraw their data from models. Machine unlearning has emerged as a potential solution to enable sele

Externí odkaz: http://arxiv.org/abs/2408.14393

Zobrazit plný text záznamu

Report

Accelerating Pre-training of Multimodal LLMs via Chain-of-Sight

Autor: Huang, Ziyuan, Ji, Kaixiang, Gong, Biao, Qing, Zhiwu, Zhang, Qinglong, Zheng, Kecheng, Wang, Jian, Chen, Jingdong, Yang, Ming

This paper introduces Chain-of-Sight, a vision-language bridge module that accelerates the pre-training of Multimodal Large Language Models (MLLMs). Our approach employs a sequence of visual resamplers that capture visual details at various spacial s

Externí odkaz: http://arxiv.org/abs/2407.15819

Zobrazit plný text záznamu

Report

ResMaster: Mastering High-Resolution Image Generation via Structural and Fine-Grained Guidance

Autor: Shi, Shuwei, Li, Wenbo, Zhang, Yuechen, He, Jingwen, Gong, Biao, Zheng, Yinqiang

Diffusion models excel at producing high-quality images; however, scaling to higher resolutions, such as 4K, often results in over-smoothed content, structural distortions, and repetitive patterns. To this end, we introduce ResMaster, a novel, traini

Externí odkaz: http://arxiv.org/abs/2406.16476

Zobrazit plný text záznamu

Akademický článek

Analysis of Vibration Characteristics of Optical Fiber Composite Submarine Cable under Anchoring

Autor: SHANG Qiu-feng, GONG Biao, ZHENG Guo-qiang

Publikováno v: Guangtongxin yanjiu, Vol , Pp 45-49,56 (2021)

In view of the insufficient research on the vibration characteristics of the submarine cable fault signal, A three-dimensional finite element model of the optical fiber composite submarine cable (submarine cable) under the anchoring action is establi

Externí odkaz: https://doaj.org/article/b6eb0552f4344262b59ea7f285398f00

Zobrazit plný text záznamu

Report

A Recipe for Scaling up Text-to-Video Generation with Text-free Videos

Autor: Wang, Xiang, Zhang, Shiwei, Yuan, Hangjie, Qing, Zhiwu, Gong, Biao, Zhang, Yingya, Shen, Yujun, Gao, Changxin, Sang, Nong

Diffusion-based text-to-video generation has witnessed impressive progress in the past year yet still falls behind text-to-image generation. One of the key reasons is the limited scale of publicly available data (e.g., 10M video-text pairs in WebVid1

Externí odkaz: http://arxiv.org/abs/2312.15770

Zobrazit plný text záznamu

Report

Ranni: Taming Text-to-Image Diffusion for Accurate Instruction Following

Autor: Feng, Yutong, Gong, Biao, Chen, Di, Shen, Yujun, Liu, Yu, Zhou, Jingren

Existing text-to-image (T2I) diffusion models usually struggle in interpreting complex prompts, especially those with quantity, object-attribute binding, and multi-subject descriptions. In this work, we introduce a semantic panel as the middleware in

Externí odkaz: http://arxiv.org/abs/2311.17002

Zobrazit plný text záznamu

Report

Learning Disentangled Identifiers for Action-Customized Text-to-Image Generation

Autor: Huang, Siteng, Gong, Biao, Feng, Yutong, Chen, Xi, Fu, Yuqian, Liu, Yu, Wang, Donglin

This study focuses on a novel task in text-to-image (T2I) generation, namely action customization. The objective of this task is to learn the co-existing action from limited data and generalize it to unseen humans or even animals. Experimental result

Externí odkaz: http://arxiv.org/abs/2311.15841

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání