Výsledky vyhledávání

Report

CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization

Autor: Wu, Feize, Pang, Yun, Zhang, Junyi, Pang, Lianyu, Yin, Jian, Zhao, Baoquan, Li, Qing, Mao, Xudong

Recent advances in text-to-image personalization have enabled high-quality and controllable image synthesis for user-provided concepts. However, existing methods still struggle to balance identity preservation with text alignment. Our approach is bas

Externí odkaz: http://arxiv.org/abs/2408.15914

Zobrazit plný text záznamu

Report

Artificial Human Lecturers: Initial Findings From Asia's First AI Lecturers in Class to Promote Innovation in Education

Autor: Pang, Ching Christie, Zhao, Yawei, Yin, Zhizhuo, Sun, Jia, Mogavi, Reza Hadi, Hui, Pan

In recent years, artificial intelligence (AI) has become increasingly integrated into education, reshaping traditional learning environments. Despite this, there has been limited investigation into fully operational artificial human lecturers. To the

Externí odkaz: http://arxiv.org/abs/2410.03525

Zobrazit plný text záznamu

Report

Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration

Autor: Wu, Kangxi, Pang, Liang, Shen, Huawei, Cheng, Xueqi

The black-box nature of large language models (LLMs) poses challenges in interpreting results, impacting issues such as data intellectual property protection and hallucination tracing. Training data attribution (TDA) methods are considered effective

Externí odkaz: http://arxiv.org/abs/2410.01285

Zobrazit plný text záznamu

Report

Backdooring Vision-Language Models with Out-Of-Distribution Data

Autor: Lyu, Weimin, Yao, Jiachen, Gupta, Saumya, Pang, Lu, Sun, Tao, Yi, Lingjie, Hu, Lijie, Ling, Haibin, Chen, Chao

The emergence of Vision-Language Models (VLMs) represents a significant advancement in integrating computer vision with Large Language Models (LLMs) to generate detailed text descriptions from visual inputs. Despite their growing importance, the secu

Externí odkaz: http://arxiv.org/abs/2410.01264

Zobrazit plný text záznamu

Report

EC-DIT: Scaling Diffusion Transformers with Adaptive Expert-Choice Routing

Autor: Sun, Haotian, Zhang, Bowen, Li, Yanghao, Huang, Haoshuo, Lei, Tao, Pang, Ruoming, Dai, Bo, Du, Nan

Diffusion transformers have been widely adopted for text-to-image synthesis. While scaling these models up to billions of parameters shows promise, the effectiveness of scaling beyond current sizes remains underexplored and challenging. By explicitly

Externí odkaz: http://arxiv.org/abs/2410.02098

Zobrazit plný text záznamu

Report

Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo

Autor: Feng, Shengyu, Kong, Xiang, Ma, Shuang, Zhang, Aonan, Yin, Dong, Wang, Chong, Pang, Ruoming, Yang, Yiming

Augmenting the multi-step reasoning abilities of Large Language Models (LLMs) has been a persistent challenge. Recently, verification has shown promise in improving solution consistency by evaluating generated outputs. However, current verification a

Externí odkaz: http://arxiv.org/abs/2410.01920

Zobrazit plný text záznamu

Report

The Early Bird Catches the Leak: Unveiling Timing Side Channels in LLM Serving Systems

Autor: Song, Linke, Pang, Zixuan, Wang, Wenhao, Wang, Zihao, Wang, XiaoFeng, Chen, Hongbo, Song, Wei, Jin, Yier, Meng, Dan, Hou, Rui

The wide deployment of Large Language Models (LLMs) has given rise to strong demands for optimizing their inference performance. Today's techniques serving this purpose primarily focus on reducing latency and improving throughput through algorithmic

Externí odkaz: http://arxiv.org/abs/2409.20002

Zobrazit plný text záznamu

Report

Multimodal Misinformation Detection by Learning from Synthetic Data with Multimodal LLMs

Autor: Zeng, Fengzhu, Li, Wenqian, Gao, Wei, Pang, Yan

Detecting multimodal misinformation, especially in the form of image-text pairs, is crucial. Obtaining large-scale, high-quality real-world fact-checking datasets for training detectors is costly, leading researchers to use synthetic datasets generat

Externí odkaz: http://arxiv.org/abs/2409.19656

Zobrazit plný text záznamu

Report

TrojVLM: Backdoor Attack Against Vision Language Models

Autor: Lyu, Weimin, Pang, Lu, Ma, Tengfei, Ling, Haibin, Chen, Chao

The emergence of Vision Language Models (VLMs) is a significant advancement in integrating computer vision with Large Language Models (LLMs) to produce detailed text descriptions based on visual inputs, yet it introduces new security vulnerabilities.

Externí odkaz: http://arxiv.org/abs/2409.19232

Zobrazit plný text záznamu

Report

SinoSynth: A Physics-based Domain Randomization Approach for Generalizable CBCT Image Enhancement

Autor: Pang, Yunkui, Liu, Yilin, Chen, Xu, Yap, Pew-Thian, Lian, Jun

Cone Beam Computed Tomography (CBCT) finds diverse applications in medicine. Ensuring high image quality in CBCT scans is essential for accurate diagnosis and treatment delivery. Yet, the susceptibility of CBCT images to noise and artifacts undermine

Externí odkaz: http://arxiv.org/abs/2409.18355

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání