Výsledky vyhledávání

Report

Ultra-High Resolution Segmentation via Boundary-Enhanced Patch-Merging Transformer

Autor: Sun, Haopeng, Zhang, Yingwei, Xu, Lumin, Jin, Sheng, Chen, Yiqiang

Segmentation of ultra-high resolution (UHR) images is a critical task with numerous applications, yet it poses significant challenges due to high spatial resolution and rich fine details. Recent approaches adopt a dual-branch architecture, where a gl

Externí odkaz: http://arxiv.org/abs/2412.10181

Zobrazit plný text záznamu

Report

Statistical inference for mean-field queueing systems

Autor: Lambadaris, Ioannis, Sid-Ali, Ahmed, Sun, Wei, Zhao, Yiqiang Q.

Mean-field limits have been used now as a standard tool in approximations, including for networks with a large number of nodes. Statistical inference on mean-filed models has attracted more attention recently mainly due to the rapid emergence of data

Externí odkaz: http://arxiv.org/abs/2411.12936

Zobrazit plný text záznamu

Report

Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars

Autor: Huang, Xuan, Li, Hanhui, Liu, Wanquan, Liang, Xiaodan, Yan, Yiqiang, Cheng, Yuhao, Gao, Chengqiang

In this paper, we propose to create animatable avatars for interacting hands with 3D Gaussian Splatting (GS) and single-image inputs. Existing GS-based methods designed for single subjects often yield unsatisfactory results due to limited input views

Externí odkaz: http://arxiv.org/abs/2410.08840

Zobrazit plný text záznamu

Report

Decoupled and Interactive Regression Modeling for High-performance One-stage 3D Object Detection

Autor: Xiao, Weiping, Wu, Yiqiang, Liu, Chang, Qin, Yu, Li, Xiaomao, Xin, Liming

Inadequate bounding box modeling in regression tasks constrains the performance of one-stage 3D object detection. Our study reveals that the primary reason lies in two aspects: (1) The limited center-offset prediction seriously impairs the bounding b

Externí odkaz: http://arxiv.org/abs/2409.00690

Zobrazit plný text záznamu

Report

Leveraging Self-supervised Audio Representations for Data-Efficient Acoustic Scene Classification

Autor: Cai, Yiqiang, Li, Shengchen, Shao, Xi

Acoustic scene classification (ASC) predominantly relies on supervised approaches. However, acquiring labeled data for training ASC models is often costly and time-consuming. Recently, self-supervised learning (SSL) has emerged as a powerful method f

Externí odkaz: http://arxiv.org/abs/2408.14862

Zobrazit plný text záznamu

Report

GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections

Autor: Zhang, Shiyue, Chong, Zheng, Zhang, Xujie, Li, Hanhui, Cheng, Yuhao, Yan, Yiqiang, Liang, Xiaodan

General text-to-image models bring revolutionary innovation to the fields of arts, design, and media. However, when applied to garment generation, even the state-of-the-art text-to-image models suffer from fine-grained semantic misalignment, particul

Externí odkaz: http://arxiv.org/abs/2408.12352

Zobrazit plný text záznamu

Report

Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application

Autor: Yang, Chuanpeng, Lu, Wang, Zhu, Yao, Wang, Yidong, Chen, Qian, Gao, Chenlong, Yan, Bingjie, Chen, Yiqiang

Large Language Models (LLMs) have showcased exceptional capabilities in various domains, attracting significant interest from both academia and industry. Despite their impressive performance, the substantial size and computational demands of LLMs pos

Externí odkaz: http://arxiv.org/abs/2407.01885

Zobrazit plný text záznamu

Report

GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents

Recently, Multimodal Large Language Models (MLLMs) have been used as agents to control keyboard and mouse inputs by directly perceiving the Graphical User Interface (GUI) and generating corresponding code. However, current agents primarily exhibit ex

Externí odkaz: http://arxiv.org/abs/2406.10819

Zobrazit plný text záznamu

Report

The $q$-Schur algebras in type $D$, I, fundamental multiplication formulas

Autor: Du, Jie, Li, Yiqiang, Zhao, Zhaozhao

By embedding the Hecke algebra $\check H_q$ of type $D$ into the Hecke algebra $H_{q,1}$ of type $B$ with unequal parameters $(q,1)$, the $q$-Schur algebras $S^\kappa_q(n,r)$ of type $D$ is naturally defined as the endomorphism algebra of the tensor

Externí odkaz: http://arxiv.org/abs/2406.09057

Zobrazit plný text záznamu

Report

AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation

Autor: Cheng, Junhao, Lu, Xi, Li, Hanhui, Zai, Khun Loun, Yin, Baiqiao, Cheng, Yuhao, Yan, Yiqiang, Liang, Xiaodan

As cutting-edge Text-to-Image (T2I) generation models already excel at producing remarkable single images, an even more challenging task, i.e., multi-turn interactive image generation begins to attract the attention of related research communities. T

Externí odkaz: http://arxiv.org/abs/2406.01388

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání