Výsledky vyhledávání - "Gong, YongShun"

Report

Content-aware Balanced Spectrum Encoding in Masked Modeling for Time Series Classification

Autor: Han, Yudong, Wang, Haocong, Hu, Yupeng, Gong, Yongshun, Song, Xuemeng, Guan, Weili

Due to the superior ability of global dependency, transformer and its variants have become the primary choice in Masked Time-series Modeling (MTM) towards time-series classification task. In this paper, we experimentally analyze that existing transfo

Externí odkaz: http://arxiv.org/abs/2412.13232

Zobrazit plný text záznamu

Report

An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models

Autor: Qu, Wentao, Wang, Jing, Gong, YongShun, Huang, Xiaoshui, Xiao, Liang

Existing conditional Denoising Diffusion Probabilistic Models (DDPMs) with a Noise-Conditional Framework (NCF) remain challenging for 3D scene understanding tasks, as the complex geometric details in scenes increase the difficulty of fitting the grad

Externí odkaz: http://arxiv.org/abs/2411.16308

Zobrazit plný text záznamu

Report

Deceiving Question-Answering Models: A Hybrid Word-Level Adversarial Approach

Autor: Li, Jiyao, Ni, Mingze, Gong, Yongshun, Liu, Wei

Deep learning underpins most of the currently advanced natural language processing (NLP) tasks such as textual classification, neural machine translation (NMT), abstractive summarization and question-answering (QA). However, the robustness of the mod

Externí odkaz: http://arxiv.org/abs/2411.08248

Zobrazit plný text záznamu

Report

Fine-Grained Urban Flow Inference with Multi-scale Representation Learning

Autor: Yuan, Shilu, Li, Dongfeng, Liu, Wei, Zhang, Xinxin, Chen, Meng, Zhang, Junjie, Gong, Yongshun

Fine-grained urban flow inference (FUFI) is a crucial transportation service aimed at improving traffic efficiency and safety. FUFI can infer fine-grained urban traffic flows based solely on observed coarse-grained data. However, most of existing met

Externí odkaz: http://arxiv.org/abs/2406.09710

Zobrazit plný text záznamu

Report

Diverse Teacher-Students for Deep Safe Semi-Supervised Learning under Class Mismatch

Autor: Wang, Qikai, He, Rundong, Gong, Yongshun, Ren, Chunxiao, Sun, Haoliang, Huang, Xiaoshui, Yin, Yilong

Semi-supervised learning can significantly boost model performance by leveraging unlabeled data, particularly when labeled data is scarce. However, real-world unlabeled data often contain unseen-class samples, which can hinder the classification of s

Externí odkaz: http://arxiv.org/abs/2405.16093

Zobrazit plný text záznamu

Report

3DBench: A Scalable 3D Benchmark and Instruction-Tuning Dataset

Autor: Zhang, Junjie, Hu, Tianci, Huang, Xiaoshui, Gong, Yongshun, Zeng, Dan

Evaluating the performance of Multi-modal Large Language Models (MLLMs), integrating both point cloud and language, presents significant challenges. The lack of a comprehensive assessment hampers determining whether these models truly represent advan

Externí odkaz: http://arxiv.org/abs/2404.14678

Zobrazit plný text záznamu

Report

CLIP-driven Outliers Synthesis for few-shot OOD detection

Autor: Sun, Hao, He, Rundong, Han, Zhongyi, Lin, Zhicong, Gong, Yongshun, Yin, Yilong

Few-shot OOD detection focuses on recognizing out-of-distribution (OOD) images that belong to classes unseen during training, with the use of only a small number of labeled in-distribution (ID) images. Up to now, a mainstream strategy is based on lar

Externí odkaz: http://arxiv.org/abs/2404.00323

Zobrazit plný text záznamu

Report

CodeS: Natural Language to Code Repository via Multi-Layer Sketch

Autor: Zan, Daoguang, Yu, Ailun, Liu, Wei, Chen, Dong, Shen, Bo, Li, Wei, Yao, Yafen, Gong, Yongshun, Chen, Xiaolin, Guan, Bei, Yang, Zhiguang, Wang, Yongji, Wang, Qianxiang, Cui, Lizhen

The impressive performance of large language models (LLMs) on code-related tasks has shown the potential of fully automated software development. In light of this, we introduce a new software engineering task, namely Natural Language to code Reposito

Externí odkaz: http://arxiv.org/abs/2403.16443

Zobrazit plný text záznamu

Report

Uni3D-LLM: Unifying Point Cloud Perception, Generation and Editing with Large Language Models

Autor: Liu, Dingning, Huang, Xiaoshui, Hou, Yuenan, Wang, Zhihui, Yin, Zhenfei, Gong, Yongshun, Gao, Peng, Ouyang, Wanli

In this paper, we introduce Uni3D-LLM, a unified framework that leverages a Large Language Model (LLM) to integrate tasks of 3D perception, generation, and editing within point cloud scenes. This framework empowers users to effortlessly generate and

Externí odkaz: http://arxiv.org/abs/2402.03327

Zobrazit plný text záznamu

Report

3DAxiesPrompts: Unleashing the 3D Spatial Task Capabilities of GPT-4V

Autor: Liu, Dingning, Dong, Xiaomeng, Zhang, Renrui, Luo, Xu, Gao, Peng, Huang, Xiaoshui, Gong, Yongshun, Wang, Zhihui

In this work, we present a new visual prompting method called 3DAxiesPrompts (3DAP) to unleash the capabilities of GPT-4V in performing 3D spatial tasks. Our investigation reveals that while GPT-4V exhibits proficiency in discerning the position and

Externí odkaz: http://arxiv.org/abs/2312.09738

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání