Search results

Report

See Detail Say Clear: Towards Brain CT Report Generation via Pathological Clue-driven Representation Learning

Authors: Zheng, Chengxin; Ji, Junzhong; Shi, Yanzhao; Zhang, Xiaodan; Qu, Liangqiong

Brain CT report generation plays a significant role in helping physicians diagnose cranial diseases. Recent studies concentrate on handling the consistency between visual and textual pathological features to improve the coherence of the report. However, there exi

External link: http://arxiv.org/abs/2409.19676

Report

In-depth Analysis of Privacy Threats in Federated Learning for Medical Data

Authors: Das, Badhan Chandra; Amini, M. Hadi; Wu, Yanzhao

Federated learning is emerging as a promising machine learning technique in the medical field for analyzing medical images, as it is considered an effective method to safeguard sensitive patient data and comply with privacy regulations. However, rece

External link: http://arxiv.org/abs/2409.18907

Report

DA-MoE: Towards Dynamic Expert Allocation for Mixture-of-Experts Models

Authors: Aghdam, Maryam Akhavan; Jin, Hongpeng; Wu, Yanzhao

Transformer-based Mixture-of-Experts (MoE) models have been driving several recent technological advancements in Natural Language Processing (NLP). These MoE models adopt a router mechanism to determine which experts to activate for routing input tok

External link: http://arxiv.org/abs/2409.06669

Report

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Large language models (LLMs) have advanced financial applications, yet they often lack sufficient financial knowledge and struggle with tasks involving multi-modal inputs like tables and time series data. To address these limitations, we introduce \t

External link: http://arxiv.org/abs/2408.11878

Report

SysBench: Can Large Language Models Follow System Messages?

Authors: Qin, Yanzhao; Zhang, Tao; Shen, Yanjun; Luo, Wenjing; Sun, Haoze; Zhang, Yan; Qiao, Yujing; Chen, Weipeng; Zhou, Zenan; Zhang, Wentao; Cui, Bin

Large Language Models (LLMs) have become instrumental across various applications, with the customization of these models to specific scenarios becoming increasingly critical. The system message, a fundamental component of LLMs, consists of carefully c

External link: http://arxiv.org/abs/2408.10943

Report

Depth-guided Texture Diffusion for Image Semantic Segmentation

Authors: Sun, Wei; Li, Yuan; Ye, Qixiang; Jiao, Jianbin; Zhou, Yanzhao

Depth information provides valuable insights into 3D structure, especially the outlines of objects, which can be used to improve semantic segmentation. However, a naive fusion of depth information can disrupt features and compromise ac

External link: http://arxiv.org/abs/2408.09097

Report

Correspondence-Guided SfM-Free 3D Gaussian Splatting for NVS

Authors: Sun, Wei; Zhang, Xiaosong; Wan, Fang; Zhou, Yanzhao; Li, Yuan; Ye, Qixiang; Jiao, Jianbin

Novel View Synthesis (NVS) without Structure-from-Motion (SfM) pre-processed camera poses--referred to as SfM-free methods--is crucial for promoting rapid response capabilities and enhancing robustness against variable operating conditions. Recent Sf

External link: http://arxiv.org/abs/2408.08723

Report

An End-to-End Model for Photo-Sharing Multi-modal Dialogue Generation

Authors: Guo, Peiming; Liu, Sinuo; Zhang, Yanzhao; Long, Dingkun; Xie, Pengjun; Zhang, Meishan; Zhang, Min

Photo-sharing multi-modal dialogue generation requires a dialogue agent not only to generate text responses but also to share photos at the proper moment. Using image captions as the bridge, a pipeline model integrates an image caption model, a t

External link: http://arxiv.org/abs/2408.08650

Report

Contrastive masked auto-encoders based self-supervised hashing for 2D image and 3D point cloud cross-modal retrieval

Authors: Wei, Rukai; Cui, Heng; Liu, Yu; Hou, Yufeng; Xie, Yanzhao; Zhou, Ke

Implementing cross-modal hashing between 2D images and 3D point-cloud data is a growing concern in real-world retrieval systems. Simply applying existing cross-modal approaches to this new task fails to adequately capture latent multi-modal semantics

External link: http://arxiv.org/abs/2408.05711

Report

Strong convergence of an explicit full-discrete scheme for stochastic Burgers-Huxley equation

Authors: Wang, Yibo; Cao, Wanrong; Cao, Yanzhao

The strong convergence of an explicit full-discrete scheme is investigated for the stochastic Burgers-Huxley equation driven by additive space-time white noise, which possesses both Burgers-type and cubic nonlinearities. To discretize the continuous

External link: http://arxiv.org/abs/2408.00947
