Výsledky vyhledávání - "Wang, Xiangdong"

Report

A Multimodal Object-level Contrast Learning Method for Cancer Survival Risk Prediction

Autor: Yang, Zekang, Liu, Hong, Wang, Xiangdong

Computer-aided cancer survival risk prediction plays an important role in the timely treatment of patients. This is a challenging weakly supervised ordinal regression task associated with multiple clinical factors involved such as pathological images

Externí odkaz: http://arxiv.org/abs/2409.02145

Zobrazit plný text záznamu

Report

Advancing Multi-grained Alignment for Contrastive Language-Audio Pre-training

Autor: Li, Yiming, Guo, Zhifang, Wang, Xiangdong, Liu, Hong

Recent advances have been witnessed in audio-language joint learning, such as CLAP, that shows much success in multi-modal understanding tasks. These models usually aggregate uni-modal local representations, namely frame or word features, into global

Externí odkaz: http://arxiv.org/abs/2408.07919

Zobrazit plný text záznamu

Report

SCMIL: Sparse Context-aware Multiple Instance Learning for Predicting Cancer Survival Probability Distribution in Whole Slide Images

Autor: Yang, Zekang, Liu, Hong, Wang, Xiangdong

Cancer survival prediction is a challenging task that involves analyzing of the tumor microenvironment within Whole Slide Image (WSI). Previous methods cannot effectively capture the intricate interaction features among instances within the local are

Externí odkaz: http://arxiv.org/abs/2407.00664

Zobrazit plný text záznamu

Report

Frame Pairwise Distance Loss for Weakly-supervised Sound Event Detection

Autor: Tao, Rui, Huang, Yuxing, Wang, Xiangdong, Yan, Long, Zhai, Lufeng, Ouchi, Kazushige, Li, Taihao

Weakly-supervised learning has emerged as a promising approach to leverage limited labeled data in various domains by bridging the gap between fully supervised methods and unsupervised techniques. Acquisition of strong annotations for detecting sound

Externí odkaz: http://arxiv.org/abs/2309.11783

Zobrazit plný text záznamu

Report

Audio-free Prompt Tuning for Language-Audio Models

Autor: Li, Yiming, Wang, Xiangdong, Liu, Hong

Contrastive Language-Audio Pretraining (CLAP) is pre-trained to associate audio features with human language, making it a natural zero-shot classifier to recognize unseen sound categories. To adapt CLAP to downstream tasks, prior works inevitably req

Externí odkaz: http://arxiv.org/abs/2309.08357

Zobrazit plný text záznamu

Report

Semi-supervised Sound Event Detection with Local and Global Consistency Regularization

Autor: Li, Yiming, Wang, Xiangdong, Liu, Hong, Tao, Rui, Yan, Long, Ouchi, Kazushige

Learning meaningful frame-wise features on a partially labeled dataset is crucial to semi-supervised sound event detection. Prior works either maintain consistency on frame-level predictions or seek feature-level similarity among neighboring frames,

Externí odkaz: http://arxiv.org/abs/2309.08355

Zobrazit plný text záznamu

Kniha

Lipidomics in Health and Disease : Methods and Application. [elektronicky zdroj]

Autor: Wang, Xiangdong

Externí odkaz: Kolekce e-knih KNAV Registrovani uzivatele: plny text online 5 minut, dalsi pristup na vyzadani. Registered users: full text online 5 minutes, further access on request.

Elektronická kniha

Genomic Approach to Asthma. [electronic resource]

Autor: Wang, Xiangdong

Externí odkaz: Kolekce e-knih KNAV Registrovani uzivatele: plny text online 5 minut, dalsi pristup na vyzadani. Registered users: full text online 5 minutes, further access on requests.

Report

Audio Generation with Multiple Conditional Diffusion Model

Autor: Guo, Zhifang, Mao, Jianguo, Tao, Rui, Yan, Long, Ouchi, Kazushige, Liu, Hong, Wang, Xiangdong

Text-based audio generation models have limitations as they cannot encompass all the information in audio, leading to restricted controllability when relying solely on text. To address this issue, we propose a novel model that enhances the controllab

Externí odkaz: http://arxiv.org/abs/2308.11940

Zobrazit plný text záznamu

Report

Leveraging Language Model Capabilities for Sound Event Detection

Autor: Wang, Hualei, Mao, Jianguo, Guo, Zhifang, Wan, Jiarui, Liu, Hong, Wang, Xiangdong

Large language models reveal deep comprehension and fluent generation in the field of multi-modality. Although significant advancements have been achieved in audio multi-modality, existing methods are rarely leverage language model for sound event de

Externí odkaz: http://arxiv.org/abs/2308.11530

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání