Výsledky vyhledávání

Akademický článek

Effect of modified attention and interpretation therapy on fatigue and sleep quality in patients with post-stroke hemiplegia

Autor: Cheng Jie, Chen Lu, Ma Qing, Zheng Siqi, Wang Yuhan, Wang Yunlong

Publikováno v: Sichuan jingshen weisheng, Vol 37, Iss 5, Pp 427-432 (2024)

BackgroundPatients with post-stroke hemiplegia are likely to experience fatigue and sleep disorder. Attention and interpretation therapy （AIT） has been shown to promote psychological flexibility， thereby alleviating their stress， improving em

Externí odkaz: https://doaj.org/article/848bc889395b4f3fb64e63ed3fb941b2

Zobrazit plný text záznamu

Akademický článek

Manifestations and pathogenesis of the sebaceous gland aging

Autor: ZHENG Siqi, CHEN Shuqiong, ZHONG Meizhen, ZHENG Yue, HUANG Xiaowen

Publikováno v: Pifu-xingbing zhenliaoxue zazhi, Vol 31, Iss 9, Pp 634-642 (2024)

Aging in the sebaceous glands typically begins on the sun-exposed areas, characterized by sebaceous gland hyperplasia and lipid dysbiosis. These changes result in dry and dull skin, alterations in the microbiota, pigmentation, and various skin diseas

Externí odkaz: https://doaj.org/article/5b91fda9436344f9bffe09f58b5dd918

Zobrazit plný text záznamu

Report

The deuterium fractionation of NH$_3$ in massive star-forming regions

Autor: Li, Yuqiang, Wang, Junzhi, Li, Juan, Rayalacheruvu, Prathap, Majumdar, Liton, Yan, Yaoting, Quan, Donghui, Lu, Xing, Zheng, Siqi

Deuteration is sensitive to environmental conditions in star-forming regions. To investigate NH$_2$D chemistry, we compared the spatial distribution of ortho-NH$_2$D $1_{11}^s-1_{01}^a$, NH$_3$(1,1) and NH$_3$(2,2) in 12 late-stage massive star-formi

Externí odkaz: http://arxiv.org/abs/2411.17121

Zobrazit plný text záznamu

Report

OmniSep: Unified Omni-Modality Sound Separation with Query-Mixup

Autor: Cheng, Xize, Zheng, Siqi, Wang, Zehan, Fang, Minghui, Zhang, Ziang, Huang, Rongjie, Ma, Ziyang, Ji, Shengpeng, Zuo, Jialong, Jin, Tao, Zhao, Zhou

The scaling up has brought tremendous success in the fields of vision and language in recent years. When it comes to audio, however, researchers encounter a major challenge in scaling up the training data, as most natural audio contains diverse inter

Externí odkaz: http://arxiv.org/abs/2410.21269

Zobrazit plný text záznamu

Report

OmniFlatten: An End-to-end GPT Model for Seamless Voice Conversation

Autor: Zhang, Qinglin, Cheng, Luyao, Deng, Chong, Chen, Qian, Wang, Wen, Zheng, Siqi, Liu, Jiaqing, Yu, Hai, Tan, Chaohong, Du, Zhihao, Zhang, Shiliang

Full-duplex spoken dialogue systems significantly surpass traditional turn-based dialogue systems, as they allow simultaneous bidirectional communication, closely mirroring human-human interactions. However, achieving low latency and natural interact

Externí odkaz: http://arxiv.org/abs/2410.17799

Zobrazit plný text záznamu

Report

MuVi: Video-to-Music Generation with Semantic Alignment and Rhythmic Synchronization

Autor: Li, Ruiqi, Zheng, Siqi, Cheng, Xize, Zhang, Ziang, Ji, Shengpeng, Zhao, Zhou

Generating music that aligns with the visual content of a video has been a challenging task, as it requires a deep understanding of visual semantics and involves generating music whose melody, rhythm, and dynamics harmonize with the visual narratives

Externí odkaz: http://arxiv.org/abs/2410.12957

Zobrazit plný text záznamu

Report

Exploring Text-Queried Sound Event Detection with Audio Source Separation

Autor: Yin, Han, Bai, Jisheng, Xiao, Yang, Wang, Hui, Zheng, Siqi, Chen, Yafeng, Das, Rohan Kumar, Deng, Chong, Chen, Jianfeng

In sound event detection (SED), overlapping sound events pose a significant challenge, as certain events can be easily masked by background noise or other events, resulting in poor detection performance. To address this issue, we propose the text-que

Externí odkaz: http://arxiv.org/abs/2409.13292

Zobrazit plný text záznamu

Report

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Autor: Ji, Shengpeng, Jiang, Ziyue, Wang, Wen, Chen, Yifu, Fang, Minghui, Zuo, Jialong, Yang, Qian, Cheng, Xize, Wang, Zehan, Li, Ruiqi, Zhang, Ziang, Yang, Xiaoda, Huang, Rongjie, Jiang, Yidi, Chen, Qian, Zheng, Siqi, Zhao, Zhou

Language models have been effectively applied to modeling natural signals, such as images, video, speech, and audio. A crucial component of these models is the codec tokenizer, which compresses high-dimensional natural signals into lower-dimensional

Externí odkaz: http://arxiv.org/abs/2408.16532

Zobrazit plný text záznamu

Report

Integrating Audio, Visual, and Semantic Information for Enhanced Multimodal Speaker Diarization

Autor: Cheng, Luyao, Wang, Hui, Zheng, Siqi, Chen, Yafeng, Huang, Rongjie, Zhang, Qinglin, Chen, Qian, Li, Xihao

Speaker diarization, the process of segmenting an audio stream or transcribed speech content into homogenous partitions based on speaker identity, plays a crucial role in the interpretation and analysis of human speech. Most existing speaker diarizat

Externí odkaz: http://arxiv.org/abs/2408.12102

Zobrazit plný text záznamu

Report

CosyVoice: A Scalable Multilingual Zero-shot Text-to-speech Synthesizer based on Supervised Semantic Tokens

Autor: Du, Zhihao, Chen, Qian, Zhang, Shiliang, Hu, Kai, Lu, Heng, Yang, Yexin, Hu, Hangrui, Zheng, Siqi, Gu, Yue, Ma, Ziyang, Gao, Zhifu, Yan, Zhijie

Recent years have witnessed a trend that large language model (LLM) based text-to-speech (TTS) emerges into the mainstream due to their high naturalness and zero-shot capacity. In this paradigm, speech signals are discretized into token sequences, wh

Externí odkaz: http://arxiv.org/abs/2407.05407

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání