Zobrazeno 1 - 10
of 719
pro vyhledávání: '"ZHENG Siqi"'
Publikováno v:
Sichuan jingshen weisheng, Vol 37, Iss 5, Pp 427-432 (2024)
BackgroundPatients with post-stroke hemiplegia are likely to experience fatigue and sleep disorder. Attention and interpretation therapy (AIT) has been shown to promote psychological flexibility, thereby alleviating their stress, improving em
Externí odkaz:
https://doaj.org/article/848bc889395b4f3fb64e63ed3fb941b2
Publikováno v:
Pifu-xingbing zhenliaoxue zazhi, Vol 31, Iss 9, Pp 634-642 (2024)
Aging in the sebaceous glands typically begins on the sun-exposed areas, characterized by sebaceous gland hyperplasia and lipid dysbiosis. These changes result in dry and dull skin, alterations in the microbiota, pigmentation, and various skin diseas
Externí odkaz:
https://doaj.org/article/5b91fda9436344f9bffe09f58b5dd918
Autor:
Li, Yuqiang, Wang, Junzhi, Li, Juan, Rayalacheruvu, Prathap, Majumdar, Liton, Yan, Yaoting, Quan, Donghui, Lu, Xing, Zheng, Siqi
Deuteration is sensitive to environmental conditions in star-forming regions. To investigate NH$_2$D chemistry, we compared the spatial distribution of ortho-NH$_2$D $1_{11}^s-1_{01}^a$, NH$_3$(1,1) and NH$_3$(2,2) in 12 late-stage massive star-formi
Externí odkaz:
http://arxiv.org/abs/2411.17121
Autor:
Cheng, Xize, Zheng, Siqi, Wang, Zehan, Fang, Minghui, Zhang, Ziang, Huang, Rongjie, Ma, Ziyang, Ji, Shengpeng, Zuo, Jialong, Jin, Tao, Zhao, Zhou
The scaling up has brought tremendous success in the fields of vision and language in recent years. When it comes to audio, however, researchers encounter a major challenge in scaling up the training data, as most natural audio contains diverse inter
Externí odkaz:
http://arxiv.org/abs/2410.21269
Autor:
Zhang, Qinglin, Cheng, Luyao, Deng, Chong, Chen, Qian, Wang, Wen, Zheng, Siqi, Liu, Jiaqing, Yu, Hai, Tan, Chaohong, Du, Zhihao, Zhang, Shiliang
Full-duplex spoken dialogue systems significantly surpass traditional turn-based dialogue systems, as they allow simultaneous bidirectional communication, closely mirroring human-human interactions. However, achieving low latency and natural interact
Externí odkaz:
http://arxiv.org/abs/2410.17799
Generating music that aligns with the visual content of a video has been a challenging task, as it requires a deep understanding of visual semantics and involves generating music whose melody, rhythm, and dynamics harmonize with the visual narratives
Externí odkaz:
http://arxiv.org/abs/2410.12957
Autor:
Yin, Han, Bai, Jisheng, Xiao, Yang, Wang, Hui, Zheng, Siqi, Chen, Yafeng, Das, Rohan Kumar, Deng, Chong, Chen, Jianfeng
In sound event detection (SED), overlapping sound events pose a significant challenge, as certain events can be easily masked by background noise or other events, resulting in poor detection performance. To address this issue, we propose the text-que
Externí odkaz:
http://arxiv.org/abs/2409.13292
Autor:
Ji, Shengpeng, Jiang, Ziyue, Wang, Wen, Chen, Yifu, Fang, Minghui, Zuo, Jialong, Yang, Qian, Cheng, Xize, Wang, Zehan, Li, Ruiqi, Zhang, Ziang, Yang, Xiaoda, Huang, Rongjie, Jiang, Yidi, Chen, Qian, Zheng, Siqi, Zhao, Zhou
Language models have been effectively applied to modeling natural signals, such as images, video, speech, and audio. A crucial component of these models is the codec tokenizer, which compresses high-dimensional natural signals into lower-dimensional
Externí odkaz:
http://arxiv.org/abs/2408.16532
Autor:
Cheng, Luyao, Wang, Hui, Zheng, Siqi, Chen, Yafeng, Huang, Rongjie, Zhang, Qinglin, Chen, Qian, Li, Xihao
Speaker diarization, the process of segmenting an audio stream or transcribed speech content into homogenous partitions based on speaker identity, plays a crucial role in the interpretation and analysis of human speech. Most existing speaker diarizat
Externí odkaz:
http://arxiv.org/abs/2408.12102
Autor:
Du, Zhihao, Chen, Qian, Zhang, Shiliang, Hu, Kai, Lu, Heng, Yang, Yexin, Hu, Hangrui, Zheng, Siqi, Gu, Yue, Ma, Ziyang, Gao, Zhifu, Yan, Zhijie
Recent years have witnessed a trend that large language model (LLM) based text-to-speech (TTS) emerges into the mainstream due to their high naturalness and zero-shot capacity. In this paradigm, speech signals are discretized into token sequences, wh
Externí odkaz:
http://arxiv.org/abs/2407.05407