Zobrazeno 1 - 10
of 2 884
pro vyhledávání: '"WANG, YIWEN"'
Autor:
Zhou, Xinyi, Li, Xing, Lian, Yingzhao, Wang, Yiwen, Chen, Lei, Yuan, Mingxuan, Hao, Jianye, Chen, Guangyong, Heng, Pheng Ann
We introduce SeaDAG, a semi-autoregressive diffusion model for conditional generation of Directed Acyclic Graphs (DAGs). Considering their inherent layer-wise structure, we simulate layer-wise autoregressive generation by designing different denoisin
Externí odkaz:
http://arxiv.org/abs/2410.16119
Autor:
Sun, Jiamin, Shu, Shibo, Chai, Ye, Zhu, Lin, Zhang, Lingmei, Li, Yongping, Liu, Zhouhui, Li, Zhengwei, Xu, Yu, Yan, Daikang, Guo, Weijie, Wang, Yiwen, Liu, Congzhan
Fabrication of dielectrics at low temperature is required for temperature-sensitive detectors. For superconducting detectors, such as transition edge sensors and kinetic inductance detectors, AlMn is widely studied due to its variable superconducting
Externí odkaz:
http://arxiv.org/abs/2409.09301
Target speech extraction (TSE) focuses on extracting the speech of a specific target speaker from a mixture of signals. Existing TSE models typically utilize static embeddings as conditions for extracting the target speaker's voice. However, the stat
Externí odkaz:
http://arxiv.org/abs/2409.06136
The Transformer model, particularly its cross-attention module, is widely used for feature fusion in target sound extraction which extracts the signal of interest based on given clues. Despite its effectiveness, this approach suffers from low computa
Externí odkaz:
http://arxiv.org/abs/2409.04803
Autor:
Yao, Xufeng, Wang, Yiwen, Li, Xing, Lian, Yingzhao, Chen, Ran, Chen, Lei, Yuan, Mingxuan, Xu, Hong, Yu, Bei
Register Transfer Level (RTL) code optimization is crucial for enhancing the efficiency and performance of digital circuits during early synthesis stages. Currently, optimization relies heavily on manual efforts by skilled engineers, often requiring
Externí odkaz:
http://arxiv.org/abs/2409.11414
While previous audio-driven talking head generation (THG) methods generate head poses from driving audio, the generated poses or lips cannot match the audio well or are not editable. In this study, we propose \textbf{PoseTalk}, a THG system that can
Externí odkaz:
http://arxiv.org/abs/2409.02657
Autor:
Zheng, Yuxiang, Sun, Shichao, Qiu, Lin, Ru, Dongyu, Jiayang, Cheng, Li, Xuefeng, Lin, Jifan, Wang, Binjie, Luo, Yun, Pan, Renjie, Xu, Yang, Min, Qingkai, Zhang, Zizhao, Wang, Yiwen, Li, Wenjie, Liu, Pengfei
The rapid growth of scientific literature imposes significant challenges for researchers endeavoring to stay updated with the latest advancements in their fields and delve into new areas. We introduce OpenResearcher, an innovative platform that lever
Externí odkaz:
http://arxiv.org/abs/2408.06941
Rate splitting multiple access (RSMA) relies on beamforming design for attaining spectral efficiency and energy efficiency gains over traditional multiple access schemes. While conventional optimization approaches such as weighted minimum mean square
Externí odkaz:
http://arxiv.org/abs/2407.06530
Autor:
Wang, Yiwen, Wu, Xihong
Target sound extraction (TSE) separates the target sound from the mixture signals based on provided clues. However, the performance of existing models significantly degrades under reverberant conditions. Inspired by auditory scene analysis (ASA), thi
Externí odkaz:
http://arxiv.org/abs/2406.08716
Researchers have reported high decoding accuracy (>95%) using non-invasive Electroencephalogram (EEG) signals for brain-computer interface (BCI) decoding tasks like image decoding, emotion recognition, auditory spatial attention detection, etc. Since
Externí odkaz:
http://arxiv.org/abs/2405.17024