Výsledky vyhledávání

Report

SeaDAG: Semi-autoregressive Diffusion for Conditional Directed Acyclic Graph Generation

Autor: Zhou, Xinyi, Li, Xing, Lian, Yingzhao, Wang, Yiwen, Chen, Lei, Yuan, Mingxuan, Hao, Jianye, Chen, Guangyong, Heng, Pheng Ann

We introduce SeaDAG, a semi-autoregressive diffusion model for conditional generation of Directed Acyclic Graphs (DAGs). Considering their inherent layer-wise structure, we simulate layer-wise autoregressive generation by designing different denoisin

Externí odkaz: http://arxiv.org/abs/2410.16119

Zobrazit plný text záznamu

Report

Cryogenic microwave performance of silicon nitride and amorphous silicon deposited using low-temperature ICPCVD

Autor: Sun, Jiamin, Shu, Shibo, Chai, Ye, Zhu, Lin, Zhang, Lingmei, Li, Yongping, Liu, Zhouhui, Li, Zhengwei, Xu, Yu, Yan, Daikang, Guo, Weijie, Wang, Yiwen, Liu, Congzhan

Fabrication of dielectrics at low temperature is required for temperature-sensitive detectors. For superconducting detectors, such as transition edge sensors and kinetic inductance detectors, AlMn is widely studied due to its variable superconducting

Externí odkaz: http://arxiv.org/abs/2409.09301

Zobrazit plný text záznamu

Report

DENSE: Dynamic Embedding Causal Target Speech Extraction

Autor: Wang, Yiwen, Yuan, Zeyu, Wu, Xihong

Target speech extraction (TSE) focuses on extracting the speech of a specific target speaker from a mixture of signals. Existing TSE models typically utilize static embeddings as conditions for extracting the target speaker's voice. However, the stat

Externí odkaz: http://arxiv.org/abs/2409.06136

Zobrazit plný text záznamu

Report

Cross-attention Inspired Selective State Space Models for Target Sound Extraction

Autor: Wu, Donghang, Wang, Yiwen, Wu, Xihong, Qu, Tianshu

The Transformer model, particularly its cross-attention module, is widely used for feature fusion in target sound extraction which extracts the signal of interest based on given clues. Despite its effectiveness, this approach suffers from low computa

Externí odkaz: http://arxiv.org/abs/2409.04803

Zobrazit plný text záznamu

Report

RTLRewriter: Methodologies for Large Models aided RTL Code Optimization

Autor: Yao, Xufeng, Wang, Yiwen, Li, Xing, Lian, Yingzhao, Chen, Ran, Chen, Lei, Yuan, Mingxuan, Xu, Hong, Yu, Bei

Register Transfer Level (RTL) code optimization is crucial for enhancing the efficiency and performance of digital circuits during early synthesis stages. Currently, optimization relies heavily on manual efforts by skilled engineers, often requiring

Externí odkaz: http://arxiv.org/abs/2409.11414

Zobrazit plný text záznamu

Report

PoseTalk: Text-and-Audio-based Pose Control and Motion Refinement for One-Shot Talking Head Generation

Autor: Ling, Jun, Wang, Yiwen, Xue, Han, Xie, Rong, Song, Li

While previous audio-driven talking head generation (THG) methods generate head poses from driving audio, the generated poses or lips cannot match the audio well or are not editable. In this study, we propose \textbf{PoseTalk}, a THG system that can

Externí odkaz: http://arxiv.org/abs/2409.02657

Zobrazit plný text záznamu

Report

OpenResearcher: Unleashing AI for Accelerated Scientific Research

Autor: Zheng, Yuxiang, Sun, Shichao, Qiu, Lin, Ru, Dongyu, Jiayang, Cheng, Li, Xuefeng, Lin, Jifan, Wang, Binjie, Luo, Yun, Pan, Renjie, Xu, Yang, Min, Qingkai, Zhang, Zizhao, Wang, Yiwen, Li, Wenjie, Liu, Pengfei

The rapid growth of scientific literature imposes significant challenges for researchers endeavoring to stay updated with the latest advancements in their fields and delve into new areas. We introduce OpenResearcher, an innovative platform that lever

Externí odkaz: http://arxiv.org/abs/2408.06941

Zobrazit plný text záznamu

Report

RS-BNN: A Deep Learning Framework for the Optimal Beamforming Design of Rate-Splitting Multiple Access

Autor: Wang, Yiwen, Mao, Yijie, Ji, Sijie

Rate splitting multiple access (RSMA) relies on beamforming design for attaining spectral efficiency and energy efficiency gains over traditional multiple access schemes. While conventional optimization approaches such as weighted minimum mean square

Externí odkaz: http://arxiv.org/abs/2407.06530

Zobrazit plný text záznamu

Report

TSE-PI: Target Sound Extraction under Reverberant Environments with Pitch Information

Autor: Wang, Yiwen, Wu, Xihong

Target sound extraction (TSE) separates the target sound from the mixture signals based on provided clues. However, the performance of existing models significantly degrades under reverberant conditions. Inspired by auditory scene analysis (ASA), thi

Externí odkaz: http://arxiv.org/abs/2406.08716

Zobrazit plný text záznamu

Report

Beware of Overestimated Decoding Performance Arising from Temporal Autocorrelations in Electroencephalogram Signals

Autor: Xu, Xiran, Wang, Bo, Xiao, Boda, Niu, Yadong, Wang, Yiwen, Wu, Xihong, Chen, Jing

Researchers have reported high decoding accuracy (>95%) using non-invasive Electroencephalogram (EEG) signals for brain-computer interface (BCI) decoding tasks like image decoding, emotion recognition, auditory spatial attention detection, etc. Since

Externí odkaz: http://arxiv.org/abs/2405.17024

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání