Author: |
Anemüller, Carlotta; Thiergart, Oliver; Habets, Emanuël A. P. |
Source: |
EURASIP Journal on Audio Speech & Music Processing; 11/5/2024, Vol. 2024 Issue 1, p1-14, 14p |
Abstract: |
The degree of correlation between the sounds received by the ears significantly influences the spatial perception of a sound image. Audio signal decorrelation is, therefore, a commonly used tool in various spatial audio rendering applications. In this paper, we propose a multi-channel extension of a previously proposed decorrelation method based on generative adversarial networks. A separate generator network is employed for each output channel. All generator networks are optimized jointly to obtain a multi-channel output signal with the desired properties. The training objective includes a number of individual loss terms to control both the input-output and the inter-channel correlation as well as the quality of the individual output channels. The proposed approach is trained on music signals and evaluated both objectively and through formal listening tests. Thereby, a comparison with two classical signal processing-based multi-channel decorrelators is performed. Additionally, the influence of the number of output channels, the individual loss term weightings, and the employed training data on the proposed method's performance is investigated. [ABSTRACT FROM AUTHOR] |
Database: |
Complementary Index |