Zobrazeno 1 - 10
of 96
pro vyhledávání: '"Politis, Archontis"'
Autor:
Garcia-Martinez, Jaime, Diaz-Guerra, David, Politis, Archontis, Virtanen, Tuomas, Carabias-Orti, Julio J., Vera-Candeas, Pedro
Recent advancements in music source separation have significantly progressed, particularly in isolating vocals, drums, and bass elements from mixed tracks. These developments owe much to the creation and use of large-scale, multitrack datasets dedica
Externí odkaz:
http://arxiv.org/abs/2409.10995
Autor:
Politis, Archontis
Acoustical signal processing of directional representations of sound fields, including source, receiver, and scatterer transfer functions, are often expressed and modeled in the spherical harmonic domain (SHD). Certain such modeling operations, or ap
Externí odkaz:
http://arxiv.org/abs/2407.06847
Reference Channel Selection by Multi-Channel Masking for End-to-End Multi-Channel Speech Enhancement
In end-to-end multi-channel speech enhancement, the traditional approach of designating one microphone signal as the reference for processing may not always yield optimal results. The limitation is particularly in scenarios with large distributed mic
Externí odkaz:
http://arxiv.org/abs/2406.03228
Distance estimation from audio plays a crucial role in various applications, such as acoustic scene analysis, sound source localization, and room modeling. Most studies predominantly center on employing a classification approach, where distances are
Externí odkaz:
http://arxiv.org/abs/2403.17514
Sound Event Detection and Localization (SELD) is a combined task of identifying sound events and their corresponding direction-of-arrival (DOA). While this task has numerous applications and has been extensively researched in recent years, it fails t
Externí odkaz:
http://arxiv.org/abs/2403.11827
Scene-based spatial audio formats, such as Ambisonics, are playback system agnostic and may therefore be favoured for delivering immersive audio experiences to a wide range of (potentially unknown) devices. The number of channels required to deliver
Externí odkaz:
http://arxiv.org/abs/2401.13401
Ambisonics encoding of microphone array signals can enable various spatial audio applications, such as virtual reality or telepresence, but it is typically designed for uniformly-spaced spherical microphone arrays. This paper proposes a method for Am
Externí odkaz:
http://arxiv.org/abs/2401.05916
Current multichannel speech enhancement algorithms typically assume a stationary sound source, a common mismatch with reality that limits their performance in real-world scenarios. This paper focuses on attention-driven spatial filtering techniques d
Externí odkaz:
http://arxiv.org/abs/2312.10756
This paper proposes neural networks for compensating sensorineural hearing loss. The aim of the hearing loss compensation task is to transform a speech signal to increase speech intelligibility after further processing by a person with a hearing impa
Externí odkaz:
http://arxiv.org/abs/2310.16550
Autor:
Shimada, Kazuki, Politis, Archontis, Sudarsanam, Parthasaarathy, Krause, Daniel, Uchida, Kengo, Adavanne, Sharath, Hakala, Aapo, Koyama, Yuichiro, Takahashi, Naoya, Takahashi, Shusuke, Virtanen, Tuomas, Mitsufuji, Yuki
While direction of arrival (DOA) of sound events is generally estimated from multichannel audio data recorded in a microphone array, sound events usually derive from visually perceptible source objects, e.g., sounds of footsteps come from the feet of
Externí odkaz:
http://arxiv.org/abs/2306.09126