Zobrazeno 1 - 10
of 37
pro vyhledávání: '"Donley, Jacob"'
The increasing popularity of spatial audio in applications such as teleconferencing, entertainment, and virtual reality has led to the recent developments of binaural reproduction methods. However, only a few of these methods are well-suited for wear
Externí odkaz:
http://arxiv.org/abs/2409.11731
Autor:
Yang, Yufeng, Raj, Desh, Lin, Ju, Moritz, Niko, Jia, Junteng, Keren, Gil, Lakomkin, Egor, Huang, Yiteng, Donley, Jacob, Mahadeokar, Jay, Kalinli, Ozlem
The growing popularity of multi-channel wearable devices, such as smart glasses, has led to a surge of applications such as targeted speech recognition and enhanced hearing. However, current approaches to solve these tasks use independently trained m
Externí odkaz:
http://arxiv.org/abs/2409.11494
Autor:
Yun, Heeseung, Gao, Ruohan, Ananthabhotla, Ishwarya, Kumar, Anurag, Donley, Jacob, Li, Chao, Kim, Gunhee, Ithapu, Vamsi Krishna, Murdock, Calvin
Egocentric videos provide comprehensive contexts for user and scene understanding, spanning multisensory perception to behavioral interaction. We propose Spherical World-Locking (SWL) as a general framework for egocentric scene representation, which
Externí odkaz:
http://arxiv.org/abs/2408.05364
Binaural reproduction is rapidly becoming a topic of great interest in the research community, especially with the surge of new and popular devices, such as virtual reality headsets, smart glasses, and head-tracked headphones. In order to immerse the
Externí odkaz:
http://arxiv.org/abs/2408.03581
In the rapidly evolving fields of virtual and augmented reality, accurate spatial audio capture and reproduction are essential. For these applications, Ambisonics has emerged as a standard format. However, existing methods for encoding Ambisonics sig
Externí odkaz:
http://arxiv.org/abs/2402.17362
Publikováno v:
in Proceedings of the 24th International Congress on Acoustics (ICA 2022), ABS-0302, 2022
The capture and reproduction of spatial audio is becoming increasingly popular, with the mushrooming of applications in teleconferencing, entertainment and virtual reality. Many binaural reproduction methods have been developed and studied extensivel
Externí odkaz:
http://arxiv.org/abs/2311.13390
Autor:
Hafezi, Sina, Moore, Alastair H., Guiraud, Pierre, Naylor, Patrick A., Donley, Jacob, Tourbabin, Vladimir, Lunner, Thomas
A two-stage multi-channel speech enhancement method is proposed which consists of a novel adaptive beamformer, Hybrid Minimum Variance Distortionless Response (MVDR), Isotropic-MVDR (Iso), and a novel multi-channel spectral Principal Components Analy
Externí odkaz:
http://arxiv.org/abs/2303.08967
Prior works on improving speech quality with visual input typically study each type of auditory distortion separately (e.g., separation, inpainting, video-to-speech) and present tailored algorithms. This paper proposes to unify these subjects and stu
Externí odkaz:
http://arxiv.org/abs/2212.11377
Autor:
Mira, Rodrigo, Xu, Buye, Donley, Jacob, Kumar, Anurag, Petridis, Stavros, Ithapu, Vamsi Krishna, Pantic, Maja
Audio-visual speech enhancement aims to extract clean speech from a noisy environment by leveraging not only the audio itself but also the target speaker's lip movements. This approach has been shown to yield improvements over audio-only speech enhan
Externí odkaz:
http://arxiv.org/abs/2211.10999
Autor:
Kang, Zhiqi, Sadeghi, Mostafa, Horaud, Radu, Alameda-Pineda, Xavier, Donley, Jacob, Kumar, Anurag
This paper investigates the impact of head movements on audio-visual speech enhancement (AVSE). Although being a common conversational feature, head movements have been ignored by past and recent studies: they challenge today's learning-based methods
Externí odkaz:
http://arxiv.org/abs/2202.00538