Zobrazeno 1 - 10
of 16
pro vyhledávání: '"Christopher Schymura"'
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 28:487-502
Identification and localization of sounds are both integral parts of computational auditory scene analysis. Although each can be solved separately, the goal of forming coherent auditory objects and achieving a comprehensive spatial scene understandin
Autor:
Tomohiro Nakatani, Julio Wissing, Tsubasa Ochiai, Dorothea Kolossa, Marc Delcroix, Keisuke Kinoshita, Shoko Araki, Benedikt Boenninghoff, Christopher Schymura
Publikováno v:
ICASSP
Estimating the positions of multiple speakers can be helpful for tasks like automatic speech recognition or speaker diarization. Both applications benefit from a known speaker position when, for instance, applying beamforming or assigning unique spea
Autor:
Keisuke Kinoshita, Christopher Schymura, Tomohiro Nakatani, Shoko Araki, Marc Delcroix, Tsubasa Ochiai, Dorothea Kolossa
Publikováno v:
EUSIPCO
Sound event localization frameworks based on deep neural networks have shown increased robustness with respect to reverberation and noise in comparison to classical parametric approaches. In particular, recurrent architectures that incorporate tempor
Autor:
Shoko Araki, Christopher Schymura, Benedikt Bönninghoff, Marc Delcroix, Keisuke Kinoshita, Dorothea Kolossa, Tomohiro Nakatani, Tsubasa Ochiai
Sound event localization aims at estimating the positions of sound sources in the environment with respect to an acoustic receiver (e.g. a microphone array). Recent advances in this domain most prominently focused on utilizing deep recurrent neural n
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1d7de552b6e5cb05d487ff98469b938e
Publikováno v:
IJCNN
Deep neural networks have proven highly effective at speech enhancement, which makes them attractive not just as front-ends for machine listening and speech recognition, but also as enhancement models for the benefit of human listeners. They are, how
Autor:
Shoko Araki, Christopher Schymura, Tomohiro Nakatani, Tsubasa Ochiai, Dorothea Kolossa, Keisuke Kinoshita, Marc Delcroix
Publikováno v:
ICASSP
Audiovisual speaker tracking is an application that has been tackled by a wide range of classical approaches based on Gaussian filters, most notably the well-known Kalman filter. Recently, a specific Kalman filter implementation was proposed for this
Autor:
Christopher Schymura, Dorothea Kolossa
Publikováno v:
ICASSP
Multimodal data fusion is an important aspect of many object localization and tracking frameworks that rely on sensory observations from different sources. A prominent example is audiovisual speaker localization, where the incorporation of visual inf
Autor:
Dorothea Kolossa, Christopher Schymura
Data fusion plays an important role in many technical applications that require efficient processing of multimodal sensory observations. A prominent example is audiovisual signal processing, which has gained increasing attention in automatic speech r
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::56d3cafa2794e5c2660f5380f95dea91
http://arxiv.org/abs/1903.06031
http://arxiv.org/abs/1903.06031
Publikováno v:
IWAENC
An important aspect of audiovisual speaker localization is the appropriate fusion of acoustic and visual observations based on their time-varying reliability. In this study, a framework which incorporates dynamic stream weights into the well-known Ka
Autor:
Christopher Schymura, Dorothea Kolossa
Publikováno v:
ICASSP
This paper presents a novel framework for active exploration in the context of acoustic simultaneous localization and mapping (SLAM) using a microphone array mounted on a mobile robotic agent. Acoustic SLAM aims at building a map of acoustic sources