Search results - "Nugraha, Aditya Arie"

Report

Run-Time Adaptation of Neural Beamforming for Robust Speech Dereverberation and Denoising

Authors: Fujita, Yoto; Nugraha, Aditya Arie; Di Carlo, Diego; Bando, Yoshiaki; Fontaine, Mathieu; Yoshii, Kazuyoshi

This paper describes speech enhancement for real-time automatic speech recognition (ASR) in real environments. A standard approach to this task is to use neural beamforming that can work efficiently in an online manner. It estimates the masks of clean

External link: http://arxiv.org/abs/2410.22805


Report

Neural Fast Full-Rank Spatial Covariance Analysis for Blind Source Separation

Authors: Bando, Yoshiaki; Masuyama, Yoshiki; Nugraha, Aditya Arie; Yoshii, Kazuyoshi

This paper describes an efficient unsupervised learning method for a neural source separation model that utilizes a probabilistic generative model of observed multichannel mixtures proposed for blind source separation (BSS). For this purpose, amortiz

External link: http://arxiv.org/abs/2306.10240


Report

Neural Steerer: Novel Steering Vector Synthesis with a Causal Neural Field over Frequency and Source Positions

Authors: Di Carlo, Diego; Nugraha, Aditya Arie; Fontaine, Mathieu; Yoshii, Kazuyoshi

We address the problem of accurately interpolating measured anechoic steering vectors with a deep learning framework called the neural field. This task plays a pivotal role in reducing the resource-intensive measurements required for precise sound so

External link: http://arxiv.org/abs/2305.04447


Report

DNN-Free Low-Latency Adaptive Speech Enhancement Based on Frame-Online Beamforming Powered by Block-Online FastMNMF

Authors: Nugraha, Aditya Arie; Sekiguchi, Kouhei; Fontaine, Mathieu; Bando, Yoshiaki; Yoshii, Kazuyoshi

This paper describes a practical dual-process speech enhancement system that adapts environment-sensitive frame-online beamforming (front-end) with help from environment-free block-online source separation (back-end). To use minimum variance distorti

External link: http://arxiv.org/abs/2207.10934


Report

Direction-Aware Adaptive Online Neural Speech Enhancement with an Augmented Reality Headset in Real Noisy Conversational Environments

Authors: Sekiguchi, Kouhei; Nugraha, Aditya Arie; Du, Yicheng; Bando, Yoshiaki; Fontaine, Mathieu; Yoshii, Kazuyoshi

This paper describes the practical response- and performance-aware development of online speech enhancement for an augmented reality (AR) headset that helps a user understand conversations made in real noisy echoic environments (e.g., cocktail party)

External link: http://arxiv.org/abs/2207.07296


Report

Direction-Aware Joint Adaptation of Neural Speech Enhancement and Recognition in Real Multiparty Conversational Environments

Authors: Du, Yicheng; Nugraha, Aditya Arie; Sekiguchi, Kouhei; Bando, Yoshiaki; Fontaine, Mathieu; Yoshii, Kazuyoshi

This paper describes noisy speech recognition for an augmented reality headset that helps verbal communication within real multiparty conversational environments. A major approach that has actively been studied in simulated environments is to sequent

External link: http://arxiv.org/abs/2207.07273


Report

A Deep Generative Model of Speech Complex Spectrograms

Authors: Nugraha, Aditya Arie; Sekiguchi, Kouhei; Yoshii, Kazuyoshi

This paper proposes an approach to the joint modeling of the short-time Fourier transform magnitude and phase spectrograms with a deep generative model. We assume that the magnitude follows a Gaussian distribution and the phase follows a von Mises di

External link: http://arxiv.org/abs/1903.03269


Report

Fast Multichannel Source Separation Based on Jointly Diagonalizable Spatial Covariance Matrices

Authors: Sekiguchi, Kouhei; Nugraha, Aditya Arie; Bando, Yoshiaki; Yoshii, Kazuyoshi

This paper describes a versatile method that accelerates multichannel source separation methods based on full-rank spatial modeling. A popular approach to multichannel source separation is to integrate a spatial model with a source model for estimati

External link: http://arxiv.org/abs/1903.03237


Academic article

An analysis of environment, microphone and data simulation mismatches in robust speech recognition

Authors: Vincent, Emmanuel; Watanabe, Shinji; Nugraha, Aditya Arie; Barker, Jon; Marxer, Ricard

Published in: Computer Speech & Language, November 2017, 46:535-557


