Zobrazeno 1 - 10
of 18
pro vyhledávání: '"Petr Fousek"'
Autor:
Dimitrios Dimitriadis, Petr Fousek
Publikováno v:
INTERSPEECH
Autor:
Jason W. Pelecanos, Dimitrios Dimitriadis, Kenneth Church, Weizhong Zhu, Josef Vopicka, Petr Fousek
Publikováno v:
ICASSP
This paper discusses some challenges and opportunities in developing a speaker diarization system for operation on real world call center telephony data. We contrast some of the differences between a standard data set akin to NIST evaluations and tho
Publikováno v:
ICASSP
Deep Scattering Network features introduced for image processing have recently proved useful in speech recognition as an alternative to log-mel features for Deep Neural Network (DNN) acoustic models. Scattering features use wavelet decomposition dire
Autor:
David Nahamoo, Vijayaditya Peddinti, Petr Fousek, Tara N. Sainath, Brian Kingsbury, Bhuvana Ramabhadran
Publikováno v:
INTERSPEECH
Log-mel filterbank features, which are commonly used features for CNNs, can remove higher-resolution information from the speech signal. A novel technique, known as Deep Scattering Spectrum (DSS), addresses this issue and looks to preserve this infor
Publikováno v:
ICASSP
In this paper, we present new methods for parameterizing the connections of neural networks using sums of direct products. We show that low rank parameterizations of weight matrices are a subset of this set, and explore the theoretical and practical
Publikováno v:
ICASSP
We present the Factorial Hidden Restricted Boltzmann Machine (FHRBM) for robust speech recognition. Speech and noise are modeled as independent RBMs, and the interaction between them is explicitly modeled to capture how speech and noise combine to ge
Publikováno v:
ASRU
In this paper we describe how the model-based noise robustness algorithm for previously unseen noise conditions, Dynamic Noise Adaptation (DNA), can be made robust to matched data, without the need to do any system re-training. The approach is to do
Autor:
Tara N. Sainath, Bhuvana Ramabhadran, Petr Fousek, Petr Novak, Brian Kingsbury, Abdelrahman Mohamed
Publikováno v:
ASRU
To date, there has been limited work in applying Deep Belief Networks (DBNs) for acoustic modeling in LVCSR tasks, with past work using standard speech features. However, a typical LVCSR system makes use of both feature and model-space speaker adapta
Publikováno v:
ICASSP
Dynamic noise adaptation (DNA) [1, 2] is a model-based technique for improving automatic speech recognition (ASR) performance in noise. DNA has shown promise on artificially mixed data such as the Aurora II and DNA+Aurora II tasks [1]—significantly