Zobrazeno 1 - 10
of 66
pro vyhledávání: '"Roberto Gemello"'
Publikováno v:
INTERSPEECH
In this paper, we apply Semi-Supervised Learning (SSL) along with Data Augmentation (DA) for improving the accuracy of End-to-End ASR. We focus on the consistency regularization principle, which has been successfully applied to image classification t
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::4bc46f7beed87cdad035d8f2200424be
Publikováno v:
ASRU
Deep Neural Network (DNN) acoustic models are sensitive to the mismatch between training and testing environments. When a trained model is tested on unseen speakers, domain, or environment, recognition accuracy can degrade substantially. In such a ca
Publikováno v:
Pattern Recognition Letters. 31:1302-1309
In this paper we describe the implementation of a complete ANN training procedure using the block mode back-propagation learning algorithm for sequential patterns - such as the observation feature vectors of a speech recognition system - exploiting t
Publikováno v:
Speech Communication. 49:827-835
This paper focuses on the adaptation of Automatic Speech Recognition systems using Hybrid models combining Artificial Neural Networks (ANN) with Hidden Markov Models (HMM). Most adaptation techniques for ANNs reported in the literature consist in add
Publikováno v:
IEEE Signal Processing Letters. 13:56-59
A soft decision gain modification is introduced and applied to the Ephraim-Malah gain function based on maximum mean-square-error estimation after amplitude compression. Nonlinear evaluations of the noise overestimation factor and spectral floor are
Publikováno v:
Computer Speech & Language. 20:2-21
This paper investigates the potential of exploiting the redundancy implicit in multiple resolution analysis for automatic speech recognition systems. The analysis is performed by a binary tree of elements, each one of which is made by a half-band fil
Autor:
Hung H. Bui, Andrew P. Breen, Roberto Gemello, Peter F. Patel-Schneider, Richard S. Crouch, Adwait Ratnaparkhi, Charles L. Ortiz, Peter Stubley, Jiaying Shen, Paul van Mulbregt, Ronald M. Kaplan, William F. Ganong, Holger Quast, Kevin R. Farrell, Friedrich Faubel, Tim Haulick, Vlad Sejnoha
Publikováno v:
Interactive Displays: Natural Human-Interface Technologies
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::948deb3c4e8788e05db243da9ffba46a
https://doi.org/10.1002/9781118706237.ch3
https://doi.org/10.1002/9781118706237.ch3
Publikováno v:
Computer Speech & Language. 15:341-354
This paper describes an investigation on the possibility of adding new features to classical Mel Scaled Cepstral Coefficients (MFCC) and their time derivatives. A hybrid Automatic Speech Recognition (ASR) system is used based on a Neural Network (NN)
Publikováno v:
Information Sciences. 123:3-11
This paper describes the benefits in recognition accuracy that can be achieved in a hybrid Hidden Markov Model – Neural Network (HMM–NN) recognition framework by using context-dependent subword units named Stationary–Transitional Units. These u
Publikováno v:
Neural Processing Letters. 8:163-179
At present the Multi-layer Perceptron model (MLP) is, without doubt, the most diffused neural network for applications. So, it is important, from an engineering point of view, to design and test methods to improve MLP efficiency at run time. This pap