Hybrid Lstm-Fsmn Networks for Acoustic Modeling

Autor:	Asa Oines, Pedro J. Moreno, Eugene Weinstein
Rok vydání:	2018
Předmět:	Sequential access memory Context model Artificial neural network Connectionism Computer science Speech recognition Feed forward Feedforward neural network Context (language use) Representation (mathematics)
Zdroj:	ICASSP
Popis:	This paper describes a series of experiments with neural networks containing long short-term memory (LSTM) [1] and feedforward sequential memory network (FSMN) [2]–[4] layers trained with the connectionist temporal classification (CTC) [5] criteria for acoustic modeling. We propose using a hybrid LSTM/FSMN (FLMN) architecture as an enhancement to conventional LSTM-only acoustic models. The addition of FSMN layers allows the network to model a fixed size representation of future context suitable for online speech recognition. Our experiments show that FLMN acoustic models significantly outperform conventional LSTM. We also compare the FLMN architecture with other methods of modeling future context. Finally, we present a modification of the FSMN architecture that improves performance by reducing the width of the FSMN output.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::df52ea81b1ff6651aa08824871dcb872 https://doi.org/10.1109/icassp.2018.8461563 Zobrazit plný text záznamu