Study of Large Data Resources for Multilingual Training and System Porting

Autor:	Martin Karafiat, Ekaterina Egorova, Frantisek Grezl
Rok vydání:	2016
Předmět:	multilingual training Computer science Speech recognition Feature extraction 02 engineering and technology computer.software_genre Stacked Bottle-Neck Porting Set (abstract data type) 030507 speech-language pathology & audiology 03 medical and health sciences 0202 electrical engineering electronic engineering information engineering Cluster analysis General Environmental Science large data Artificial neural network business.industry feature extraction Triphone General Earth and Planetary Sciences 020201 artificial intelligence & image processing Artificial intelligence 0305 other medical science business computer Fisher database Natural language processing
Zdroj:	SLTU Procedia Computer Science
ISSN:	1877-0509
Popis:	This study investigates the behavior of a feature extraction neural network model trained on a large amount of single language data (“source language”) on a set of under-resourced target languages. The coverage of the source language acoustic space was changed in two ways: (1) by changing the amount of training data and (2) by altering the level of detail of acoustic units (by changing the triphone clustering). We observe the effect of these changes on the performance on target language in two scenarios: (1) the source-language NNs were used directly, (2) NNs were first ported to target language. The results show that increasing coverage as well as level of detail on the source language improves the target language system performance in both scenarios. For the first one, both source language characteristic have about the same effect. For the second scenario, the amount of data in source language is more important than the level of detail. The possibility to include large data into multilingual training set was also investigated. Our experiments point out possible risk of over-weighting the NNs towards the source language with large data. This degrades the performance on part of the target languages, compared to the setting where the amounts of data per language are balanced.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::561d2aec6e41cac756337fe497441f01 https://doi.org/10.1016/j.procs.2016.04.024 Zobrazit plný text záznamu