Playing with NeMo for building an automatic speech recogniser for Italian

Autor: Tamburini F.
Přispěvatelé: Elisabetta Fersini, Marco Passarotti, Viviana Patti, Tamburini F.
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Popis: This paper presents work in progress for the creation of a Large Vocabulary Automatic Speech Recogniser for Italian using NVIDIA NeMo. Thanks to this package, we were able to build a reliable recogniser for adults' speech by fine tuning the English model provided by NVIDIA and rescoring it with powerful neural language models, obtaining very good performances. The lack of a standard, reliable and publicy available baseline for Italian motivated this work.
Databáze: OpenAIRE