Latest Development of Multilingual Speech Recognition Acoustic Model Modeling Methods

Autor: CHENG Gao-feng, YAN Yong-hong
Jazyk: čínština
Rok vydání: 2022
Předmět:
Zdroj: Jisuanji kexue, Vol 49, Iss 1, Pp 47-52 (2022)
Druh dokumentu: article
ISSN: 1002-137X
DOI: 10.11896/jsjkx.210900013
Popis: With the rapid development of multimedia and communication technology,the amount of multilingual speech data on the Internet is increasing.Speech recognition technology is the core for media analysis and processing.How to quickly expand from a few major languages such as Chinese and English to more languages has become a prominent issue yet to be overcome in order to improve multilingual processing capabilities.This article summarizes the latest progress in the field of acoustic model modeling,and discusses breakthroughs needed by traditional speech recognition technology in the course of moving from single language to multi-languages.The latest end-to-end speech recognition technology was exploited to construct a keyword spotting system,and the system achieves favorable performance.The approach is detailed as follows:1)multi-lingual hierarchical and structured acoustic model modeling method;2)multilingual acoustic modeling based on language classification information;3)end-to-end keyword spotting based on frame-synchronous alignments.
Databáze: Directory of Open Access Journals