Python Language Training System Based on MFCC, VQ, Variational Coefficient and KNTM Algorithm

Autor: José Luis Oropeza Rodríguez, Gustavo Asumu Mboro Nchama, Roberto Rodriguez Morales, Sergio Suárez Guerra, Leandro Daniel Lau Alfonso
Rok vydání: 2021
Předmět:
Zdroj: Mathematics and Computer Science. 6:38
ISSN: 2575-6036
DOI: 10.11648/j.mcs.20210602.12
Popis: This contribution describes the second stage of the creation of a language training system programmed in Python with the aim of application to speech therapy in spanish-speaking countries, starting the study in Cuba. The first stage of this research was carried out in Matlab by analyzing the dynamics of change of the centroids of the codebooks, extracted from words pronounced by a locutor. As second stage, the Variational Coefficient formula is used in order to estimate the percentage of effectiveness with which the announcer performs voice training. A modified approach to programming the variational coefficient is taken into account as a measure of dispersion of a group of vectors. The modification is given by taking the mean of the group of vectors as the vector that represents the phonetic boundaries of the word to be trained. Besides, a novel approach for word recognition is used, based on the K-Nearest Training Matrix (KNTM) algorithm that lays its foundations in the analysis of matrix similarity taken the Frobenius norm as a measure to distinguish similar or non-similar characteristics of a matrix with respect to a database of matrices. To reduce the computational cost of the program and speed up its proper functioning, the training matrices of the database are saved in files with a .tex extension, in this way after training process, the program should only read them and not recalculate them, which significantly reduces the running time of the algorithm.
Databáze: OpenAIRE