Performance Comparison of AR Codebook Training for Speech Processing

Autor: Changchun Bao, Jesper Kjar Nielsen, Mads Grasboll Christensen, Zihao Cui
Rok vydání: 2020
Předmět:
Zdroj: Cui, Z, Bao, C, Nielsen, J K & Christensen, M G 2020, Performance Comparison of AR Codebook Training for Speech Processing . in 2020 15th IEEE International Conference on Signal Processing (ICSP) ., 9320929, IEEE Signal Processing Society, IEEE International Conference on Signal Processing (ICSP), pp. 131-135, 15th IEEE International Conference on Signal Processing, 06/12/2020 . https://doi.org/10.1109/ICSP48669.2020.9320929
DOI: 10.1109/icsp48669.2020.9320929
Popis: In this paper, different ways of training codebook containing autoregressive (AR) parameter vectors are discussed. The fundamental goal of the discussion is to investigate if the classical approach for training AR-codebooks by clustering line spectral frequencies (LSF) can be improved. To do this, we discuss and evaluate the alternatives in terms of the de-correlated AR-parameters and manifold learning. The different training methods are evaluated using different metrics quantifying the distance between actual power spectral density (PSD) and the estimated PSD from the AR-codebook. The experimental results show that the training on the de-correlated features can improve the performance to some degree compared to the traditional LSF training approach in terms of the Itakura-Saito divergence not in terms of the Kullback-Leibler divergence, the log-spectral distortion and speech distortion.
Databáze: OpenAIRE