Method of Speech Signal Structuring and Transforming for Biometric Personality Identification

Autor: Eugene Fedorov, Yaroslav Korpan, Tetyana Utkina, Olga Nechyporenko
Rok vydání: 2020
Předmět:
Zdroj: Communications in Computer and Information Science ISBN: 9783030616557
DSMP
Popis: This paper proposes a method for structuring and transforming a speech signal. For this, segmentation method, methods for determining the fundamental tone of the vocal segment and determining on its basis the boundaries of the quasiperiodic oscillations of the vocal segment, the geometric transformation of quasiperiodic oscillations of the vocal segment were suggested. The proposed segmentation of the speech signal uses statistical estimation of short-term energies, which allows the use of an adaptive threshold, thus increasing the vocal segments determination accuracy. The proposed definition of fundamental tone of the vocal segment uses bandpass filtering and statistical estimation of local extremum, which reduces computational complexity, and also reduces noise dependency and allows the use of an adaptive threshold, thus increasing the accuracy of determining the fundamental tone and the boundaries of quasiperiodic oscillations of the vocal segment. The proposed geometric transformation of quasiperiodic oscillations of the vocal segment allows you to transform quasiperiodic oscillations to a single amplitude-time window, which allows you to form patterns of the vocal segment, taking into account its structure. A method for determining a model structure for transforming speech signal patterns is proposed, which is based on a statistical evaluation of the quality of the transforming, which provides a high degree of compression and the speech signal identification.
Databáze: OpenAIRE