Voice spoofing detection using a neural networks assembly considering spectrograms and mel frequency cepstral coefficients.

Autor: Alberto Hernández-Nava, Carlos, Alfredo Rincón-García, Eric, Lara-Velázquez, Pedro, de-los-Cobos-Silva, Sergio Gerardo, Angel Gutiérrez-Andrade, Miguel, Anselmo Mora-Gutiérrez, Roman
Předmět:
Zdroj: PeerJ Computer Science; Dec2023, p1-16, 16p
Abstrakt: Nowadays, biometric authentication has gained relevance due to the technological advances that have allowed its inclusion in many daily-use devices. However, this same advantage has also brought dangers, as spoofing attacks are now more common. This work addresses the vulnerabilities of automatic speaker verification authentication systems, which are prone to attacks arising from new techniques for the generation of spoofed audio. In this article, we present a countermeasure for these attacks using an approach that includes easy to implement feature extractors such as spectrograms and mel frequency cepstral coefficients, as well as a modular architecture based on deep neural networks. Finally, we evaluate our proposal using the well-know ASVspoof 2017 V2 database, the experiments show that using the final architecture the best performance is obtained, achieving an equal error rate of 6.66% on the evaluation set. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index