Privacy-Oriented Manipulation of Speaker Representations

Authors:	Francisco Teixeira, Alberto Abad, Bhiksha Raj, Isabel Trancoso
Language:	English
Year of publication:	2024
Subject:	Age information removal; attribute-based privacy; sex information removal; privacy-oriented manipulation; speaker embeddings; speaker recognition; Electrical engineering. Electronics. Nuclear engineering; TK1-9971
Source:	IEEE Access, Vol 12, Pp 82949-82971 (2024)
Document type:	article
ISSN:	2169-3536
DOI:	10.1109/ACCESS.2024.3409067
Description:	Speaker embeddings are ubiquitous, with applications ranging from speaker recognition and diarization to speech synthesis and voice anonymization. The amount of information held by these embeddings lends them versatility but also raises privacy concerns. Speaker embeddings have been shown to contain sensitive information, including the speaker’s age, sex, health state and more – in other words, information that speakers may want to keep private, especially when it is not required for the target task. In this work, we propose a method for removing and manipulating private attribute information in speaker representations that leverages a Vector-Quantized Variational Autoencoder architecture combined with an adversarial classifier and a novel mutual information loss. We validate our model on two attributes, sex and age, and perform experiments to remove or manipulate this information using ignorant and informed attackers. The model is tested with in-domain and out-of-domain data to assess its robustness, and the resulting speaker representations are used in a speaker verification scenario to validate their utility. Our results show that our model obtains a strong trade-off between utility and privacy, achieving age and sex classification results near chance level for both attackers and yielding little impact on speaker verification performance.
Database:	Directory of Open Access Journals
External link:	https://doaj.org/article/0ddad9e46c0c40ffb71513cb0fa3ee20
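
Note on the description: the abstract names a Vector-Quantized Variational Autoencoder as the backbone of the method. As a rough illustration of what vector quantization generally means, the sketch below snaps a continuous embedding to its nearest entry in a codebook. It is a generic toy example, not the authors' model: the function name `quantize`, the 2-D codebook, and the input vector are all hypothetical, and the adversarial classifier and mutual information loss mentioned in the abstract are not represented.

```python
import math


def quantize(embedding, codebook):
    """Return (index, codeword) of the codebook entry nearest to `embedding`.

    Hypothetical helper for illustration only; Euclidean distance is an
    assumption, chosen because it is the usual choice in VQ-VAE models.
    """
    best_index = min(
        range(len(codebook)),
        key=lambda i: math.dist(embedding, codebook[i]),
    )
    return best_index, codebook[best_index]


# Toy 2-D codebook (assumed values); real speaker embeddings are
# typically much higher-dimensional.
codebook = [[0.0, 0.0], [1.0, 1.0], [-1.0, 2.0]]

# The continuous vector is replaced by its closest discrete codeword.
index, codeword = quantize([0.9, 1.2], codebook)
```

In a trained model the codebook is learned jointly with the encoder and decoder; here it is fixed by hand only to show the lookup step.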