Federated Learning for Privacy Preserving On-Device Speaker Recognition

Authors:	Tom Bäckström, Abraham Woubie
Contributors:	Department of Signal Processing and Acoustics, Speech Interaction Technology, Aalto-yliopisto, Aalto University
Language:	English
Publication year:	2021
Subject:	Privacy preserving; Human–computer interaction; Computer science; Speaker recognition; Federated learning
Description:	State-of-the-art speaker recognition systems are usually trained on a single computer using speech data collected from multiple users. However, these speech samples may contain private information that users are not willing to share. To avoid such potential breaches of privacy, we investigate the use of federated learning in speaker recognition. Distributed learning methods such as federated learning make it possible to train a shared model without sharing private data, because the models are trained on the edge devices where the data resides. In the proposed system, each edge device trains an individual model, which is subsequently sent to a secure aggregator. To provide contrasting data without transmitting any data, we use a generative adversarial network (GAN) to generate impostor data at the edge. The secure aggregator then merges the individual models, builds a global model, and transmits the global model to the edge devices through a main server. Experimental results on the VoxCeleb1 dataset show that federated learning offers two advantages for speaker recognition systems. First, it preserves privacy, since the raw data does not leave the edge devices. Second, the aggregated model achieves a better average equal error rate than the individual models.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::20c18fd6839cfc826d5d2452755af16c https://aaltodoc.aalto.fi/handle/123456789/111229
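
The description says that a secure aggregator merges the individual edge models into a global model, but it does not state the merging rule. The sketch below illustrates one common choice, sample-weighted parameter averaging in the style of federated averaging, purely as an assumption. The function name `federated_average`, the dictionary-of-lists model layout, and the sample counts are hypothetical and not taken from the record. The secure aggregation protocol and the GAN-based impostor generation are not modelled here.

```python
from typing import Dict, List

# Hypothetical model layout: parameter name -> flat list of weights.
Model = Dict[str, List[float]]


def federated_average(models: List[Model], num_samples: List[int]) -> Model:
    """Merge edge-device models by sample-weighted parameter averaging.

    Assumption: every model has the same parameter names and shapes.
    Only model parameters are exchanged; raw speech never reaches this step.
    """
    if not models or len(models) != len(num_samples):
        raise ValueError("need one sample count per model")
    total = sum(num_samples)
    if total <= 0:
        raise ValueError("total sample count must be positive")

    global_model: Model = {}
    for name in models[0]:
        merged = [0.0] * len(models[0][name])
        for model, n in zip(models, num_samples):
            weight = n / total  # devices with more data contribute more
            for i, value in enumerate(model[name]):
                merged[i] += weight * value
        global_model[name] = merged
    return global_model


# Two hypothetical edge devices holding 1 and 3 local utterances.
edge_models = [{"w": [1.0, 2.0]}, {"w": [3.0, 4.0]}]
global_model = federated_average(edge_models, [1, 3])
print(global_model)
```

In a deployment of the kind the description outlines, the returned global model would be sent back to the edge devices through the main server. Weighting by local sample count is one design option; an unweighted mean would be equally consistent with the text.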