Showing 1 - 10 of 538 for search: '"Abad, Alberto"'
Author:
Carvalho, Carlos, Abad, Alberto
Self-supervised learning (SSL) leverages large amounts of unlabelled data to learn rich speech representations, fostering improvements in automatic speech recognition (ASR), even when only a small amount of labelled data is available for fine-tuning.
External link:
http://arxiv.org/abs/2410.14910
Speech is a rich biomarker that encodes substantial information about the health of a speaker, and thus it has been proposed for the detection of numerous diseases, achieving promising results. However, questions remain about what the models trained
External link:
http://arxiv.org/abs/2409.10230
Author:
Teixeira, Francisco, Pizzi, Karla, Olivier, Raphael, Abad, Alberto, Raj, Bhiksha, Trancoso, Isabel
Membership Inference (MI) poses a substantial privacy threat to the training data of Automatic Speech Recognition (ASR) systems, while also offering an opportunity to audit these models with regard to user data. This paper explores the effectiveness
External link:
http://arxiv.org/abs/2405.01207
Published in:
IEEE Access, vol. 12, pp. 82949-82971, 2024
Speaker embeddings are ubiquitous, with applications ranging from speaker recognition and diarization to speech synthesis and voice anonymisation. The amount of information held by these embeddings lends them versatility, but also raises privacy conc
External link:
http://arxiv.org/abs/2310.06652
Author:
Carvalho, Carlos, Abad, Alberto
Published in:
Proc. Interspeech 2023, 2218-2222
Conformers have recently been proposed as a promising modelling approach for automatic speech recognition (ASR), outperforming recurrent neural network-based approaches and transformers. Nevertheless, in general, the performance of these end-to-end m
External link:
http://arxiv.org/abs/2309.13029
Of all components of Prosody, Rhythm has been regarded as the hardest to address, as it is utterly linked to Pitch and Intensity. Nevertheless, Rhythm is a very good indicator of a speaker's fluency in a foreign language or even of some diseases. Can
External link:
http://arxiv.org/abs/2212.10201
Automatic Speaker Diarization (ASD) is an enabling technology with numerous applications, which deals with recordings of multiple speakers, raising special concerns in terms of privacy. In fact, in remote settings, where recordings are shared with a
External link:
http://arxiv.org/abs/2210.14995
Published in:
Proc. Interspeech 2022, 2798-2802
The development of privacy-preserving automatic speaker verification systems has been the focus of a number of studies with the intent of allowing users to authenticate themselves without risking the privacy of their voice. However, current privacy-p
External link:
http://arxiv.org/abs/2206.11750
Published in:
In International Journal of Hydrogen Energy 26 July 2024 76:281-289