Zobrazeno 1 - 2
of 2
pro vyhledávání: '"Shaik, Mohammed Maqsood"'
Pre-trained Transformer-based speech models have shown striking performance when fine-tuned on various downstream tasks such as automatic speech recognition and spoken language identification (SLID). However, the problem of domain mismatch remains a
Externí odkaz:
http://arxiv.org/abs/2312.07338
Self-supervised representation learning for speech often involves a quantization step that transforms the acoustic input into discrete units. However, it remains unclear how to characterize the relationship between these discrete units and abstract p
Externí odkaz:
http://arxiv.org/abs/2306.02405