Zobrazeno 1 - 2
of 2
pro vyhledávání: '"Sereda, Taras"'
Autor:
Sereda, Taras
In this work, we showcase a cost-effective method for generating training data for speech processing tasks. First, we transcribe unlabeled speech using a state-of-the-art Automatic Speech Recognition (ASR) model. Next, we align generated transcripts
Externí odkaz:
http://arxiv.org/abs/2406.12674
In recent years, speech generation has seen remarkable progress, now achieving one-shot generation capability that is often virtually indistinguishable from real human voice. Integrating such advancements in speech generation with large language mode
Externí odkaz:
http://arxiv.org/abs/2401.02839