Search results - "Sereda, Taras"

Report

Transcribe, Align and Segment: Creating speech datasets for low-resource languages

Author: Sereda, Taras

In this work, we showcase a cost-effective method for generating training data for speech processing tasks. First, we transcribe unlabeled speech using a state-of-the-art Automatic Speech Recognition (ASR) model. Next, we align generated transcripts

External link: http://arxiv.org/abs/2406.12674

Report

Pheme: Efficient and Conversational Speech Generation

Authors: Budzianowski, Paweł; Sereda, Taras; Cichy, Tomasz; Vulić, Ivan

In recent years, speech generation has seen remarkable progress, now achieving one-shot generation capability that is often virtually indistinguishable from real human voice. Integrating such advancements in speech generation with large language mode

External link: http://arxiv.org/abs/2401.02839
