The IMS Toucan System for the Blizzard Challenge 2023

Authors: Lux, Florian; Koch, Julia; Meyer, Sarina; Bott, Thomas; Schauffler, Nadja; Denisov, Pavel; Schweitzer, Antje; Vu, Ngoc Thang
Year of publication: 2023
Subject:
Document type: Working Paper
Description: For our contribution to the Blizzard Challenge 2023, we improved on the system we submitted to the Blizzard Challenge 2021. Our approach entails a rule-based text-to-phoneme processing system that includes rule-based disambiguation of homographs in the French language. It then transforms the phonemes to spectrograms as intermediate representations using a fast and efficient non-autoregressive synthesis architecture based on Conformer and Glow. A GAN-based neural vocoder that combines recent state-of-the-art approaches converts the spectrogram to the final waveform. We carefully designed the data processing, training, and inference procedures for the challenge data. Our system identifier is G. Open source code and demo are available.
Comment: Published at the Blizzard Challenge Workshop 2023, co-located with the Speech Synthesis Workshop 2023, a satellite event of Interspeech 2023
Database: arXiv