Comparative analysis of natural and synthesized Polish speech

Authors: Michał Daniluk, Agnieszka Paula Pietrzak
Language: English
Publication year: 2024
Subject:
Source: International Journal of Electronics and Telecommunications, Vol. 70, No. 2, pp. 361-366 (2024)
Document type: article
ISSN: 2081-8491, 2300-1933
DOI: 10.24425/ijet.2024.149553
Description: In the evolving field of speech synthesis, both intelligibility and naturalness remain important factors. This paper presents a comparative analysis of natural and synthesized Polish speech. Four speech synthesizers were examined: Ivona, Mekatron, Notevibes, and ttsmp3. Four methods for assessing the quality of synthesized speech and comparing it to natural speech are presented: the AB test, MOS, the logatom articulation test, and MUSHRA. Sentence databases and a database of logatoms were generated for each synthesizer and recorded for natural speech. The results indicated that natural speech was consistently better than synthesized speech. Among the synthesizers, Notevibes performed best in all comparisons, while Mekatron ranked lowest.
Database: Directory of Open Access Journals