An overview of voice conversion systems
Autor: | Alexander Kain, Seyed Hamidreza Mohammadi |
---|---|
Rok vydání: | 2017 |
Předmět: |
Linguistics and Language
Thesaurus (information retrieval) Voice activity detection Computer science Communication Speech recognition Speech quality Voice transformation SIGNAL (programming language) 020206 networking & telecommunications 02 engineering and technology Language and Linguistics Computer Science Applications Voice analysis 030507 speech-language pathology & audiology 03 medical and health sciences Rule-based machine translation Modeling and Simulation 0202 electrical engineering electronic engineering information engineering Computer Vision and Pattern Recognition 0305 other medical science Software Sentence |
Zdroj: | Speech Communication. 88:65-82 |
ISSN: | 0167-6393 |
DOI: | 10.1016/j.specom.2017.01.008 |
Popis: | Voice transformation (VT) aims to change one or more aspects of a speech signal while preserving linguistic information. A subset of VT, Voice conversion (VC) specifically aims to change a source speakers speech in such a way that the generated output is perceived as a sentence uttered by a target speaker. Despite many years of research, VC systems still exhibit deficiencies in accurately mimicking a target speaker spectrally and prosodically, and simultaneously maintaining high speech quality. In this work we provide an overview of real-world applications, extensively study existing systems proposed in the literature, and discuss remaining challenges. |
Databáze: | OpenAIRE |
Externí odkaz: |