Search results - "speech conversion"

Academic article

Machine Learning Approaches for Whisper to Normal Speech Conversion

Author: Marco A. Oliveira

Published in: U.Porto Journal of Engineering, Vol 8, Iss 2, Pp 202-212 (2022)

Whispered speech is a mode of speech that differs from normal speech due to the absence of a periodic component, namely the Fundamental Frequency that characterizes the pitch, among other spectral and temporal differences. Much attention has been giv

External link: https://doaj.org/article/d34e4004a2cd4232905b13a2687ee509

Academic article

CBFMCycleGAN-VC: Using the Improved MaskCycleGAN-VC to Effectively Predict a Person’s Voice After Aging

Authors: Xiaoqun Zhou, Ling Yu, Fanglin Niu, Junlin Jin

Published in: IEEE Access, Vol 10, Pp 114297-114305 (2022)

One task of nonparallel speech conversion is to convert the source speaker’s speech samples to the target speaker’s speech samples, keeping the content unchanged. In view of the advantages of MaskCycleGAN-VC in nonparallel speech conversion, such

External link: https://doaj.org/article/f4bc15553f854cdb8e4aa6a02b29dabe

Academic article

Enhancing Object Detection for VIPs Using YOLOv4_Resnet101 and Text-to-Speech Conversion Model

Authors: Tahani Jaser Alahmadi, Atta Ur Rahman, Hend Khalid Alkahtani, Hisham Kholidy

Published in: Multimodal Technologies and Interaction, Vol 7, Iss 8, p 77 (2023)

Vision impairment affects an individual’s quality of life, posing challenges for visually impaired people (VIPs) in various aspects such as object recognition and daily tasks. Previous research has focused on developing visual navigation systems to

External link: https://doaj.org/article/3e2203c5496148d6aea6d7983f5d8209

Academic article

Accent labeling algorithm based on morphological rules and machine learning in English conversion system

Authors: Liu Xiaofeng, Singh Pradeep Kumar, Pavlovich Pljonkin Anton

Published in: Journal of Intelligent Systems, Vol 30, Iss 1, Pp 881-892 (2021)

The dependency of a speech recognition system on the accent of a user leads to the variation in its performance, as the people from different backgrounds have different accents. Accent labeling and conversion have been reported as a prospective solut

External link: https://doaj.org/article/bf6d7f516c71462e9b3f88b5fadbf7ba

Academic article

Many-to-Many Unsupervised Speech Conversion From Nonparallel Corpora

Authors: Yun Kyung Lee, Hyun Woo Kim, Jeon Gue Park

Published in: IEEE Access, Vol 9, Pp 27278-27286 (2021)

We address a nonparallel data-driven many-to-many speech modeling and multimodal style conversion method. In this work, we train a speech conversion model for multiple domains rather than a specific source and target domain pair, and we generate dive

External link: https://doaj.org/article/bd7bacf1b34b449c9ce8a5e6283f902f

Academic article

Whispered Speech Conversion Based on the Inversion of Mel Frequency Cepstral Coefficient Features

Authors: Qiang Zhu, Zhong Wang, Yunfeng Dou, Jian Zhou

Published in: Algorithms, Vol 15, Iss 2, p 68 (2022)

A conversion method based on the inversion of Mel frequency cepstral coefficient (MFCC) features was proposed to convert whispered speech into normal speech. First, the MFCC features of whispered speech and normal speech were extracted and a matching

External link: https://doaj.org/article/0b75b1d0d98342de857df9bfd72bf7fa

Academic article

Multimodal Unsupervised Speech Translation for Recognizing and Evaluating Second Language Speech

Authors: Yun Kyung Lee, Jeon Gue Park

Published in: Applied Sciences, Vol 11, Iss 6, p 2642 (2021)

This paper addresses an automatic proficiency evaluation and speech recognition for second language (L2) speech. The proposed method recognizes the speech uttered by the L2 speaker, measures a variety of fluency scores, and evaluates the proficiency

External link: https://doaj.org/article/89b2250757e6474fb025b84fa0995e1f
