Zobrazeno 1 - 10
of 61
pro vyhledávání: '"Dongsuk Yook"'
Publikováno v:
IEEE Access, Vol 12, Pp 5477-5489 (2024)
To accelerate the training speed of massive DNN models on large-scale datasets, distributed training techniques, including data parallelism and model parallelism, have been extensively studied. In particular, pipeline parallelism, which is derived fr
Externí odkaz:
https://doaj.org/article/b8cd67204d4b4e04a982bd6eb9102078
Publikováno v:
Applied Sciences, Vol 14, Iss 20, p 9595 (2024)
Voice conversion (VC) refers to the technique of modifying one speaker’s voice to mimic another’s while retaining the original linguistic content. This technology finds its applications in fields such as speech synthesis, accent modification, med
Externí odkaz:
https://doaj.org/article/ebf0ad970b234e8c92a4a8e481f44e32
Publikováno v:
Applied Sciences, Vol 14, Iss 10, p 4251 (2024)
Voice conversion is the task of changing the speaker characteristics of input speech while preserving its linguistic content. It can be used in various areas, such as entertainment, medicine, and education. The quality of the converted speech is cruc
Externí odkaz:
https://doaj.org/article/91e0b0e49dd24f37ad396235d6d3f91b
Publikováno v:
Applied Sciences, Vol 13, Iss 21, p 11730 (2023)
Stochastic gradient descent (SGD) is an optimization method typically used in deep learning to train deep neural network (DNN) models. In recent studies for DNN training, pipeline parallelism, a type of model parallelism, is proposed to accelerate SG
Externí odkaz:
https://doaj.org/article/08e927370db64a319afb66ce0f96d4bc
Publikováno v:
IEEE Access, Vol 10, Pp 130190-130199 (2022)
Speech-based interfaces provide convenient methods for controlling various smart devices. For these interfaces to work reliably, considerable speech data with various noise and speaker characteristics must be collected to train the associated speech-
Externí odkaz:
https://doaj.org/article/3377a4a66af94d80b8c0765a2dfdd743
Publikováno v:
IEEE Access, Vol 8, Pp 198637-198645 (2020)
As speech-based user interfaces integrated in the devices such as AI speakers become ubiquitous, a large amount of user voice data is being collected to enhance the accuracy of speech recognition systems. Since such voice data contain personal inform
Externí odkaz:
https://doaj.org/article/e61ec8d9da08413884c19ad541041066
Publikováno v:
IEEE Access, Vol 8, Pp 198637-198645 (2020)
As speech-based user interfaces integrated in the devices such as AI speakers become ubiquitous, a large amount of user voice data is being collected to enhance the accuracy of speech recognition systems. Since such voice data contain personal inform
Publikováno v:
IEEE Transactions on Consumer Electronics. 65:188-194
Speech-based interfaces are convenient and intuitive, and therefore, strongly preferred by Internet of Things (IoT) devices for human–computer interaction. Pre-defined keywords are typically used as a trigger to notify devices for inputting the sub
Publikováno v:
Odyssey
Publikováno v:
ICASSP
Voice conversion (VC) refers to transforming the speaker characteristics of an utterance without altering its linguistic contents. Many works on voice conversion require to have parallel training data that is highly expensive to acquire. Recently, th