Výsledky vyhledávání - "Dongsuk Yook"

Akademický článek

Pipeline Parallelism With Elastic Averaging

Autor: Bongwon Jang, In-Chul Yoo, Dongsuk Yook

Publikováno v: IEEE Access, Vol 12, Pp 5477-5489 (2024)

To accelerate the training speed of massive DNN models on large-scale datasets, distributed training techniques, including data parallelism and model parallelism, have been extensively studied. In particular, pipeline parallelism, which is derived fr

Externí odkaz: https://doaj.org/article/b8cd67204d4b4e04a982bd6eb9102078

Zobrazit plný text záznamu

Akademický článek

CycleDiffusion: Voice Conversion Using Cycle-Consistent Diffusion Models

Autor: Dongsuk Yook, Geonhee Han, Hyung-Pil Chang, In-Chul Yoo

Publikováno v: Applied Sciences, Vol 14, Iss 20, p 9595 (2024)

Voice conversion (VC) refers to the technique of modifying one speaker’s voice to mimic another’s while retaining the original linguistic content. This technology finds its applications in fields such as speech synthesis, accent modification, med

Externí odkaz: https://doaj.org/article/ebf0ad970b234e8c92a4a8e481f44e32

Zobrazit plný text záznamu

Akademický článek

Wav2wav: Wave-to-Wave Voice Conversion

Autor: Changhyeon Jeong, Hyung-pil Chang, In-Chul Yoo, Dongsuk Yook

Publikováno v: Applied Sciences, Vol 14, Iss 10, p 4251 (2024)

Voice conversion is the task of changing the speaker characteristics of input speech while preserving its linguistic content. It can be used in various areas, such as entertainment, medicine, and education. The quality of the converted speech is cruc

Externí odkaz: https://doaj.org/article/91e0b0e49dd24f37ad396235d6d3f91b

Zobrazit plný text záznamu

Akademický článek

Pipelined Stochastic Gradient Descent with Taylor Expansion

Autor: Bongwon Jang, Inchul Yoo, Dongsuk Yook

Publikováno v: Applied Sciences, Vol 13, Iss 21, p 11730 (2023)

Stochastic gradient descent (SGD) is an optimization method typically used in deep learning to train deep neural network (DNN) models. In recent studies for DNN training, pipeline parallelism, a type of model parallelism, is proposed to accelerate SG

Externí odkaz: https://doaj.org/article/08e927370db64a319afb66ce0f96d4bc

Zobrazit plný text záznamu

Akademický článek

Zero-Shot Unseen Speaker Anonymization via Voice Conversion

Autor: Hyung-Pil Chang, In-Chul Yoo, Changhyeon Jeong, Dongsuk Yook

Publikováno v: IEEE Access, Vol 10, Pp 130190-130199 (2022)

Speech-based interfaces provide convenient methods for controlling various smart devices. For these interfaces to work reliably, considerable speech data with various noise and speaker characteristics must be collected to train the associated speech-

Externí odkaz: https://doaj.org/article/3377a4a66af94d80b8c0765a2dfdd743

Zobrazit plný text záznamu

Akademický článek

Speaker Anonymization for Personal Information Protection Using Voice Conversion Techniques

Autor: In-Chul Yoo, Keonnyeong Lee, Seonggyun Leem, Hyunwoo Oh, Bonggu Ko, Dongsuk Yook

Publikováno v: IEEE Access, Vol 8, Pp 198637-198645 (2020)

As speech-based user interfaces integrated in the devices such as AI speakers become ubiquitous, a large amount of user voice data is being collected to enhance the accuracy of speech recognition systems. Since such voice data contain personal inform

Externí odkaz: https://doaj.org/article/e61ec8d9da08413884c19ad541041066

Zobrazit plný text záznamu

Speaker Anonymization for Personal Information Protection Using Voice Conversion Techniques

Autor: Dongsuk Yook, In-Chul Yoo, Hyunwoo Oh, Keonnyeong Lee, Seong-Gyun Leem, BongGu Ko

Publikováno v: IEEE Access, Vol 8, Pp 198637-198645 (2020)

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::3db20222342f42dc1ae8bed580eb772c
https://doi.org/10.1109/access.2020.3035416

Zobrazit plný text záznamu

Multitask Learning of Deep Neural Network-Based Keyword Spotting for IoT Devices

Autor: Dongsuk Yook, Seong-Gyun Leem, In-Chul Yoo

Publikováno v: IEEE Transactions on Consumer Electronics. 65:188-194

Speech-based interfaces are convenient and intuitive, and therefore, strongly preferred by Internet of Things (IoT) devices for human–computer interaction. Pre-defined keywords are typically used as a trigger to notify devices for inputting the sub

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::867322127f0660c426316a1748fad8af
https://doi.org/10.1109/tce.2019.2899067

Zobrazit plný text záznamu

Many-to-Many Voice Conversion Using Cycle-Consistent Variational Autoencoder with Multiple Decoders

Autor: In-Chul Yoo, Dongsuk Yook, Keonnyeong Lee, Seong-Gyun Leem

Publikováno v: Odyssey

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::cffacf1388595db65241e0c831db6542
https://doi.org/10.21437/odyssey.2020-31

Zobrazit plný text záznamu

Many-To-Many Voice Conversion Using Conditional Cycle-Consistent Adversarial Networks

Autor: Dongsuk Yook, In-Chul Yoo, Keonnyeong Lee, BongGu Ko, Shindong Lee

Publikováno v: ICASSP

Voice conversion (VC) refers to transforming the speaker characteristics of an utterance without altering its linguistic contents. Many works on voice conversion require to have parallel training data that is highly expensive to acquire. Recently, th

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a0677b488de6ec9030cf80a69e918b43
https://doi.org/10.1109/icassp40776.2020.9053726

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání