Zobrazeno 1 - 10
of 26 032
pro vyhledávání: '"Ji hoon An"'
Autor:
Nakajima, Kimihiko, Ouchi, Masami, Isobe, Yuki, Xu, Yi, Ozaki, Shinobu, Nagao, Tohru, Inoue, Akio K., Rauch, Michael, Kusakabe, Haruka, Onodera, Masato, Nishigaki, Moka, Ono, Yoshiaki, Sugahara, Yuma, Hattori, Takashi, Hirai, Yutaka, Hashimoto, Takuya, Kim, Ji Hoon, Moriya, Takashi J., Yanagisawa, Hiroto, Aoyama, Shohei, Fujimoto, Seiji, Fukushima, Hajime, Fukushima, Keita, Harikane, Yuichi, Hatano, Shun, Hayashi, Kohei, Ishigaki, Tsuyoshi, Kawasaki, Masahiro, Kojima, Takashi, Komiyama, Yutaka, Koyama, Shuhei, Koyama, Yusei, Lee, Chien-Hsiu, Matsumoto, Akinori, Mawatari, Ken, Motohara, Kentaro, Murai, Kai, Nagamine, Kentaro, Nakane, Minami, Saito, Tomoki, Sasaki, Rin, Shibuya, Takatoshi, Suzuki, Akihiro, Takeuchi, Tsutomu T., Umeda, Hiroya, Umemura, Masayuki, Watanabe, Kuria, Yabe, Kiyoto, Yajima, Hidenobu, Zhang, Yechi
Using the Subaru/FOCAS IFU capability, we examine the spatially resolved relationships between gas-phase metallicity, stellar mass, and star-formation rate surface densities (Sigma_* and Sigma_SFR, respectively) in extremely metal-poor galaxies (EMPG
Externí odkaz:
http://arxiv.org/abs/2412.04541
In this paper, we introduce V2SFlow, a novel Video-to-Speech (V2S) framework designed to generate natural and intelligible speech directly from silent talking face videos. While recent V2S systems have shown promising results on constrained datasets
Externí odkaz:
http://arxiv.org/abs/2411.19486
Autor:
Nguyen, Tan Dat, Kim, Ji-Hoon, Choi, Jeongsoo, Choi, Shukjae, Park, Jinseok, Lee, Younglo, Chung, Joon Son
The goal of this paper is to accelerate codec-based speech synthesis systems with minimum sacrifice to speech quality. We propose an enhanced inference method that allows for flexible trade-offs between speed and quality during inference without requ
Externí odkaz:
http://arxiv.org/abs/2410.13839
Autor:
Jung, Jee-weon, Wu, Yihan, Wang, Xin, Kim, Ji-Hoon, Maiti, Soumi, Matsunaga, Yuta, Shim, Hye-jin, Tian, Jinchuan, Evans, Nicholas, Chung, Joon Son, Zhang, Wangyou, Um, Seyun, Takamichi, Shinnosuke, Watanabe, Shinji
This paper introduces SpoofCeleb, a dataset designed for Speech Deepfake Detection (SDD) and Spoofing-robust Automatic Speaker Verification (SASV), utilizing source data from real-world conditions and spoofing attacks generated by Text-To-Speech (TTS
Externí odkaz:
http://arxiv.org/abs/2409.17285
Autor:
Jung, Jee-weon, Zhang, Wangyou, Maiti, Soumi, Wu, Yihan, Wang, Xin, Kim, Ji-Hoon, Matsunaga, Yuta, Um, Seyun, Tian, Jinchuan, Shim, Hye-jin, Evans, Nicholas, Chung, Joon Son, Takamichi, Shinnosuke, Watanabe, Shinji
Text-to-speech (TTS) systems are traditionally trained using modest databases of studio-quality, prompted or read speech collected in benign acoustic environments such as anechoic rooms. The recent literature nonetheless shows efforts to train TTS sy
Externí odkaz:
http://arxiv.org/abs/2409.08711
Autor:
Lee, Jun-Young, Kim, Ji-hoon, Jung, Minyong, Oh, Boon Kiat, Jo, Yongseok, Park, Songyoun, Lee, Jaehyun, Ting, Yuan-Sen, Hwang, Ho Seong
We present a proof-of-concept simulation-based inference on $\Omega_{\rm m}$ and $\sigma_{8}$ from the SDSS BOSS LOWZ NGC catalog using neural networks and domain generalization techniques without the need of summary statistics. Using rapid lightcone
Externí odkaz:
http://arxiv.org/abs/2409.02256
We introduce a GPU-accelerated hybrid hydro/N-body code (Enzo-N) designed to address the challenges of concurrently simulating star clusters and their parent galaxies. This task has been exceedingly challenging, primarily due to the considerable comp
Externí odkaz:
http://arxiv.org/abs/2408.03128
Autor:
Roca-Fàbrega, Santi, Kim, Ji-hoon, Primack, Joel R., Genina, Anna, Jung, Minyong, Lupi, Alessandro, Nagamine, Kentaro, Powell, Johnny W., Quinn, Thomas R., Revaz, Yves, Shimizu, Ikkoh, Velázquez, Héctor, Collaboration, the AGORA
The AGORA Cosmorun (arXiv:2106.09738) is a set of hydrodynamical cosmological zoom-in simulations carried out within the AGORA High-resolution Galaxy Simulations Comparison Project (arXiv:1308.2669,arXiv:1610.03066). These simulations show the format
Externí odkaz:
http://arxiv.org/abs/2408.00432
Autor:
Ahn, Junseok, Kim, Youkyum, Choi, Yeunju, Kwak, Doyeop, Kim, Ji-Hoon, Mun, Seongkyu, Chung, Joon Son
This paper introduces VoxSim, a dataset of perceptual voice similarity ratings. Recent efforts to automate the assessment of speech synthesis technologies have primarily focused on predicting mean opinion score of naturalness, leaving speaker voice s
Externí odkaz:
http://arxiv.org/abs/2407.18505
We introduce a novel software called TCSpy which is designed to efficiently control a multi-telescope array through network-based protocols. The primary objectives of TCSpy include centralized control of the array, support for diverse observation mod
Externí odkaz:
http://arxiv.org/abs/2407.13315