Zobrazeno 1 - 10
of 1 059 218
pro vyhledávání: '"Chung AS"'
We propose FD3, a fundus image enhancement method based on direct diffusion bridges, which can cope with a wide range of complex degradations, including haze, blur, noise, and shadow. We first propose a synthetic forward model through a human feedbac
Externí odkaz:
http://arxiv.org/abs/2409.12377
Consider the problems of computing the Augustin information and a R\'{e}nyi information measure of statistical independence, previously explored by Lapidoth and Pfister (IEEE Information Theory Workshop, 2018) and Tomamichel and Hayashi (IEEE Trans.
Externí odkaz:
http://arxiv.org/abs/2409.02640
Autor:
Han, Cheongho, Bond, Ian A., Udalski, Andrzej, Lee, Chung-Uk, Gould, Andrew, Albrow, Michael D., Chung, Sun-Ju, Hwang, Kyu-Ha, Jung, Youn Kil, Ryu, Yoon-Hyun, Shvartzvald, Yossi, Shin, In-Gu, Yee, Jennifer C., Yang, Hongjing, Zang, Weicheng, Cha, Sang-Mok, Kim, Doeon, Kim, Dong-Jin, Kim, Seung-Lee, Lee, Dong-Joo, Lee, Yongseok, Park, Byeong-Gon, Pogge, Richard W., Abe, Fumio, Bando, Ken, Barry, Richard, Bennett, David P., Bhattacharya, Aparna, Fujii, Hirosame, Fukui, Akihiko, Hamada, Ryusei, Hamada, Shunya, Hamasaki, Naoto, Hirao, Yuki, Silva, Stela Ishitani, Itow, Yoshitaka, Kirikawa, Rintaro, Koshimoto, Naoki, Matsubara, Yutaka, Miyazaki, Shota, Muraki, Yasushi, Nagai, Tutumi, Nunota, Kansuke, Olmschenk, Greg, Ranc, Clément, Rattenbury, Nicholas J., Satoh, Yuki, Sumi, Takahiro, Suzuki, Daisuke, Tomoyoshi, Mio, Tristram, Paul J., Vandorou, Aikaterini, Yama, Hibiki, Yamashita, Kansuke, Szymański, Przemek Mróz Michał K., Skowron, Jan, Poleski, Radosław, Soszyński, Igor, Pietrukowicz, Paweł, Kozłowski, Szymon, Rybicki, Krzysztof A., Iwanek, Patryk, Ulaczyk, Krzysztof, Wrona, Marcin, Gromadzki, Mariusz, Mróz, Mateusz J.
Building on previous works to construct a homogeneous sample of brown dwarfs in binary systems, we investigate microlensing events detected by the Korea Microlensing Telescope Network (KMTNet) survey during the 2022 and 2023 seasons. Given the diffic
Externí odkaz:
http://arxiv.org/abs/2408.11248
When assisting people in daily tasks, robots need to accurately interpret visual cues and respond effectively in diverse safety-critical situations, such as sharp objects on the floor. In this context, we present M-CoDAL, a multimodal-dialogue system
Externí odkaz:
http://arxiv.org/abs/2410.14141
Autor:
Chen, Bowen, Shang, Zaixi, Chung, Jae Won, Lerner, David, Robitza, Werner, Rao, Rakesh Rao Ramachandra, Raake, Alexander, Bovik, Alan C.
Demand for streaming services, including satellite, continues to exhibit unprecedented growth. Internet Service Providers find themselves at the crossroads of technological advancements and rising customer expectations. To stay relevant and competiti
Externí odkaz:
http://arxiv.org/abs/2410.13952
Autor:
Nguyen, Tan Dat, Kim, Ji-Hoon, Choi, Jeongsoo, Choi, Shukjae, Park, Jinseok, Lee, Younglo, Chung, Joon Son
The goal of this paper is to accelerate codec-based speech synthesis systems with minimum sacrifice to speech quality. We propose an enhanced inference method that allows for flexible trade-offs between speed and quality during inference without requ
Externí odkaz:
http://arxiv.org/abs/2410.13839
Video Temporal Grounding (VTG) aims to identify visual frames in a video clip that match text queries. Recent studies in VTG employ cross-attention to correlate visual frames and text queries as individual token sequences. However, these approaches o
Externí odkaz:
http://arxiv.org/abs/2410.13598
Autor:
Tabia, Gelo Noel M., Hsieh, Chung-Yun
Entanglement is an essential resource for various quantum-information tasks. When a target system shares entanglement with another memory system and is stored reliably, one can use entanglement at a later time -- this is quantum memory. In practice,
Externí odkaz:
http://arxiv.org/abs/2410.13499
Machine learning models are known to be vulnerable to adversarial attacks, but traditional attacks have mostly focused on single-modalities. With the rise of large multi-modal models (LMMs) like CLIP, which combine vision and language capabilities, n
Externí odkaz:
http://arxiv.org/abs/2410.13010
This paper introduces a cross-lingual statutory article retrieval (SAR) dataset designed to enhance legal information retrieval in multilingual settings. Our dataset features spoken-language-style legal inquiries in English, paired with corresponding
Externí odkaz:
http://arxiv.org/abs/2410.11450