Zobrazeno 1 - 10
of 32 393
pro vyhledávání: '"Kim, So‐Won"'
We introduce Multimodal Matching based on Valence and Arousal (MMVA), a tri-modal encoder framework designed to capture emotional content across images, music, and musical captions. To support this framework, we expand the Image-Music-Emotion-Matchin
Externí odkaz:
http://arxiv.org/abs/2501.01094
Autor:
Kim, Dae-Won, Ros, Eduardo, Kadler, Matthias, Krichbaum, Thomas P., Zhao, Guang-Yao, Rösch, Florian, Lobanov, Andrei P., Zensus, J. Anton
We present a long-term strong correlation between millimeter (mm) radio and $\gamma$-ray emission in the flat-spectrum radio quasar (FSRQ) PKS 1424-418. The mm$-\gamma$-ray connection in blazars is generally thought to originate from the relativistic
Externí odkaz:
http://arxiv.org/abs/2411.19737
Autor:
Oh, Yujin, Park, Sangjoon, Li, Xiang, Yi, Wang, Paly, Jonathan, Efstathiou, Jason, Chan, Annie, Kim, Jun Won, Byun, Hwa Kyung, Lee, Ik Jae, Cho, Jaeho, Wee, Chan Woo, Shu, Peng, Wang, Peilong, Yu, Nathan, Holmes, Jason, Ye, Jong Chul, Li, Quanzheng, Liu, Wei, Koom, Woong Sub, Kim, Jin Sung, Kim, Kyungsang
Clinical experts employ diverse philosophies and strategies in patient care, influenced by regional patient populations. However, existing medical artificial intelligence (AI) models are often trained on data distributions that disproportionately ref
Externí odkaz:
http://arxiv.org/abs/2410.00046
The success of visual instruction tuning has accelerated the development of large language and vision models (LLVMs). Following the scaling laws of instruction-tuned large language models (LLMs), LLVMs either have further increased their sizes, reach
Externí odkaz:
http://arxiv.org/abs/2409.14713
Accurate pain assessment is crucial in healthcare for effective diagnosis and treatment; however, traditional methods relying on self-reporting are inadequate for populations unable to communicate their pain. Cutting-edge AI is promising for supporti
Externí odkaz:
http://arxiv.org/abs/2409.05088
The creation of listener facial responses aims to simulate interactive communication feedback from a listener during a face-to-face conversation. Our goal is to generate believable videos of listeners' heads that respond authentically to a single spe
Externí odkaz:
http://arxiv.org/abs/2409.05089
Autor:
Yeo, Jeong Hun, Kim, Chae Won, Kim, Hyunjun, Rha, Hyeongseop, Han, Seunghee, Cheng, Wen-Huang, Ro, Yong Man
Lip reading aims to predict spoken language by analyzing lip movements. Despite advancements in lip reading technologies, performance degrades when models are applied to unseen speakers due to their sensitivity to variations in visual information suc
Externí odkaz:
http://arxiv.org/abs/2409.00986
Estimation of distribution algorithms (EDAs) constitute a new branch of evolutionary optimization algorithms, providing effective and efficient optimization performance in a variety of research areas. Recent studies have proposed new EDAs that employ
Externí odkaz:
http://arxiv.org/abs/2407.18257
Autor:
Kim, Dae-Won, Lee, Kwang H.
A novel initialization method in the fuzzy c-means (FCM) algorithm is proposed for the color clustering problem. Given a set of color points, the proposed initialization extracts dominant colors that are the most vivid and distinguishable colors. Col
Externí odkaz:
http://arxiv.org/abs/2407.17423
Autor:
Kim, Dae-Won, Lee, Kwang H.
The research interest of this paper is focused on the efficient clustering task for an arbitrary color data. In order to tackle this problem, we have tried to model the inherent uncertainty and vagueness of color data using fuzzy color model. By taki
Externí odkaz:
http://arxiv.org/abs/2407.06782