Zobrazeno 1 - 10
of 874 393
pro vyhledávání: '"An, Hyun"'
Vision language models (VLMs) have shown promising reasoning capabilities across various benchmarks; however, our understanding of their visual perception remains limited. In this work, we propose an eye examination process to investigate how a VLM p
Externí odkaz:
http://arxiv.org/abs/2409.14759
We propose a novel machine learning approach for inferring causal variables of a target variable from observations. Our goal is to identify both direct and indirect causes within a system, thereby efficiently regulating the target variable when the d
Externí odkaz:
http://arxiv.org/abs/2408.16218
Autor:
Sun, Seungjong, Lee, Eungu, Baek, Seo Yeon, Hwang, Seunghyun, Lee, Wonbyung, Nan, Dongyan, Jansen, Bernard J., Kim, Jang Hyun
This study is the first to explore whether multi-modal large language models (LLMs) can align their behaviors with visual personas, addressing a significant gap in the literature that predominantly focuses on text-based personas. We developed a novel
Externí odkaz:
http://arxiv.org/abs/2410.03181
We propose a new fast algorithm optimized for full-wave electromagnetic (EM) scattering analysis of a large-scale cloud of chaffs with arbitrary orientation, spatial distribution, and length. By leveraging the unique EM scattering characteristics in
Externí odkaz:
http://arxiv.org/abs/2410.03060
Autor:
Simson, Walter, Zhuang, Louise, Frey, Benjamin N., Sanabria, Sergio J., Dahl, Jeremy J., Hyun, Dongwoon
Wavefield imaging reconstructs physical properties from wavefield measurements across an aperture, using modalities like radar, optics, sonar, seismic, and ultrasound imaging. Propagation of a wavefront from unknown sources through heterogeneous medi
Externí odkaz:
http://arxiv.org/abs/2410.03008
While language models have demonstrated impressive capabilities across a range of tasks, they still struggle with tasks that require complex planning and reasoning. Recent studies have proposed training language models on search processes rather than
Externí odkaz:
http://arxiv.org/abs/2410.02992
Let $G$ be a countable group with no finitely generated subgroup of exponential growth. We show that every action of $G$ on a countable set preserving a linear (respectively, circular) order can be realised as the restriction of some action by $C^1$
Externí odkaz:
http://arxiv.org/abs/2410.02614
Novel view synthesis (NVS) aims to generate images at arbitrary viewpoints using multi-view images, and recent insights from neural radiance fields (NeRF) have contributed to remarkable improvements. Recently, studies on generalizable NeRF (G-NeRF) h
Externí odkaz:
http://arxiv.org/abs/2410.00672
Autor:
Park, Sang Hyun, Koh, Jun Young, Lee, Junha, Song, Joy, Kim, Dongha, Moon, Hoyeon, Lee, Hyunju, Song, Min
In this work, we share the insights for achieving state-of-the-art quality in our text-to-image anime image generative model, called Illustrious. To achieve high resolution, dynamic color range images, and high restoration ability, we focus on three
Externí odkaz:
http://arxiv.org/abs/2409.19946
Autor:
Nahrendra, I Made Aswin, Yu, Byeongho, Oh, Minho, Lee, Dongkyu, Lee, Seunghyun, Lee, Hyeonwoo, Lim, Hyungtae, Myung, Hyun
Quadrupedal robots hold promising potential for applications in navigating cluttered environments with resilience akin to their animal counterparts. However, their floating base configuration makes them vulnerable to real-world uncertainties, yieldin
Externí odkaz:
http://arxiv.org/abs/2409.19709