Výsledky vyhledávání

Report

VLM's Eye Examination: Instruct and Inspect Visual Competency of Vision Language Models

Autor: Hyeon-Woo, Nam, Ye-Bin, Moon, Choi, Wonseok, Hyun, Lee, Oh, Tae-Hyun

Vision language models (VLMs) have shown promising reasoning capabilities across various benchmarks; however, our understanding of their visual perception remains limited. In this work, we propose an eye examination process to investigate how a VLM p

Externí odkaz: http://arxiv.org/abs/2409.14759

Zobrazit plný text záznamu

Report

Targeted Cause Discovery with Data-Driven Learning

Autor: Kim, Jang-Hyun, Gibbs, Claudia Skok, Yun, Sangdoo, Song, Hyun Oh, Cho, Kyunghyun

We propose a novel machine learning approach for inferring causal variables of a target variable from observations. Our goal is to identify both direct and indirect causes within a system, thereby efficiently regulating the target variable when the d

Externí odkaz: http://arxiv.org/abs/2408.16218

Zobrazit plný text záznamu

Report

Kiss up, Kick down: Exploring Behavioral Changes in Multi-modal Large Language Models with Assigned Visual Personas

Autor: Sun, Seungjong, Lee, Eungu, Baek, Seo Yeon, Hwang, Seunghyun, Lee, Wonbyung, Nan, Dongyan, Jansen, Bernard J., Kim, Jang Hyun

This study is the first to explore whether multi-modal large language models (LLMs) can align their behaviors with visual personas, addressing a significant gap in the literature that predominantly focuses on text-based personas. We developed a novel

Externí odkaz: http://arxiv.org/abs/2410.03181

Zobrazit plný text záznamu

Report

Fast Algorithm for Full-wave EM Scattering Analysis of Large-scale Chaff Cloud with Arbitrary Orientation, Spatial Distribution, and Length

Autor: Lee, Chung Hyun, Kang, Dong-Kook, Kwon, Kyoung Il, Kim, Kyung-Tae, Na, Dong-Yeop

We propose a new fast algorithm optimized for full-wave electromagnetic (EM) scattering analysis of a large-scale cloud of chaffs with arbitrary orientation, spatial distribution, and length. By leveraging the unique EM scattering characteristics in

Externí odkaz: http://arxiv.org/abs/2410.03060

Zobrazit plný text záznamu

Report

Ultrasound Autofocusing: Common Midpoint Phase Error Optimization via Differentiable Beamforming

Autor: Simson, Walter, Zhuang, Louise, Frey, Benjamin N., Sanabria, Sergio J., Dahl, Jeremy J., Hyun, Dongwoon

Wavefield imaging reconstructs physical properties from wavefield measurements across an aperture, using modalities like radar, optics, sonar, seismic, and ultrasound imaging. Propagation of a wavefront from unknown sources through heterogeneous medi

Externí odkaz: http://arxiv.org/abs/2410.03008

Zobrazit plný text záznamu

Report

Guided Stream of Search: Learning to Better Search with Language Models via Optimal Path Guidance

Autor: Moon, Seungyong, Park, Bumsoo, Song, Hyun Oh

While language models have demonstrated impressive capabilities across a range of tasks, they still struggle with tasks that require complex planning and reasoning. Recent studies have proposed training language models on search processes rather than

Externí odkaz: http://arxiv.org/abs/2410.02992

Zobrazit plný text záznamu

Report

Subexponential growth and $C^1$ actions on one-manifolds

Autor: Kim, Sang-hyun, Bon, Nicolás Matte, de la Salle, Mikael, Triestino, Michele

Let $G$ be a countable group with no finitely generated subgroup of exponential growth. We show that every action of $G$ on a countable set preserving a linear (respectively, circular) order can be realised as the restriction of some action by $C^1$

Externí odkaz: http://arxiv.org/abs/2410.02614

Zobrazit plný text záznamu

Report

GMT: Enhancing Generalizable Neural Rendering via Geometry-Driven Multi-Reference Texture Transfer

Autor: Yoon, Youngho, Jang, Hyun-Kurl, Yoon, Kuk-Jin

Novel view synthesis (NVS) aims to generate images at arbitrary viewpoints using multi-view images, and recent insights from neural radiance fields (NeRF) have contributed to remarkable improvements. Recently, studies on generalizable NeRF (G-NeRF) h

Externí odkaz: http://arxiv.org/abs/2410.00672

Zobrazit plný text záznamu

Report

Illustrious: an Open Advanced Illustration Model

Autor: Park, Sang Hyun, Koh, Jun Young, Lee, Junha, Song, Joy, Kim, Dongha, Moon, Hoyeon, Lee, Hyunju, Song, Min

In this work, we share the insights for achieving state-of-the-art quality in our text-to-image anime image generative model, called Illustrious. To achieve high resolution, dynamic color range images, and high restoration ability, we focus on three

Externí odkaz: http://arxiv.org/abs/2409.19946

Zobrazit plný text záznamu

Report

Obstacle-Aware Quadrupedal Locomotion With Resilient Multi-Modal Reinforcement Learning

Autor: Nahrendra, I Made Aswin, Yu, Byeongho, Oh, Minho, Lee, Dongkyu, Lee, Seunghyun, Lee, Hyeonwoo, Lim, Hyungtae, Myung, Hyun

Quadrupedal robots hold promising potential for applications in navigating cluttered environments with resilience akin to their animal counterparts. However, their floating base configuration makes them vulnerable to real-world uncertainties, yieldin

Externí odkaz: http://arxiv.org/abs/2409.19709

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání