Search results - "Kim, DongHyun"

Report

Lessons Learned from Developing a Human-Centered Guide Dog Robot for Mobility Assistance

Authors: Hwang, Hochul; Suzuki, Ken; Giudice, Nicholas A; Biswas, Joydeep; Lee, Sunghoon Ivan; Kim, Donghyun

While guide dogs offer essential mobility assistance, their high cost, limited availability, and care requirements make them inaccessible to most blind or low vision (BLV) individuals. Recent advances in quadruped robots provide a scalable solution f

External link: http://arxiv.org/abs/2409.19778

Report

Synthetic data augmentation for robotic mobility aids to support blind and low vision people

Authors: Hwang, Hochul; Adhikari, Krisha; Shodhaka, Satya; Kim, Donghyun

Robotic mobility aids for blind and low-vision (BLV) individuals rely heavily on deep learning-based vision models specialized for various navigational tasks. However, the performance of these models is often constrained by the availability and diver

External link: http://arxiv.org/abs/2409.11164

Report

Extending the science fiction and the Loehr--Warrington formula

Authors: Kim, Donghyun; Oh, Jaeseong

We introduce the Macdonald piece polynomial $\operatorname{I}_{\mu,\lambda,k}[X;q,t]$, which is a vast generalization of the Macdonald intersection polynomial in the science fiction conjecture by Bergeron and Garsia. We demonstrate a remarkable conne

External link: http://arxiv.org/abs/2409.01041

Report

Efficient and Versatile Robust Fine-Tuning of Zero-shot Models

Authors: Kim, Sungyeon; Jeong, Boseung; Kim, Donghyun; Kwak, Suha

Large-scale image-text pre-trained models enable zero-shot classification and provide consistent accuracy across various data distributions. Nonetheless, optimizing these models in downstream tasks typically requires fine-tuning, which reduces genera

External link: http://arxiv.org/abs/2408.05749

Report

Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection

Authors: Park, Kwanyong; Saito, Kuniaki; Kim, Donghyun

Vision-language (VL) models often exhibit a limited understanding of complex expressions of visual objects (e.g., attributes, shapes, and their relations), given complex and diverse language queries. Traditional approaches attempt to improve VL model

External link: http://arxiv.org/abs/2407.15296

Report

A Biomechanics-Inspired Approach to Soccer Kicking for Humanoid Robots

Authors: Marew, Daniel; Perera, Nisal; Yu, Shangqun; Roelker, Sarah; Kim, Donghyun

Soccer kicking is a complex whole-body motion that requires intricate coordination of various motor actions. To accomplish such dynamic motion in a humanoid robot, the robot needs to simultaneously: 1) transfer high kinetic energy to the kicking leg,

External link: http://arxiv.org/abs/2407.14612

Report

NDST: Neural Driving Style Transfer for Human-Like Vision-Based Autonomous Driving

Authors: Kim, Donghyun; Khalil, Aws; Nam, Haewoon; Kwon, Jaerock

Autonomous Vehicles (AV) and Advanced Driver Assistant Systems (ADAS) prioritize safety over comfort. The intertwining factors of safety and comfort emerge as pivotal elements in ensuring the effectiveness of Autonomous Driving (AD). Users often expe

External link: http://arxiv.org/abs/2407.08073

Report

FALCON: Frequency Adjoint Link with CONtinuous Density Mask for Fast Single Image Dehazing

Authors: Kim, Donghyun; Kang, Seil; Hwang, Seong Jae

Image dehazing, addressing atmospheric interference like fog and haze, remains a pervasive challenge crucial for robust vision applications such as surveillance and remote sensing under adverse visibility. While various methodologies have evolved fro

External link: http://arxiv.org/abs/2407.00972

Report

MATE: Meet At The Embedding -- Connecting Images with Long Texts

Authors: Jang, Young Kyun; Kang, Junmo; Lee, Yong Jae; Kim, Donghyun

While advancements in Vision Language Models (VLMs) have significantly improved the alignment of visual and textual data, these models primarily focus on aligning images with short descriptive captions. This focus limits their ability to handle compl

External link: http://arxiv.org/abs/2407.09541

Report

Too Many Frames, Not All Useful: Efficient Strategies for Long-Form Video QA

Authors: Park, Jongwoo; Ranasinghe, Kanchana; Kahatapitiya, Kumara; Ryoo, Wonjeong; Kim, Donghyun; Ryoo, Michael S.

Long-form videos that span across wide temporal intervals are highly information redundant and contain multiple distinct events or entities that are often loosely related. Therefore, when performing long-form video question answering (LVQA), all info

External link: http://arxiv.org/abs/2406.09396
