Zobrazeno 1 - 10
of 70
pro vyhledávání: '"Kim, Jung Uk"'
Abstract. The advancement of deep learning has coincided with the proliferation of both models and available data. The surge in dataset sizes and the subsequent surge in computational requirements have led to the development of the Dataset Condensati
Externí odkaz:
http://arxiv.org/abs/2409.14538
Monocular 3D object detection is an important challenging task in autonomous driving. Existing methods mainly focus on performing 3D detection in ideal weather conditions, characterized by scenarios with clear and optimal visibility. However, the cha
Externí odkaz:
http://arxiv.org/abs/2407.16448
Recent Audio-Visual Question Answering (AVQA) methods rely on complete visual and audio input to answer questions accurately. However, in real-world scenarios, issues such as device malfunctions and data transmission errors frequently result in missi
Externí odkaz:
http://arxiv.org/abs/2407.16171
The goal of the multi-sound source localization task is to localize sound sources from the mixture individually. While recent multi-sound source localization methods have shown improved performance, they face challenges due to their reliance on prior
Externí odkaz:
http://arxiv.org/abs/2403.17420
Continual learning aims to learn a model from a continuous stream of data, but it mainly assumes a fixed number of data and tasks with clear task boundaries. However, in real-world scenarios, the number of input data and tasks is constantly changing
Externí odkaz:
http://arxiv.org/abs/2308.09303
The objective of the sound source localization task is to enable machines to detect the location of sound-making objects within a visual scene. While the audio modality provides spatial cues to locate the sound source, existing approaches only use au
Externí odkaz:
http://arxiv.org/abs/2308.06087
Autor:
Zhang, Chaoning, Han, Dongshen, Qiao, Yu, Kim, Jung Uk, Bae, Sung-Ho, Lee, Seungkyu, Hong, Choong Seon
Segment Anything Model (SAM) has attracted significant attention due to its impressive zero-shot transfer performance and high versatility for numerous vision applications (like image editing with fine-grained control). Many of such applications need
Externí odkaz:
http://arxiv.org/abs/2306.14289
Autor:
Zhang, Chaoning, Zhang, Chenshuang, Li, Chenghao, Qiao, Yu, Zheng, Sheng, Dam, Sumit Kumar, Zhang, Mengchun, Kim, Jung Uk, Kim, Seong Tae, Choi, Jinwoo, Park, Gyeong-Moon, Bae, Sung-Ho, Lee, Lik-Hang, Hui, Pan, Kweon, In So, Hong, Choong Seon
OpenAI has recently released GPT-4 (a.k.a. ChatGPT plus), which is demonstrated to be one small step for generative AI (GAI), but one giant leap for artificial general intelligence (AGI). Since its official release in November 2022, ChatGPT has quick
Externí odkaz:
http://arxiv.org/abs/2304.06488
Autor:
Kim, Jung Uk1, Wang, Pei Wei1 qodnl0810@naver.com
Publikováno v:
Clinics in Shoulder & Elbow. Jun2024, Vol. 27 Issue 2, p263-266. 4p.
Recent advances in facial expression synthesis have shown promising results using diverse expression representations including facial action units. Facial action units for an elaborate facial expression synthesis need to be intuitively represented fo
Externí odkaz:
http://arxiv.org/abs/2007.08154