Showing 1 - 10 of 192 for search: '"Shin, Minchul"'
Human-Object Interaction (HOI) detection is the task of identifying a set of triplets from an image. Recent work proposed transformer encoder-decoder architectures that successfully eliminated the need for many hand-desig
External link:
http://arxiv.org/abs/2203.14709
Author:
Oh, Changdae, So, Junhyuk, Byun, Hoyoon, Lim, YongTaek, Shin, Minchul, Jeon, Jong-June, Song, Kyungwoo
Pre-trained multi-modal models, such as CLIP, provide transferable embeddings and show promising results in diverse applications. However, the analysis of learned multi-modal embeddings is relatively unexplored, and the embedding transferability can
External link:
http://arxiv.org/abs/2203.03897
Author:
Park, Hoonmin, Shin, Minchul, Choi, Gyubok, Sim, Yuseop, Lee, Jiho, Yun, Huitaek, Jun, Martin Byung-Guk, Kim, Gyuman, Jeong, Younghun, Yi, Hak
Published in:
In Robotics and Computer-Integrated Manufacturing, October 2024, 89
Author:
Mun, Jonghwan, Shin, Minchul, Han, Gunsoo, Lee, Sangho, Ha, Seongsu, Lee, Joonseok, Kim, Eun-Sol
Self-supervised learning has drawn attention through its effectiveness in learning in-domain representations with no ground-truth annotations; in particular, it is shown that properly designed pretext tasks (e.g., contrastive prediction task) bring s
External link:
http://arxiv.org/abs/2201.05277
We consider the Bayesian analysis of models in which the unknown distribution of the outcomes is specified up to a set of conditional moment restrictions. The nonparametric exponentially tilted empirical likelihood function is constructed to satisfy
External link:
http://arxiv.org/abs/2110.13531
The VALUE (Video-And-Language Understanding Evaluation) benchmark is newly introduced to evaluate and analyze multi-modal representation learning algorithms on three video-and-language tasks: Retrieval, QA, and Captioning. The main objective of the V
External link:
http://arxiv.org/abs/2110.06476
Previous deep learning-based line segment detection (LSD) methods suffer from immense model size and high computational cost for line prediction. This prevents them from running real-time inference in computationally restricted environments. In this paper, we
External link:
http://arxiv.org/abs/2106.00186
In this paper, we study the compositional learning of images and texts for image retrieval. The query is given in the form of an image and text that describes the desired modifications to the image; the goal is to retrieve the target image that satis
External link:
http://arxiv.org/abs/2104.03015
We propose methods for constructing regularized mixtures of density forecasts. We explore a variety of objectives and regularization penalties, and we use them in a substantive exploration of Eurozone inflation and real interest rate density forecast
External link:
http://arxiv.org/abs/2012.11649
Author:
Shin, Minchul
This paper presents a study on semi-supervised learning to solve the visual attribute prediction problem. In many applications of vision algorithms, the precise recognition of visual attributes of objects is important but still challenging. This is b
External link:
http://arxiv.org/abs/2007.06769