Zobrazeno 1 - 4
of 4
pro vyhledávání: '"Jang, Jinhyun"'
Visual scenes are naturally organized in a hierarchy, where a coarse semantic is recursively comprised of several fine details. Exploring such a visual hierarchy is crucial to recognize the complex relations of visual elements, leading to a comprehen
Externí odkaz:
http://arxiv.org/abs/2404.00974
Aerial-to-ground image synthesis is an emerging and challenging problem that aims to synthesize a ground image from an aerial image. Due to the highly different layout and object representation between the aerial and ground images, existing approache
Externí odkaz:
http://arxiv.org/abs/2308.06945
Recent DETR-based video grounding models have made the model directly predict moment timestamps without any hand-crafted components, such as a pre-defined proposal or non-maximum suppression, by learning moment queries. However, their input-agnostic
Externí odkaz:
http://arxiv.org/abs/2308.06947
Recent progress in deterministic prompt learning has become a promising alternative to various downstream vision tasks, enabling models to learn powerful visual representations with the help of pre-trained vision-language models. However, this approa
Externí odkaz:
http://arxiv.org/abs/2304.00779