Zobrazeno 1 - 10
of 251
pro vyhledávání: '"IDE, Ichiro"'
Text-to-image diffusion models sometimes depict blended concepts in the generated images. One promising use case of this effect would be the nonword-to-image generation task which attempts to generate images intuitively imaginable from a non-existing
Externí odkaz:
http://arxiv.org/abs/2411.03595
Multi-label multi-view action recognition aims to recognize multiple concurrent or sequential actions from untrimmed videos captured by multiple cameras. Existing work has focused on multi-view action recognition in a narrow area with strong labels a
Externí odkaz:
http://arxiv.org/abs/2410.03302
Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association
This paper focuses on tracking birds that appear small in a panoramic video. When the size of the tracked object is small in the image (small object tracking) and move quickly, object detection and association suffers. To address these problems, we p
Externí odkaz:
http://arxiv.org/abs/2405.17323
Open-vocabulary Temporal Action Detection (Open-vocab TAD) is an advanced video analysis approach that expands Closed-vocabulary Temporal Action Detection (Closed-vocab TAD) capabilities. Closed-vocab TAD is typically confined to localizing and class
Externí odkaz:
http://arxiv.org/abs/2404.19542
Recipe is a set of instructions that describes how to make food. It can help people from the preparation of ingredients, food cooking process, etc. to prepare the food, and increasingly in demand on the Web. To help users find the vast amount of reci
Externí odkaz:
http://arxiv.org/abs/2310.15593
Autor:
Kondo, Yuki, Ukita, Norimichi, Yamaguchi, Takayuki, Hou, Hao-Yu, Shen, Mu-Yi, Hsu, Chia-Chi, Huang, En-Ming, Huang, Yu-Chen, Xia, Yu-Cheng, Wang, Chien-Yao, Lee, Chun-Yi, Huo, Da, Kastner, Marc A., Liu, Tingwei, Kawanishi, Yasutomo, Hirayama, Takatsugu, Komamizu, Takahiro, Ide, Ichiro, Shinya, Yosuke, Liu, Xinyao, Liang, Guang, Yasui, Syusuke
Publikováno v:
2023 18th International Conference on Machine Vision and Applications (MVA)
Small Object Detection (SOD) is an important machine vision topic because (i) a variety of real-world applications require object detection for distant objects and (ii) SOD is a challenging task due to the noisy, blurred, and less-informative image a
Externí odkaz:
http://arxiv.org/abs/2307.09143
Autor:
Matsuhira, Chihaya, Kastner, Marc A., Komamizu, Takahiro, Hirayama, Takatsugu, Doman, Keisuke, Kawanishi, Yasutomo, Ide, Ichiro
Recently, large-scale Vision and Language (V\&L) pretraining has become the standard backbone of many multimedia systems. While it has shown remarkable performance even in unseen situations, it often performs in ways not intuitive to humans. Particul
Externí odkaz:
http://arxiv.org/abs/2303.03144
Autor:
Nguyen, Trung Thanh, Nguyen, Hoang Dang, Nguyen, Thanh Hung, Pham, Huy Hieu, Ide, Ichiro, Nguyen, Phi Le
Medication mistaking is one of the risks that can result in unpredictable consequences for patients. To mitigate this risk, we develop an automatic system that correctly identifies pill-prescription from mobile images. Specifically, we define a so-ca
Externí odkaz:
http://arxiv.org/abs/2209.01152