Zobrazeno 1 - 10
of 102
pro vyhledávání: '"Kawanishi, Yasutomo"'
Multi-label multi-view action recognition aims to recognize multiple concurrent or sequential actions from untrimmed videos captured by multiple cameras. Existing work has focused on multi-view action recognition in a narrow area with strong labels a
Externí odkaz:
http://arxiv.org/abs/2410.03302
Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association
This paper focuses on tracking birds that appear small in a panoramic video. When the size of the tracked object is small in the image (small object tracking) and move quickly, object detection and association suffers. To address these problems, we p
Externí odkaz:
http://arxiv.org/abs/2405.17323
Open-vocabulary Temporal Action Detection (Open-vocab TAD) is an advanced video analysis approach that expands Closed-vocabulary Temporal Action Detection (Closed-vocab TAD) capabilities. Closed-vocab TAD is typically confined to localizing and class
Externí odkaz:
http://arxiv.org/abs/2404.19542
Autor:
Ueda, Nobuhiro, Habe, Hideko, Matsui, Yoko, Yuguchi, Akishige, Kawano, Seiya, Kawanishi, Yasutomo, Kurohashi, Sadao, Yoshino, Koichiro
Understanding expressions that refer to the physical world is crucial for such human-assisting systems in the real world, as robots that must perform actions that are expected by users. In real-world reference resolution, a system must ground the ver
Externí odkaz:
http://arxiv.org/abs/2403.19259
Situated conversations, which refer to visual information as visual question answering (VQA), often contain ambiguities caused by reliance on directive information. This problem is exacerbated because some languages, such as Japanese, often omit subj
Externí odkaz:
http://arxiv.org/abs/2403.17545
Autor:
John, Vijay, Kawanishi, Yasutomo
For training a video-based action recognition model that accepts multi-view video, annotating frame-level labels is tedious and difficult. However, it is relatively easy to annotate sequence-level labels. This kind of coarse annotations are called as
Externí odkaz:
http://arxiv.org/abs/2403.11616
Novel view synthesis has recently made significant progress with the advent of Neural Radiance Fields (NeRF). DietNeRF is an extension of NeRF that aims to achieve this task from only a few images by introducing a new loss function for unknown viewpo
Externí odkaz:
http://arxiv.org/abs/2310.13670
Autor:
Shimajiri, Yoshito, Kawanishi, Yasutomo, Fujita, Shinji, Miyamoto, Yusuke, Ito, Atsushi M., Arzoumanian, Doris, André, Philippe, Nishimura, Atsushi, Tokuda, Kazuki, Kaneko, Hiroyuki, Takekawa, Shunya, Ueda, Shota, Onishi, Toshikazu, Inoue, Tsuyoshi, Nishimoto, Shimpei, Yoneda, Ryuki
The total mass estimate of molecular clouds suffers from the uncertainty in the H$_2$-CO conversion factor, the so-called $X_{\rm CO}$ factor, which is used to convert the $^{12}$CO (1--0) integrated intensity to the H$_2$ column density. We demonstr
Externí odkaz:
http://arxiv.org/abs/2309.07348
Autor:
Kondo, Yuki, Ukita, Norimichi, Yamaguchi, Takayuki, Hou, Hao-Yu, Shen, Mu-Yi, Hsu, Chia-Chi, Huang, En-Ming, Huang, Yu-Chen, Xia, Yu-Cheng, Wang, Chien-Yao, Lee, Chun-Yi, Huo, Da, Kastner, Marc A., Liu, Tingwei, Kawanishi, Yasutomo, Hirayama, Takatsugu, Komamizu, Takahiro, Ide, Ichiro, Shinya, Yosuke, Liu, Xinyao, Liang, Guang, Yasui, Syusuke
Publikováno v:
2023 18th International Conference on Machine Vision and Applications (MVA)
Small Object Detection (SOD) is an important machine vision topic because (i) a variety of real-world applications require object detection for distant objects and (ii) SOD is a challenging task due to the noisy, blurred, and less-informative image a
Externí odkaz:
http://arxiv.org/abs/2307.09143
In this paper, we realize automatic visual recognition and direction estimation of pointing. We introduce the first neural pointing understanding method based on two key contributions. The first is the introduction of a first-of-its-kind large-scale
Externí odkaz:
http://arxiv.org/abs/2304.06977