Výsledky vyhledávání - "Kawanishi, Yasutomo"

Report

Action Selection Learning for Multi-label Multi-view Action Recognition

Autor: Nguyen, Trung Thanh, Kawanishi, Yasutomo, Komamizu, Takahiro, Ide, Ichiro

Multi-label multi-view action recognition aims to recognize multiple concurrent or sequential actions from untrimmed videos captured by multiple cameras. Existing work has focused on multi-view action recognition in a narrow area with strong labels a

Externí odkaz: http://arxiv.org/abs/2410.03302

Zobrazit plný text záznamu

Report

Tracking Small Birds by Detection Candidate Region Filtering and Detection History-aware Association

Autor: Liu, Tingwei, Kawanishi, Yasutomo, Komamizu, Takahiro, Ide, Ichiro

This paper focuses on tracking birds that appear small in a panoramic video. When the size of the tracked object is small in the image (small object tracking) and move quickly, object detection and association suffers. To address these problems, we p

Externí odkaz: http://arxiv.org/abs/2405.17323

Zobrazit plný text záznamu

Report

One-Stage Open-Vocabulary Temporal Action Detection Leveraging Temporal Multi-scale and Action Label Features

Autor: Nguyen, Trung Thanh, Kawanishi, Yasutomo, Komamizu, Takahiro, Ide, Ichiro

Open-vocabulary Temporal Action Detection (Open-vocab TAD) is an advanced video analysis approach that expands Closed-vocabulary Temporal Action Detection (Closed-vocab TAD) capabilities. Closed-vocab TAD is typically confined to localizing and class

Externí odkaz: http://arxiv.org/abs/2404.19542

Zobrazit plný text záznamu

Report

J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution

Autor: Ueda, Nobuhiro, Habe, Hideko, Matsui, Yoko, Yuguchi, Akishige, Kawano, Seiya, Kawanishi, Yasutomo, Kurohashi, Sadao, Yoshino, Koichiro

Understanding expressions that refer to the physical world is crucial for such human-assisting systems in the real world, as robots that must perform actions that are expected by users. In real-world reference resolution, a system must ground the ver

Externí odkaz: http://arxiv.org/abs/2403.19259

Zobrazit plný text záznamu

Report

A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions

Autor: Inadumi, Shun, Kawano, Seiya, Yuguchi, Akishige, Kawanishi, Yasutomo, Yoshino, Koichiro

Situated conversations, which refer to visual information as visual question answering (VQA), often contain ambiguities caused by reliance on directive information. This problem is exacerbated because some languages, such as Japanese, often omit subj

Externí odkaz: http://arxiv.org/abs/2403.17545

Zobrazit plný text záznamu

Report

Multi-View Video-Based Learning: Leveraging Weak Labels for Frame-Level Perception

Autor: John, Vijay, Kawanishi, Yasutomo

For training a video-based action recognition model that accepts multi-view video, annotating frame-level labels is tedious and difficult. However, it is relatively easy to annotate sequence-level labels. This kind of coarse annotations are called as

Externí odkaz: http://arxiv.org/abs/2403.11616

Zobrazit plný text záznamu

Report

ManifoldNeRF: View-dependent Image Feature Supervision for Few-shot Neural Radiance Fields

Autor: Kanaoka, Daiju, Sonogashira, Motoharu, Tamukoh, Hakaru, Kawanishi, Yasutomo

Novel view synthesis has recently made significant progress with the advent of Neural Radiance Fields (NeRF). DietNeRF is an extension of NeRF that aims to achieve this task from only a few images by introducing a new loss function for unknown viewpo

Externí odkaz: http://arxiv.org/abs/2310.13670

Zobrazit plný text záznamu

Report

Predicting reliable H$_2$ column density maps from molecular line data using machine learning

Autor: Shimajiri, Yoshito, Kawanishi, Yasutomo, Fujita, Shinji, Miyamoto, Yusuke, Ito, Atsushi M., Arzoumanian, Doris, André, Philippe, Nishimura, Atsushi, Tokuda, Kazuki, Kaneko, Hiroyuki, Takekawa, Shunya, Ueda, Shota, Onishi, Toshikazu, Inoue, Tsuyoshi, Nishimoto, Shimpei, Yoneda, Ryuki

The total mass estimate of molecular clouds suffers from the uncertainty in the H$_2$-CO conversion factor, the so-called $X_{\rm CO}$ factor, which is used to convert the $^{12}$CO (1--0) integrated intensity to the H$_2$ column density. We demonstr

Externí odkaz: http://arxiv.org/abs/2309.07348

Zobrazit plný text záznamu

Report

MVA2023 Small Object Detection Challenge for Spotting Birds: Dataset, Methods, and Results

Publikováno v: 2023 18th International Conference on Machine Vision and Applications (MVA)

Small Object Detection (SOD) is an important machine vision topic because (i) a variety of real-world applications require object detection for distant objects and (ii) SOD is a challenging task due to the noisy, blurred, and less-informative image a

Externí odkaz: http://arxiv.org/abs/2307.09143

Zobrazit plný text záznamu

Report

DeePoint: Visual Pointing Recognition and Direction Estimation

Autor: Nakamura, Shu, Kawanishi, Yasutomo, Nobuhara, Shohei, Nishino, Ko

In this paper, we realize automatic visual recognition and direction estimation of pointing. We introduce the first neural pointing understanding method based on two key contributions. The first is the introduction of a first-of-its-kind large-scale

Externí odkaz: http://arxiv.org/abs/2304.06977

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání