Výsledky vyhledávání

Report

DENEB: A Hallucination-Robust Automatic Evaluation Metric for Image Captioning

Autor: Matsuda, Kazuki, Wada, Yuiga, Sugiura, Komei

In this work, we address the challenge of developing automatic evaluation metrics for image captioning, with a particular focus on robustness against hallucinations. Existing metrics are often inadequate for handling hallucinations, primarily due to

Externí odkaz: http://arxiv.org/abs/2409.19255

Zobrazit plný text záznamu

Report

Polos: Multimodal Metric Learning from Human Feedback for Image Captioning

Autor: Wada, Yuiga, Kaneda, Kanta, Saito, Daichi, Sugiura, Komei

Establishing an automatic evaluation metric that closely aligns with human judgments is essential for effectively developing image captioning models. Recent data-driven metrics have demonstrated a stronger correlation with human judgments than classi

Externí odkaz: http://arxiv.org/abs/2402.18091

Zobrazit plný text záznamu

Report

DialMAT: Dialogue-Enabled Transformer with Moment-Based Adversarial Training

Autor: Kaneda, Kanta, Korekata, Ryosuke, Wada, Yuiga, Nagashima, Shunya, Kambara, Motonari, Iioka, Yui, Matsuo, Haruka, Imai, Yuto, Nishimura, Takayuki, Sugiura, Komei

This paper focuses on the DialFRED task, which is the task of embodied instruction following in a setting where an agent can actively ask questions about the task. To address this task, we propose DialMAT. DialMAT introduces Moment-based Adversarial

Externí odkaz: http://arxiv.org/abs/2311.06855

Zobrazit plný text záznamu

Report

JaSPICE: Automatic Evaluation Metric Using Predicate-Argument Structures for Image Captioning Models

Autor: Wada, Yuiga, Kaneda, Kanta, Sugiura, Komei

Image captioning studies heavily rely on automatic evaluation metrics such as BLEU and METEOR. However, such n-gram-based metrics have been shown to correlate poorly with human evaluation, leading to the proposal of alternative metrics such as SPICE

Externí odkaz: http://arxiv.org/abs/2311.04192

Zobrazit plný text záznamu

Report

Multimodal Diffusion Segmentation Model for Object Segmentation from Manipulation Instructions

Autor: Iioka, Yui, Yoshida, Yu, Wada, Yuiga, Hatanaka, Shumpei, Sugiura, Komei

In this study, we aim to develop a model that comprehends a natural language instruction (e.g., "Go to the living room and get the nearest pillow to the radio art on the wall") and generates a segmentation mask for the target everyday object. The tas

Externí odkaz: http://arxiv.org/abs/2307.08597

Zobrazit plný text záznamu

Periodical

Determining ball trajectory on a work plate held by a dual-arm SCARA robot using IMU sensor data

Autor: Tsukamoto, Hideaki, Mita, Yuma, Hanai, Hiroaki, Wada, Yuiga, Nakagawa, Masao, Hirogaki, Toshiki, Aoyama, Eiichi

Publikováno v: Proceedings of SPIE; October 2024, Vol. 13290 Issue: 1 p132900D-132900D-7, 1196108p

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání