Zobrazeno 1 - 10
of 1 456
pro vyhledávání: '"Tanaka, Ryota"'
Understanding human actions from videos is essential in many domains, including sports. In figure skating, technical judgments are performed by watching skaters' 3D movements, and its part of the judging procedure can be regarded as a Temporal Action
Externí odkaz:
http://arxiv.org/abs/2408.16638
We explore visual prompt injection (VPI) that maliciously exploits the ability of large vision-language models (LVLMs) to follow instructions drawn onto the input image. We propose a new VPI method, "goal hijacking via visual prompt injection" (GHVPI
Externí odkaz:
http://arxiv.org/abs/2408.03554
Autor:
Sakaki, Makoto, Tanaka, Ryota
We discuss translation minimal surfaces, homothetical minimal surfaces, and separable minimal surfaces in the $3$-space with $2m$-norm.
Comment: 15 pages
Comment: 15 pages
Externí odkaz:
http://arxiv.org/abs/2407.08896
We study the problem of completing various visual document understanding (VDU) tasks, e.g., question answering and information extraction, on real-world documents through human-written instructions. To this end, we propose InstructDoc, the first larg
Externí odkaz:
http://arxiv.org/abs/2401.13313
Automatic evaluating systems are fundamental issues in sports technologies. In many sports, such as figure skating, automated evaluating methods based on pose estimation have been proposed. However, previous studies have evaluated skaters' skills in
Externí odkaz:
http://arxiv.org/abs/2310.17193
Autor:
Tanaka, Ryota, Nishida, Kyosuke, Nishida, Kosuke, Hasegawa, Taku, Saito, Itsumi, Saito, Kuniko
Visual question answering on document images that contain textual, visual, and layout information, called document VQA, has received much attention recently. Although many datasets have been proposed for developing document VQA systems, most of the e
Externí odkaz:
http://arxiv.org/abs/2301.04883
Autor:
Tanaka, Ryota1 (AUTHOR), Tamao, Kenji1 (AUTHOR), Ono, Mana1 (AUTHOR), Yamayoshi, Seiya2,3 (AUTHOR), Kawaoka, Yoshihiro2,3,4,5 (AUTHOR), Su'etsugu, Masayuki6 (AUTHOR), Noji, Hiroyuki1 (AUTHOR), Tabata, Kazuhito V.1 (AUTHOR) tabatak@g.ecc.u-tokyo.ac.jp
Publikováno v:
PLoS ONE. 11/8/2024, Vol. 19 Issue 11, p1-13. 13p.
Autor:
Okuhira, Ryuta, Higashino, Nobuyuki, Sonomura, Tetsuo, Fukuda, Kodai, Koike, Masataka, Kamisako, Atsufumi, Tanaka, Ryota, Koyama, Takao, Sato, Hirotatsu, Ikoma, Akira, Minamiguchi, Hiroki
Publikováno v:
In Journal of Vascular and Interventional Radiology March 2024 35(3):462-468
Recent studies on machine reading comprehension have focused on text-level understanding but have not yet reached the level of human understanding of the visual layout and content of real-world documents. In this study, we introduce a new visual mach
Externí odkaz:
http://arxiv.org/abs/2101.11272
Autor:
Yoshijima, Chisato, Suzuki, Yosuke, Tanaka, Ryota, Ono, Hiroyuki, Oda, Ayako, Ozaki, Takashi, Shibata, Hirotaka, Itoh, Hiroki, Ohno, Keiko
Publikováno v:
In Clinical Biochemistry February 2024 124