Výsledky vyhledávání - "Furuta, Ryosuke"

Report

ActionVOS: Actions as Prompts for Video Object Segmentation

Autor: Ouyang, Liangyang, Liu, Ruicong, Huang, Yifei, Furuta, Ryosuke, Sato, Yoichi

Delving into the realm of egocentric vision, the advancement of referring video object segmentation (RVOS) stands as pivotal in understanding human activities. However, existing RVOS task primarily relies on static attributes such as object names to

Externí odkaz: http://arxiv.org/abs/2407.07402

Zobrazit plný text záznamu

Report

Learning Object States from Actions via Large Language Models

Autor: Tateno, Masatoshi, Yagi, Takuma, Furuta, Ryosuke, Sato, Yoichi

Temporally localizing the presence of object states in videos is crucial in understanding human activities beyond actions and objects. This task has suffered from a lack of training data due to object states' inherent ambiguity and variety. To avoid

Externí odkaz: http://arxiv.org/abs/2405.01090

Zobrazit plný text záznamu

Report

On a homology of foliations defined by non-singular Morse-Smale flows

Autor: Akizawa, Masato, Furuta, Ryosuke, Miyoshi, Shigeaki

We propose a definition of a homology of a one-dimensional foliation defined by a non-singular Morse-Smale flow. We also show the calculation of the homology of such a foliation which is naturally associated with Seifert fibration.

Externí odkaz: http://arxiv.org/abs/2402.01387

Zobrazit plný text záznamu

Report

FineBio: A Fine-Grained Video Dataset of Biological Experiments with Hierarchical Annotation

Autor: Yagi, Takuma, Ohashi, Misaki, Huang, Yifei, Furuta, Ryosuke, Adachi, Shungo, Mitsuyama, Toutai, Sato, Yoichi

In the development of science, accurate and reproducible documentation of the experimental process is crucial. Automatic recognition of the actions in experiments from videos would help experimenters by complementing the recording of experiments. Tow

Externí odkaz: http://arxiv.org/abs/2402.00293

Zobrazit plný text záznamu

Report

Ego-Exo4D: Understanding Skilled Human Activity from First- and Third-Person Perspectives

Autor: Grauman, Kristen, Westbury, Andrew, Torresani, Lorenzo, Kitani, Kris, Malik, Jitendra, Afouras, Triantafyllos, Ashutosh, Kumar, Baiyya, Vijay, Bansal, Siddhant, Boote, Bikram, Byrne, Eugene, Chavis, Zach, Chen, Joya, Cheng, Feng, Chu, Fu-Jen, Crane, Sean, Dasgupta, Avijit, Dong, Jing, Escobar, Maria, Forigua, Cristhian, Gebreselasie, Abrham, Haresh, Sanjay, Huang, Jing, Islam, Md Mohaiminul, Jain, Suyog, Khirodkar, Rawal, Kukreja, Devansh, Liang, Kevin J, Liu, Jia-Wei, Majumder, Sagnik, Mao, Yongsen, Martin, Miguel, Mavroudi, Effrosyni, Nagarajan, Tushar, Ragusa, Francesco, Ramakrishnan, Santhosh Kumar, Seminara, Luigi, Somayazulu, Arjun, Song, Yale, Su, Shan, Xue, Zihui, Zhang, Edward, Zhang, Jinxu, Castillo, Angela, Chen, Changan, Fu, Xinzhu, Furuta, Ryosuke, Gonzalez, Cristina, Gupta, Prince, Hu, Jiabo, Huang, Yifei, Huang, Yiming, Khoo, Weslie, Kumar, Anush, Kuo, Robert, Lakhavani, Sach, Liu, Miao, Luo, Mi, Luo, Zhengyi, Meredith, Brighid, Miller, Austin, Oguntola, Oluwatumininu, Pan, Xiaqing, Peng, Penny, Pramanick, Shraman, Ramazanova, Merey, Ryan, Fiona, Shan, Wei, Somasundaram, Kiran, Song, Chenan, Southerland, Audrey, Tateno, Masatoshi, Wang, Huiyu, Wang, Yuchen, Yagi, Takuma, Yan, Mingfei, Yang, Xitong, Yu, Zecheng, Zha, Shengxin Cindy, Zhao, Chen, Zhao, Ziwei, Zhu, Zhifan, Zhuo, Jeff, Arbelaez, Pablo, Bertasius, Gedas, Crandall, David, Damen, Dima, Engel, Jakob, Farinella, Giovanni Maria, Furnari, Antonino, Ghanem, Bernard, Hoffman, Judy, Jawahar, C. V., Newcombe, Richard, Park, Hyun Soo, Rehg, James M., Sato, Yoichi, Savva, Manolis, Shi, Jianbo, Shou, Mike Zheng, Wray, Michael

We present Ego-Exo4D, a diverse, large-scale multimodal multiview video dataset and benchmark challenge. Ego-Exo4D centers around simultaneously-captured egocentric and exocentric video of skilled human activities (e.g., sports, music, dance, bike re

Externí odkaz: http://arxiv.org/abs/2311.18259

Zobrazit plný text záznamu

Report

Seeking Flat Minima with Mean Teacher on Semi- and Weakly-Supervised Domain Generalization for Object Detection

Autor: Furuta, Ryosuke, Sato, Yoichi

Object detectors do not work well when domains largely differ between training and testing data. To overcome this domain gap in object detection without requiring expensive annotations, we consider two problem settings: semi-supervised domain general

Externí odkaz: http://arxiv.org/abs/2310.19351

Zobrazit plný text záznamu

Report

Proposal-based Temporal Action Localization with Point-level Supervision

Autor: Yin, Yuan, Huang, Yifei, Furuta, Ryosuke, Sato, Yoichi

Point-level supervised temporal action localization (PTAL) aims at recognizing and localizing actions in untrimmed videos where only a single point (frame) within every action instance is annotated in training data. Without temporal annotations, most

Externí odkaz: http://arxiv.org/abs/2310.05511

Zobrazit plný text záznamu

Report

Fine-grained Affordance Annotation for Egocentric Hand-Object Interaction Videos

Autor: Yu, Zecheng, Huang, Yifei, Furuta, Ryosuke, Yagi, Takuma, Goutsu, Yusuke, Sato, Yoichi

Object affordance is an important concept in hand-object interaction, providing information on action possibilities based on human motor capacity and objects' physical property thus benefiting tasks such as action anticipation and robot imitation lea

Externí odkaz: http://arxiv.org/abs/2302.03292

Zobrazit plný text záznamu

Report

Precise Affordance Annotation for Egocentric Action Video Datasets

Autor: Yu, Zecheng, Huang, Yifei, Furuta, Ryosuke, Yagi, Takuma, Goutsu, Yusuke, Sato, Yoichi

Object affordance is an important concept in human-object interaction, providing information on action possibilities based on human motor capacity and objects' physical property thus benefiting tasks such as action anticipation and robot imitation le

Externí odkaz: http://arxiv.org/abs/2206.05424

Zobrazit plný text záznamu

Report

Efficient Annotation and Learning for 3D Hand Pose Estimation: A Survey

Autor: Ohkawa, Takehiko, Furuta, Ryosuke, Sato, Yoichi

In this survey, we present a systematic review of 3D hand pose estimation from the perspective of efficient annotation and learning. 3D hand pose estimation has been an important research area owing to its potential to enable various applications, su

Externí odkaz: http://arxiv.org/abs/2206.02257

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání