Zobrazeno 1 - 10
of 81
pro vyhledávání: '"Zhang, Yisi"'
Autor:
Ding, Henghui, Hong, Lingyi, Liu, Chang, Xu, Ning, Yang, Linjie, Fan, Yuchen, Miao, Deshui, Gu, Yameng, Li, Xin, He, Zhenyu, Wang, Yaowei, Yang, Ming-Hsuan, Chai, Jinming, Ma, Qin, Zhang, Junpei, Jiao, Licheng, Liu, Fang, Liu, Xinyu, Zhang, Jing, Zhang, Kexin, Liu, Xu, Li, LingLing, Fang, Hao, Pan, Feiyu, Lu, Xiankai, Zhang, Wei, Cong, Runmin, Tran, Tuyen, Cao, Bin, Zhang, Yisi, Wang, Hanyi, He, Xingjian, Liu, Jing
Despite the promising performance of current video segmentation models on existing benchmarks, these models still struggle with complex scenes. In this paper, we introduce the 6th Large-scale Video Object Segmentation (LSVOS) challenge in conjunction
Externí odkaz:
http://arxiv.org/abs/2409.05847
Referring Video Object Segmentation is an emerging multi-modal task that aims to segment objects in the video given a natural language expression. In this work, we build two instance-centric models and fuse predicted results from frame-level and inst
Externí odkaz:
http://arxiv.org/abs/2408.10541
Autor:
Ding, Henghui, Liu, Chang, Wei, Yunchao, Ravi, Nikhila, He, Shuting, Bai, Song, Torr, Philip, Miao, Deshui, Li, Xin, He, Zhenyu, Wang, Yaowei, Yang, Ming-Hsuan, Xu, Zhensong, Yao, Jiangtao, Wu, Chengjing, Liu, Ting, Liu, Luoqi, Liu, Xinyu, Zhang, Jing, Zhang, Kexin, Yang, Yuting, Jiao, Licheng, Yang, Shuyuan, Gao, Mingqi, Luo, Jingnan, Yang, Jinyu, Han, Jungong, Zheng, Feng, Cao, Bin, Zhang, Yisi, Lin, Xuanxu, He, Xingjian, Zhao, Bo, Liu, Jing, Pan, Feiyu, Fang, Hao, Lu, Xiankai
Pixel-level Video Understanding in the Wild Challenge (PVUW) focus on complex video understanding. In this CVPR 2024 workshop, we add two new tracks, Complex Video Object Segmentation Track based on MOSE dataset and Motion Expression guided Video Seg
Externí odkaz:
http://arxiv.org/abs/2406.17005
Motion Expression guided Video Segmentation is a challenging task that aims at segmenting objects in the video based on natural language expressions with motion descriptions. Unlike the previous referring video object segmentation (RVOS), this task f
Externí odkaz:
http://arxiv.org/abs/2406.13939
Visual grounding (VG) aims at locating the foreground entities that match the given natural language expressions. Previous datasets and methods for classic VG task mainly rely on the prior assumption that the given expression must literally refer to
Externí odkaz:
http://arxiv.org/abs/2402.11265
The Few-Shot Segmentation (FSS) aims to accomplish the novel class segmentation task with a few annotated images. Current FSS research based on meta-learning focus on designing a complex interaction mechanism between the query and support feature. Ho
Externí odkaz:
http://arxiv.org/abs/2312.15731
Autor:
Wang, Wenxuan, Yue, Tongtian, Zhang, Yisi, Guo, Longteng, He, Xingjian, Wang, Xinlong, Liu, Jing
Referring expression segmentation (RES) aims at segmenting the foreground masks of the entities that match the descriptive natural language expression. Previous datasets and methods for classic RES task heavily rely on the prior assumption that one e
Externí odkaz:
http://arxiv.org/abs/2312.08007
Autor:
Wang, Wenxuan, Liu, Jing, He, Xingjian, Zhang, Yisi, Chen, Chen, Shen, Jiachen, Zhang, Yan, Li, Jiangyun
Referring image segmentation (RIS) is a fundamental vision-language task that intends to segment a desired object from an image based on a given natural language expression. Due to the essentially distinct data properties between image and text, most
Externí odkaz:
http://arxiv.org/abs/2305.11481
Autor:
Shi, Xuefei, Zhang, Yisi, Liu, Kecheng, Wen, Zhaokun, Wang, Wenxuan, Zhang, Tianxiang, Li, Jiangyun
Publikováno v:
In Signal Processing January 2025 226
Autor:
El Hady, Ahmed, Takahashi, Daniel, Sun, Ruolan, Akinwale, Oluwateniola, Boyd-Meredith, Tyler, Zhang, Yisi, Charles, Adam S., Brody, Carlos D.
Publikováno v:
In Journal of Neuroscience Methods March 2024 403