Zobrazeno 1 - 10
of 251
pro vyhledávání: '"MA ChaoFan"'
Publikováno v:
Jixie qiangdu, Vol 40, Pp 882-889 (2018)
A lightweight design method based on the idea of secondary optimization was proposed for a certain commercial vehicle front axle.The finite element model of the front axle was established and simulated.Combined with the fatigue endurance test,the tru
Externí odkaz:
https://doaj.org/article/d89b5b92e66346a7baea2258c24d1de2
Referring Image Segmentation~(RIS) leveraging transformers has achieved great success on the interpretation of complex visual-language tasks. However, the quadratic computation cost makes it resource-consuming in capturing long-range visual-language
Externí odkaz:
http://arxiv.org/abs/2403.17839
Autor:
Zhang, Fei, Zhou, Tianfei, Li, Boyang, He, Hao, Ma, Chaofan, Zhang, Tianjiao, Yao, Jiangchao, Zhang, Ya, Wang, Yanfeng
This paper studies the problem of weakly open-vocabulary semantic segmentation (WOVSS), which learns to segment objects of arbitrary classes using mere image-text pairs. Existing works turn to enhance the vanilla vision transformer by introducing exp
Externí odkaz:
http://arxiv.org/abs/2310.19001
Open-vocabulary semantic segmentation is a challenging task that requires segmenting novel object categories at inference time. Recent studies have explored vision-language pre-training to handle this task, but suffer from unrealistic assumptions in
Externí odkaz:
http://arxiv.org/abs/2309.00096
The goal of the audio-visual segmentation (AVS) task is to segment the sounding objects in the video frames using audio cues. However, current fusion-based methods have the performance limitations due to the small receptive field of convolution and i
Externí odkaz:
http://arxiv.org/abs/2307.13236
In semantic segmentation, generalizing a visual system to both seen categories and novel categories at inference time has always been practically valuable yet challenging. To enable such functionality, existing methods mainly rely on either providing
Externí odkaz:
http://arxiv.org/abs/2307.02003
The objective of Audio-Visual Segmentation (AVS) is to localise the sounding objects within visual scenes by accurately predicting pixel-wise segmentation masks. To tackle the task, it involves a comprehensive consideration of both the data and model
Externí odkaz:
http://arxiv.org/abs/2305.11019
Publikováno v:
IEEE Transactions on Medical Imaging, vol. 40, no. 10, pp. 2563-2574, Oct. 2021
Interactive segmentation has recently been explored to effectively and efficiently harvest high-quality segmentation masks by iteratively incorporating user hints. While iterative in nature, most existing interactive segmentation methods tend to igno
Externí odkaz:
http://arxiv.org/abs/2303.10692
Autor:
Ma, Chaofan, Yang, Yuhuan, Ju, Chen, Zhang, Fei, Liu, Jinxiang, Wang, Yu, Zhang, Ya, Wang, Yanfeng
Learning from a large corpus of data, pre-trained models have achieved impressive progress nowadays. As popular generative pre-training, diffusion models capture both low-level visual knowledge and high-level semantic relations. In this paper, we pro
Externí odkaz:
http://arxiv.org/abs/2303.09813
Autor:
Ju, Chen, Wang, Haicheng, Liu, Jinxiang, Ma, Chaofan, Zhang, Ya, Zhao, Peisen, Chang, Jianlong, Tian, Qi
Temporal sentence grounding aims to detect the event timestamps described by the natural language query from given untrimmed videos. The existing fully-supervised setting achieves great performance but requires expensive annotation costs; while the w
Externí odkaz:
http://arxiv.org/abs/2302.09850