Showing 1 - 10 of 173 for search: '"Dai Dawei"'
Human-scene vision-language tasks are increasingly prevalent in diverse social applications, yet recent advancements predominantly rely on models specifically tailored to individual tasks. Emerging research indicates that large vision-language models
External link:
http://arxiv.org/abs/2411.03034
In actual scenarios, whether manually or automatically annotated, label noise is inevitably generated in the training data, which can affect the effectiveness of deep CNN models. The popular solutions require data cleaning or designing additional opt
External link:
http://arxiv.org/abs/2409.03254
Author:
Dai, Dawei, Zhang, Yuanhui, Xu, Long, Yang, Qianlan, Shen, Xiaojing, Xia, Shuyin, Wang, Guoyin
Previous advances in pathology image understanding primarily involved developing models tailored to specific tasks. Recent studies have demonstrated that large vision-language models can enhance the performance of various downstream tasks i
External link:
http://arxiv.org/abs/2408.09530
Currently, image-text-driven multi-modal deep learning models have demonstrated their outstanding potential in many fields. In practice, tasks centered around facial images have broad application prospects. This paper presents FaceCaption-15M
External link:
http://arxiv.org/abs/2407.08515
In specific scenarios, a face sketch can be used to identify a person. However, drawing a face sketch often requires exceptional skill and is time-consuming, which limits its widespread application in practice. The new framework of sketch less fac
External link:
http://arxiv.org/abs/2401.00371
In some specific scenarios, face sketches are used to identify a person. However, drawing a complete face sketch often requires skill and takes time, which hinders its widespread applicability in practice. In this study, we proposed a new task named s
External link:
http://arxiv.org/abs/2302.05576
In supervised learning, the presence of noise can have a significant impact on decision making. Since many classifiers do not take label noise into account in the derivation of the loss function, including the loss functions of logistic regression, S
External link:
http://arxiv.org/abs/2207.08810
Published in:
In Information Fusion December 2024 112
Published in:
In Expert Systems With Applications 1 December 2024 255 Part A
One-Stage Deep Edge Detection Based on Dense-Scale Feature Fusion and Pixel-Level Imbalance Learning
Edge detection, a basic task in the field of computer vision, is an important preprocessing operation for the recognition and understanding of a visual scene. In conventional models, the edge image generated is ambiguous, and the edge lines are also
External link:
http://arxiv.org/abs/2203.09387