Search results

Report

HumanVLM: Foundation for Human-Scene Vision-Language Model

Authors: Dai, Dawei; Long, Xu; Yutang, Li; Yuanhui, Zhang; Xia, Shuyin

Human-scene vision-language tasks are increasingly prevalent in diverse social applications, yet recent advancements predominantly rely on models specifically tailored to individual tasks. Emerging research indicates that large vision-language models

External link: http://arxiv.org/abs/2411.03034

Report

Granular-ball Representation Learning for Deep CNN on Learning with Label Noise

Authors: Dai, Dawei; Zhu, Hao; Xia, Shuyin; Wang, Guoyin

In actual scenarios, whether manually or automatically annotated, label noise is inevitably generated in the training data, which can affect the effectiveness of deep CNN models. The popular solutions require data cleaning or designing additional opt

External link: http://arxiv.org/abs/2409.03254

Report

PA-LLaVA: A Large Language-Vision Assistant for Human Pathology Image Understanding

Authors: Dai, Dawei; Zhang, Yuanhui; Xu, Long; Yang, Qianlan; Shen, Xiaojing; Xia, Shuyin; Wang, Guoyin

The previous advancements in pathology image understanding primarily involved developing models tailored to specific tasks. Recent studies have demonstrated that the large vision-language model can enhance the performance of various downstream tasks i

External link: http://arxiv.org/abs/2408.09530

Report

15M Multimodal Facial Image-Text Dataset

Authors: Dai, Dawei; Li, YuTang; Liu, YingGe; Jia, Mingming; YuanHui, Zhang; Wang, Guoyin

Currently, image-text-driven multi-modal deep learning models have demonstrated their outstanding potential in many fields. In practice, tasks centered around facial images have broad application prospects. This paper presents FaceCaption-15M

External link: http://arxiv.org/abs/2407.08515

Report

Multi-Granularity Representation Learning for Sketch-based Dynamic Face Image Retrieval

Authors: Wang, Liang; Dai, Dawei; Fu, Shiyu; Wang, Guoyin

In specific scenarios, face sketch can be used to identify a person. However, drawing a face sketch often requires exceptional skill and is time-consuming, limiting its widespread applications in actual scenarios. The new framework of sketch less fac

External link: http://arxiv.org/abs/2401.00371

Report

Sketch Less Face Image Retrieval: A New Challenge

Authors: Dai, Dawei; Li, Yutang; Wang, Liang; Fu, Shiyu; Xia, Shuyin; Wang, Guoyin

In some specific scenarios, face sketch was used to identify a person. However, drawing a complete face sketch often needs skills and takes time, which hinders its widespread applicability in practice. In this study, we proposed a new task named s

External link: http://arxiv.org/abs/2302.05576

Report

A Study of Deep CNN Model with Labeling Noise Based on Granular-ball Computing

Authors: Dai, Dawei; Li, Donggen; Zhuang, Zhiguo

In supervised learning, the presence of noise can have a significant impact on decision making. Since many classifiers do not take label noise into account in the derivation of the loss function, including the loss functions of logistic regression, S

External link: http://arxiv.org/abs/2207.08810

Academic article

Vision-language joint representation learning for sketch less facial image retrieval

Authors: Dai, Dawei; Fu, Shiyu; Liu, Yingge; Wang, Guoyin

Published in: Information Fusion, December 2024, 112

Academic article

Prior semantic-embedding representation learning for on-the-fly FG-SBIR

Authors: Liu, Yingge; Dai, Dawei; Zou, Kenan; Tan, Xiufang; Wu, Yiqiao; Wang, Guoyin

Published in: Expert Systems With Applications, 1 December 2024, 255, Part A

Report

One-Stage Deep Edge Detection Based on Dense-Scale Feature Fusion and Pixel-Level Imbalance Learning

Authors: Dai, Dawei; Wang, Chunjie; Xia, Shuyin; Liu, Yingge; Wang, Guoyin

Edge detection, a basic task in the field of computer vision, is an important preprocessing operation for the recognition and understanding of a visual scene. In conventional models, the edge image generated is ambiguous, and the edge lines are also

External link: http://arxiv.org/abs/2203.09387
