Výsledky vyhledávání - "Yang, Mingkun"

Report

Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition

Autor: Yang, Mingkun, Yang, Biao, Liao, Minghui, Zhu, Yingying, Bai, Xiang

Scene text recognition (STR) is a challenging task that requires large-scale annotated data for training. However, collecting and labeling real text images is expensive and time-consuming, which limits the availability of real data. Therefore, most e

Externí odkaz: http://arxiv.org/abs/2402.15806

Zobrazit plný text záznamu

Report

Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition

Autor: Yang, Mingkun, Yang, Biao, Liao, Minghui, Zhu, Yingying, Bai, Xiang

Scene text recognition is a rapidly developing field that faces numerous challenges due to the complexity and diversity of scene text, including complex backgrounds, diverse fonts, flexible arrangements, and accidental occlusions. In this paper, we p

Externí odkaz: http://arxiv.org/abs/2402.13643

Zobrazit plný text záznamu

Report

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution

Autor: Kuang, Jianfeng, Hua, Wei, Liang, Dingkang, Yang, Mingkun, Jiang, Deqiang, Ren, Bo, Bai, Xiang

Visual information extraction (VIE), which aims to simultaneously perform OCR and information extraction in a unified framework, has drawn increasing attention due to its essential role in various applications like understanding receipts, goods, and

Externí odkaz: http://arxiv.org/abs/2305.07498

Zobrazit plný text záznamu

Akademický článek

Research on Variable Speed Variable Displacement Power Unit with High Efficiency and High Dynamic Optimized Matching.

Autor: Yang, Mingkun¹ (AUTHOR), Liu, Xianhang¹ (AUTHOR), Yan, Guishan² (AUTHOR) yangsh235@mail.sysu.edu.cn, Ai, Chao¹ (AUTHOR), Yu, Cong¹ (AUTHOR)

Publikováno v: Energies (19961073). Jul2024, Vol. 17 Issue 13, p3322. 22p.

Zobrazit plný text záznamu

Plný text ve formátu HTML

Report

Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition

Autor: Yang, Mingkun, Liao, Minghui, Lu, Pu, Wang, Jing, Zhu, Shenggao, Luo, Hualin, Tian, Qi, Bai, Xiang

Existing text recognition methods usually need large-scale training data. Most of them rely on synthetic training data due to the lack of annotated real images. However, there is a domain gap between the synthetic data and real data, which limits the

Externí odkaz: http://arxiv.org/abs/2207.00193

Zobrazit plný text záznamu

Report

Few Could Be Better Than All: Feature Sampling and Grouping for Scene Text Detection

Autor: Tang, Jingqun, Zhang, Wenqing, Liu, Hongye, Yang, MingKun, Jiang, Bo, Hu, Guanglong, Bai, Xiang

Recently, transformer-based methods have achieved promising progresses in object detection, as they can eliminate the post-processes like NMS and enrich the deep representations. However, these methods cannot well cope with scene text due to its extr

Externí odkaz: http://arxiv.org/abs/2203.15221

Zobrazit plný text záznamu

Report

DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry

Autor: Zhu, Ran, Yang, Mingkun, Liu, Wang, Song, Rujun, Yan, Bo, Xiao, Zhuoling

The technology for Visual Odometry (VO) that estimates the position and orientation of the moving object through analyzing the image sequences captured by on-board cameras, has been well investigated with the rising interest in autonomous driving. Th

Externí odkaz: http://arxiv.org/abs/2105.09899

Zobrazit plný text záznamu

Akademický článek

Association between the dietary inflammatory index and all-cause and cardiovascular mortality in patients with atherosclerotic cardiovascular disease

Autor: Yang, Mingkun, Miao, Shenhui, Hu, Weihang, Yan, Jing

Publikováno v: In Nutrition, Metabolism and Cardiovascular Diseases April 2024 34(4):1046-1053

Zobrazit plný text záznamu

Report

Scene Text Retrieval via Joint Text Detection and Similarity Learning

Autor: Wang, Hao, Bai, Xiang, Yang, Mingkun, Zhu, Shenggao, Wang, Jing, Liu, Wenyu

Scene text retrieval aims to localize and search all text instances from an image gallery, which are the same or similar to a given query text. Such a task is usually realized by matching a query text to the recognized words, outputted by an end-to-e

Externí odkaz: http://arxiv.org/abs/2104.01552

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání