Showing 1 - 10 of 218 for search: '"Yang, Mingkun"'
Scene text recognition (STR) is a challenging task that requires large-scale annotated data for training. However, collecting and labeling real text images is expensive and time-consuming, which limits the availability of real data. Therefore, most e
External link:
http://arxiv.org/abs/2402.15806
Scene text recognition is a rapidly developing field that faces numerous challenges due to the complexity and diversity of scene text, including complex backgrounds, diverse fonts, flexible arrangements, and accidental occlusions. In this paper, we p
External link:
http://arxiv.org/abs/2402.13643
Author:
Kuang, Jianfeng, Hua, Wei, Liang, Dingkang, Yang, Mingkun, Jiang, Deqiang, Ren, Bo, Bai, Xiang
Visual information extraction (VIE), which aims to simultaneously perform OCR and information extraction in a unified framework, has drawn increasing attention due to its essential role in various applications like understanding receipts, goods, and
External link:
http://arxiv.org/abs/2305.07498
Author:
Yang, Mingkun, Liu, Xianhang, Yan, Guishan (yangsh235@mail.sysu.edu.cn), Ai, Chao, Yu, Cong
Published in:
Energies (ISSN 1996-1073). Jul 2024, Vol. 17, Issue 13, p. 3322. 22 pp.
Author:
Yang, Mingkun, Liao, Minghui, Lu, Pu, Wang, Jing, Zhu, Shenggao, Luo, Hualin, Tian, Qi, Bai, Xiang
Existing text recognition methods usually need large-scale training data. Most of them rely on synthetic training data due to the lack of annotated real images. However, there is a domain gap between the synthetic data and real data, which limits the
External link:
http://arxiv.org/abs/2207.00193
Author:
Tang, Jingqun, Zhang, Wenqing, Liu, Hongye, Yang, MingKun, Jiang, Bo, Hu, Guanglong, Bai, Xiang
Recently, transformer-based methods have achieved promising progress in object detection, as they can eliminate post-processing steps like NMS and enrich the deep representations. However, these methods cannot cope well with scene text due to its extr
External link:
http://arxiv.org/abs/2203.15221
Visual Odometry (VO), which estimates the position and orientation of a moving object by analyzing image sequences captured by on-board cameras, has been well investigated with the rising interest in autonomous driving. Th
External link:
http://arxiv.org/abs/2105.09899
Published in:
In Nutrition, Metabolism and Cardiovascular Diseases April 2024 34(4):1046-1053
Scene text retrieval aims to localize and search all text instances from an image gallery, which are the same or similar to a given query text. Such a task is usually realized by matching a query text to the recognized words, outputted by an end-to-e
External link:
http://arxiv.org/abs/2104.01552
Author:
Cao, Chenrui, Yang, Mingkun, Liang, Chen, Zhang, Donglin, Chen, Xin, Zhao, Xiuchen, Lee, Chin C., Huo, Yongjun
Published in:
In Electrochimica Acta 10 December 2023 471