Search results - "Diao, Xiaolei"

Report

Toward Zero-shot Character Recognition: A Gold Standard Dataset with Radical-level Annotations

Authors: Diao, Xiaolei; Shi, Daqian; Li, Jian; Shi, Lida; Yue, Mingzhe; Qi, Ruihua; Li, Chuntao; Xu, Hao

Optical character recognition (OCR) methods have been applied to diverse tasks, e.g., street view text recognition and document analysis. Recently, zero-shot OCR has piqued the interest of the research community because it considers a practical OCR s

External link: http://arxiv.org/abs/2308.00655

Report

A semantics-driven methodology for high-quality image annotation

Authors: Giunchiglia, Fausto; Bagchi, Mayukh; Diao, Xiaolei

Recent work in Machine Learning and Computer Vision has highlighted the presence of various types of systematic flaws inside ground truth object recognition benchmark datasets. Our basic tenet is that these flaws are rooted in the many-to-many mappin

External link: http://arxiv.org/abs/2307.14119

Report

Incremental Image Labeling via Iterative Refinement

Authors: Giunchiglia, Fausto; Diao, Xiaolei; Bagchi, Mayukh

Published in: IWCIM@ICASSP 2023

Data quality is critical for multimedia tasks, while various types of systematic flaws are found in image benchmark datasets, as discussed in recent work. In particular, the existence of the semantic gap problem leads to a many-to-many mapping betwee

External link: http://arxiv.org/abs/2304.08989

Report

Aligning Visual and Lexical Semantics

Authors: Giunchiglia, Fausto; Bagchi, Mayukh; Diao, Xiaolei

We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics focus on

External link: http://arxiv.org/abs/2212.06629

Report

CharFormer: A Glyph Fusion based Attentive Framework for High-precision Character Image Denoising

Authors: Shi, Daqian; Diao, Xiaolei; Shi, Lida; Tang, Hao; Chi, Yang; Li, Chuntao; Xu, Hao

Degraded images commonly exist in the general sources of character images, leading to unsatisfactory character recognition results. Existing methods have dedicated efforts to restoring degraded character images. However, the denoising results obtaine

External link: http://arxiv.org/abs/2207.07798

Report

RCRN: Real-world Character Image Restoration Network via Skeleton Extraction

Authors: Shi, Daqian; Diao, Xiaolei; Tang, Hao; Li, Xiaomin; Xing, Hao; Xu, Hao

Constructing high-quality character image datasets is challenging because real-world images are often affected by image degradation. There are limitations when applying current image restoration methods to such real-world character images, since (i)

External link: http://arxiv.org/abs/2207.07795

Report

RZCR: Zero-shot Character Recognition via Radical-based Reasoning

Authors: Diao, Xiaolei; Shi, Daqian; Tang, Hao; Shen, Qiang; Li, Yanzeng; Wu, Lei; Xu, Hao

The long-tail effect is a common issue that limits the performance of deep learning models on real-world datasets. Character image datasets are also affected by such unbalanced data distribution due to differences in character usage frequency. Thus,

External link: http://arxiv.org/abs/2207.05842

Report

Building a visual semantics aware object hierarchy

Author: Diao, Xiaolei

The semantic gap is defined as the difference between the linguistic representations of the same concept, which usually leads to misunderstanding between individuals with different knowledge backgrounds. Since linguistically annotated images are exte

External link: http://arxiv.org/abs/2202.13021

Report

Visual Ground Truth Construction as Faceted Classification

Authors: Giunchiglia, Fausto; Bagchi, Mayukh; Diao, Xiaolei

Recent work in Machine Learning and Computer Vision has provided evidence of systematic design flaws in the development of major object recognition benchmark datasets. One such example is ImageNet, wherein, for several categories of images, there are

External link: http://arxiv.org/abs/2202.08512
