Zobrazeno 1 - 10
of 30
pro vyhledávání: '"Diao, Xiaolei"'
Autor:
Diao, Xiaolei, Shi, Daqian, Li, Jian, Shi, Lida, Yue, Mingzhe, Qi, Ruihua, Li, Chuntao, Xu, Hao
Optical character recognition (OCR) methods have been applied to diverse tasks, e.g., street view text recognition and document analysis. Recently, zero-shot OCR has piqued the interest of the research community because it considers a practical OCR s
Externí odkaz:
http://arxiv.org/abs/2308.00655
Recent work in Machine Learning and Computer Vision has highlighted the presence of various types of systematic flaws inside ground truth object recognition benchmark datasets. Our basic tenet is that these flaws are rooted in the many-to-many mappin
Externí odkaz:
http://arxiv.org/abs/2307.14119
Publikováno v:
IWCIM@ICASSP 2023
Data quality is critical for multimedia tasks, while various types of systematic flaws are found in image benchmark datasets, as discussed in recent work. In particular, the existence of the semantic gap problem leads to a many-to-many mapping betwee
Externí odkaz:
http://arxiv.org/abs/2304.08989
We discuss two kinds of semantics relevant to Computer Vision (CV) systems - Visual Semantics and Lexical Semantics. While visual semantics focus on how humans build concepts when using vision to perceive a target reality, lexical semantics focus on
Externí odkaz:
http://arxiv.org/abs/2212.06629
Degraded images commonly exist in the general sources of character images, leading to unsatisfactory character recognition results. Existing methods have dedicated efforts to restoring degraded character images. However, the denoising results obtaine
Externí odkaz:
http://arxiv.org/abs/2207.07798
Constructing high-quality character image datasets is challenging because real-world images are often affected by image degradation. There are limitations when applying current image restoration methods to such real-world character images, since (i)
Externí odkaz:
http://arxiv.org/abs/2207.07795
The long-tail effect is a common issue that limits the performance of deep learning models on real-world datasets. Character image datasets are also affected by such unbalanced data distribution due to differences in character usage frequency. Thus,
Externí odkaz:
http://arxiv.org/abs/2207.05842
Autor:
Diao, Xiaolei
The semantic gap is defined as the difference between the linguistic representations of the same concept, which usually leads to misunderstanding between individuals with different knowledge backgrounds. Since linguistically annotated images are exte
Externí odkaz:
http://arxiv.org/abs/2202.13021
Recent work in Machine Learning and Computer Vision has provided evidence of systematic design flaws in the development of major object recognition benchmark datasets. One such example is ImageNet, wherein, for several categories of images, there are
Externí odkaz:
http://arxiv.org/abs/2202.08512
Publikováno v:
In Information Processing and Management May 2021 58(3)