Challenges of Deep Learning-based Text Detection in the Wild

Autor:	Zobeir Raisi, John Zelek, Mohamed A. Naiel, Steven Wardell, Paul Fieguth
Rok vydání:	2021
Předmět:	Computer science business.industry Deep learning Text detection Artificial intelligence business computer.software_genre computer Natural language processing
Zdroj:	Journal of Computational Vision and Imaging Systems. 6:1-5
ISSN:	2562-0444
DOI:	10.15353/jcvis.v6i1.3543
Popis:	The reported accuracy of recent state-of-the-art text detection methods, mostly deep learning approaches, is in the order of 80% to 90% on standard benchmark datasets. These methods have relaxed some of the restrictions of structured text and environment (i.e., "in the wild") which are usually required for classical OCR to properly function. Even with this relaxation, there are still circumstances where these state-of-the-art methods fail. Several remaining challenges in wild images, like in-plane-rotation, illumination reflection, partial occlusion, complex font styles, and perspective distortion, cause exciting methods to perform poorly. In order to evaluate current approaches in a formal way, we standardize the datasets and metrics for comparison which had made comparison between these methods difficult in the past. We use three benchmark datasets for our evaluations: ICDAR13, ICDAR15, and COCO-Text V2.0. The objective of the paper is to quantify the current shortcomings and to identify the challenges for future text detection research.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::c48de97738bbd4c6209daa7a3482a27a https://doi.org/10.15353/jcvis.v6i1.3543 Zobrazit plný text záznamu