tsegGAN: A Generative Adversarial Network for Segmenting Touching Nontext Components From Text Ones in Handwriting

Autor:	Showmik Bhowmik, Ram Sarkar, Riktim Mondal
Rok vydání:	2021
Předmět:	Computer science business.industry media_common.quotation_subject 020208 electrical & electronic engineering 02 engineering and technology Image segmentation Optical character recognition computer.software_genre Task (project management) Handwriting Margin (machine learning) Component (UML) ComputingMethodologies_DOCUMENTANDTEXTPROCESSING 0202 electrical engineering electronic engineering information engineering Segmentation Artificial intelligence Electrical and Electronic Engineering Function (engineering) business Instrumentation computer Natural language processing media_common
Zdroj:	IEEE Transactions on Instrumentation and Measurement. 70:1-10
ISSN:	1557-9662 0018-9456
DOI:	10.1109/tim.2020.3038277
Popis:	Segmentation of a touching component to separate its constituent text and nontext parts is always a very crucial but challenging task toward developing a comprehensive document image processing (DIP) system. This is because, irrespective of document types, either printed or handwritten, the nontext parts need to be suppressed first before processing the text parts through an optical character recognition (OCR) system. Although a good number of attempts have been made to address this issue for printed documents, the same for regular handwritten document images is almost none. However, the appearance of touching components where a nontext part gets joined with a text part is a common issue in freestyle handwriting. To this end, in this work, we tailor-make a generative adversarial network (GAN)-based model with a suitable loss function that we name tsegGAN. We also prepare an in-house data set by collecting touching components from different real-world handwritten documents to evaluate our model. The performance comparison of our model with state-of-the-art GAN models shows that tsegGAN has outperformed the others with a significant margin.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::4d6b331360528146977f063685c29c7d https://doi.org/10.1109/tim.2020.3038277 Zobrazit plný text záznamu