Text-line-up: Don’t Worry About the Caret

Author: Adak, C, Chaudhuri, BB, Lin, CT, Blumenstein, M
Contributors: Llados, J, Lopresti, D, Uchida, S
Language: English
Year of publication: 2021
Subject:
Description: In a freestyle handwritten text-line, sometimes words are inserted using a caret symbol (∧) for corrections/annotations. Such insertions create fluctuations in the reading sequence of words. In this paper, we aim to line-up the words of a text-line, so that it can assist the OCR engine. Previous text-line segmentation techniques in the literature have scarcely addressed this issue. Here, the task undertaken is formulated as a path planning problem, and a novel multi-agent hierarchical reinforcement learning-based architecture solution is proposed. As a matter of fact, no linguistic knowledge is used here. Experimentation of the proposed solution architecture has been conducted on English and Bengali offline handwriting, which yielded some interesting results.
Database: OpenAIRE