Line Segmentation Free Probabilistic Keyword Spotting and Indexing

Autor: Enrique Vidal, Killian Barrere, Alejandro Héctor Toselli
Rok vydání: 2019
Předmět:
Zdroj: Pattern Recognition and Image Analysis ISBN: 9783030313203
IbPRIA (2)
DOI: 10.1007/978-3-030-31321-0_18
Popis: Probabilistic Keyword Spotting and Indexing (PKWSI) allows effective search through untranscribed large collections of images. However, when text-line detection fails to detect foreground text, the PKWSI techniques also fail dramatically. In this paper, we develop a new line segmentation-free approach using a uniform line-sized image slicing instead of previous text-line detection. As a result, new issues arise due to overlapping slices, leading to several spot hypotheses for the same word. We develop solutions to take advantage of multiple spots and to consolidate them into single hypotheses. We test our approach on a difficult historical handwritten dataset and it yields promising results.
Databáze: OpenAIRE