Representative Image Selection for Data Efficient Word Spotting

Autor: Florian Westphal, Niklas Lavesson, Håkan Grahn
Jazyk: angličtina
Rok vydání: 2020
Předmět:
Zdroj: Document Analysis Systems ISBN: 9783030570576
DAS
Popis: This paper compares three different word image representations as base for label free sample selection for word spotting in historical handwritten documents. These representations are a temporal pyramid representation based on pixel counts, a graph based representation, and a pyramidal histogram of characters (PHOC) representation predicted by a PHOCNet trained on synthetic data. We show that the PHOC representation can help to reduce the amount of required training samples by up to 69% depending on the dataset, if it is learned iteratively in an active learning like fashion. While this works for larger datasets containing about 1 700 images, for smaller datasets with 100 images, we find that the temporal pyramid and the graph representation perform better. open access
Databáze: OpenAIRE