Benchmarking discriminative approaches for word spotting in handwritten documents

Authors: Gautier Bideault, Luc Mioulet, Thierry Paquet, Clément Chatelain
Contributors: Laboratoire d'Informatique, de Traitement de l'Information et des Systèmes (LITIS), Institut national des sciences appliquées Rouen Normandie (INSA Rouen Normandie), Institut National des Sciences Appliquées (INSA)-Normandie Université (NU)-Institut National des Sciences Appliquées (INSA)-Normandie Université (NU)-Université de Rouen Normandie (UNIROUEN), Normandie Université (NU)-Université Le Havre Normandie (ULH), Normandie Université (NU), Equipe Apprentissage (DocApp - LITIS), Normandie Université (NU)-Institut national des sciences appliquées Rouen Normandie (INSA Rouen Normandie)
Publication year: 2015
Subject:
Source: ICDAR
2015 13th International Conference on Document Analysis and Recognition (ICDAR), Aug 2015, Nancy, France. pp. 201-205, ⟨10.1109/ICDAR.2015.7333752⟩
DOI: 10.1109/icdar.2015.7333752
Description: International audience; In this article, we benchmark the most popular methods for word spotting in handwritten documents. The benchmark includes a pure HMM approach as well as the hybrid discriminative methods MLP-HMM, CRF-HMM, RNN-HMM and BLSTM-CTC-HMM. This study lets us measure the performance gain provided by each discriminative stage over the pure generative HMM approach. Moreover, we highlight the different abilities of these discriminative stages, from the simplest (MLP) to the most complex and current state of the art (BLSTM-CTC). We also present a more specific and original study of BLSTM-CTC, showing that when used as a lexicon-free recognizer, it can reach very interesting word-spotting performance.
Database: OpenAIRE