Zobrazeno 1 - 10
of 18
pro vyhledávání: '"Tarride, Solène"'
In recent advances in automatic text recognition (ATR), deep neural networks have demonstrated the ability to implicitly capture language statistics, potentially reducing the need for traditional language models. This study directly addresses whether
Externí odkaz:
http://arxiv.org/abs/2404.19317
Autor:
Tarride, Solène, Schneider, Yoann, Generali-Lince, Marie, Boillet, Mélodie, Abadie, Bastien, Kermorvant, Christopher
PyLaia is one of the most popular open-source software for Automatic Text Recognition (ATR), delivering strong performance in terms of speed and accuracy. In this paper, we outline our recent contributions to the PyLaia library, focusing on the incor
Externí odkaz:
http://arxiv.org/abs/2404.18722
Autor:
Boillet, Mélodie, Tarride, Solène, Blanco, Manon, Rigal, Valentin, Schneider, Yoann, Abadie, Bastien, Kesztenbaum, Lionel, Kermorvant, Christopher
This paper presents a complete processing workflow for extracting information from French census lists from 1836 to 1936. These lists contain information about individuals living in France and their households. We aim at extracting all the informatio
Externí odkaz:
http://arxiv.org/abs/2404.18706
Autor:
Villanova-Aparisi, David, Tarride, Solène, Martínez-Hinarejos, Carlos-D., Romero, Verónica, Kermorvant, Christopher, Pastor-Gadea, Moisés
Information Extraction processes in handwritten documents tend to rely on obtaining an automatic transcription and performing Named Entity Recognition (NER) over such transcription. For this reason, in publicly available datasets, the performance of
Externí odkaz:
http://arxiv.org/abs/2404.18664
In this paper, we explore different ways of training a model for handwritten text recognition when multiple imperfect or noisy transcriptions are available. We consider various training configurations, such as selecting a single transcription, retain
Externí odkaz:
http://arxiv.org/abs/2306.10878
Autor:
Tarride, Solène, Maarand, Martin, Boillet, Mélodie, McGrath, James, Capel, Eugénie, Vézina, Hélène, Kermorvant, Christopher
Publikováno v:
International Journal on Document Analysis and Recognition (IJDAR) (2023)
This paper presents a complete workflow designed for extracting information from Quebec handwritten parish registers. The acts in these documents contain individual and family information highly valuable for genetic, demographic and social studies of
Externí odkaz:
http://arxiv.org/abs/2304.14044
We propose a new database for information extraction from historical handwritten documents. The corpus includes 5,393 finding aids from six different series, dating from the 18th-20th centuries. Finding aids are handwritten documents that contain met
Externí odkaz:
http://arxiv.org/abs/2304.13606
We propose a Transformer-based approach for information extraction from digitized handwritten documents. Our approach combines, in a single model, the different steps that were so far performed by separate models: feature extraction, handwriting reco
Externí odkaz:
http://arxiv.org/abs/2304.13530
Publikováno v:
15th IAPR International Workshop on Document Analysis Systems
15th IAPR International Workshop on Document Analysis Systems, May 2022, La Rochelle, France
Document Analysis Systems ISBN: 9783031065545
15th IAPR International Workshop on Document Analysis Systems, May 2022, La Rochelle, France
Document Analysis Systems ISBN: 9783031065545
International audience; This article focuses on information extraction in historical handwritten marriage records. Traditional approaches rely on a sequential pipeline of two consecutive tasks: handwriting recognition is applied before named entity r
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::9fc4db61ebfea392fc7d58e5c55547b9
https://hal.science/hal-03677908
https://hal.science/hal-03677908
Publikováno v:
Doctoral Consortium-ICDAR 2021
Doctoral Consortium-ICDAR 2021, Sep 2021, Lausanne, Switzerland
Doctoral Consortium-ICDAR 2021, Sep 2021, Lausanne, Switzerland
International audience
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::b58f97a6d95f88b0cff76048b45a32f8
https://hal.science/hal-03556913
https://hal.science/hal-03556913