CREMMA Medieval Latin: Literary manuscript text recognition in Latin

Autor:	Clérice, Thibault, Vlachou-Efstathiou, Malamatenia, Chagué, Alix
Přispěvatelé:	École nationale des chartes (ENC), Université Paris sciences et lettres (PSL), Centre Jean Mabillon (CJM), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL), Histoire et Sources des Mondes antiques (HiSoMA), École normale supérieure - Lyon (ENS Lyon)-Université Lumière - Lyon 2 (UL2)-Université Jean Moulin - Lyon 3 (UJML), Université de Lyon-Université de Lyon-Université Jean Monnet [Saint-Étienne] (UJM)-Centre National de la Recherche Scientifique (CNRS), Automatic Language Modelling and ANAlysis & Computational Humanities (ALMAnaCH), Inria de Paris, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria), The project CREMMA was funded by the DIM MAP (now DIM PAMIR) under the supervision of the Conseil Régional d’Île de France.
Jazyk:	angličtina
Rok vydání:	2022
Předmět:	Latin [SHS.LITT]Humanities and Social Sciences/Literature Handwritten Text Recognition [INFO]Computer Science [cs] Middle Ages Manuscripts Layout Segmentation
Popis:	This paper present a novel segmentation and handwritten text recognition dataset for Medieval Latin, from the 11 th to the 16 th century. It connects with Medieval French dataset as well as ealier Latin dataset, by enforcing common guidelines. We provide our own addition to Ariane Pinche's Old French guidelines to deal with specific Latin case. We also offer an overview of how we addressed this dataset compilation through the use of pre-existing resources. With a higher abbreviation ratio and a better representation of abbreviating marks, we offer new models that outperform the base Old French model on Latin dataset, reaching readability levels on unknown manuscripts.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=dedup_wf_001::54aad2b7ec5cd16bd52c8786c6233f5a https://hal-enc.archives-ouvertes.fr/hal-03828353/file/documentation_CREMMA_Medieval_LAT-4.pdf Zobrazit plný text záznamu