Noisy medieval data, from digitized manuscript to stylometric analysis: Evaluating Paul Meyer’s hagiographic hypothesis

Autor: Jean-Baptiste Camps, Thibault Clérice, Ariane Pinche
Přispěvatelé: Centre Jean Mabillon (CJM), École nationale des chartes (ENC), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL), Université Paris sciences et lettres (PSL), Lattice - Langues, Textes, Traitements informatiques, Cognition - UMR 8094 (Lattice), Université Sorbonne Nouvelle - Paris 3-Université Sorbonne Paris Cité (USPC)-Centre National de la Recherche Scientifique (CNRS)-Université Paris sciences et lettres (PSL)-Département Littératures et langage - ENS Paris (LILA), École normale supérieure - Paris (ENS-PSL), Université Paris sciences et lettres (PSL)-Université Paris sciences et lettres (PSL)-École normale supérieure - Paris (ENS-PSL), Centre National de la Recherche Scientifique (CNRS), Histoire, Archéologie et Littératures des mondes chrétiens et musulmans médiévaux (CIHAM), École normale supérieure de Lyon (ENS de Lyon)-Université Lumière - Lyon 2 (UL2)-École des hautes études en sciences sociales (EHESS)-Université Jean Moulin - Lyon 3 (UJML), Université de Lyon-Université de Lyon-Avignon Université (AU)-Centre National de la Recherche Scientifique (CNRS)
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: Digital Scholarship in the Humanities
Digital Scholarship in the Humanities, 2021, 36 (Supplement_2), pp.ii49-ii71. ⟨10.1093/llc/fqab033⟩
ISSN: 2055-7671
2055-768X
DOI: 10.1093/llc/fqab033⟩
Popis: Stylometric analysis of medieval vernacular texts is still a significant challenge: the importance of scribal variation, be it spelling or more substantial, as well as the variants and errors introduced in the tradition, complicate the task of the would-be stylometrist, by inducing noise and perhaps even interferences in the authorship signal. Basing the analysis on the study of the copy from a single hand of several texts can partially mitigate these issues (Camps and Cafiero, 2013, Setting bounds in a homogeneous corpus: a methodological study applied to medieval literature. Revue Des Nouvelles Technologies de l’information (RNTI), SHS-1, pp. 55–84), but the limited availability of complete diplomatic transcriptions might make this difficult. In this article, we use a workflow combining handwritten text recognition and stylometric analysis, applied to the case of the hagiographic works contained in MS BnF, fr. 412. We seek to evaluate Paul Meyer's hypothesis about the constitution of groups of hagiographic works, as well as to examine potential authorial groupings in a vastly anonymous corpus.
Databáze: OpenAIRE