Automatiser Artl@s – extraire des données de catalogues d'exposition

Autor: Simon Gabay, Barbara Topalov, Caroline Corbières, Lucie Rondeau Du Noyer, Béatrice Joyeux-Prunel, Laurent Romary
Přispěvatelé: Université de Genève (UNIGE), Université Paris-Saclay, Automatic Language Modelling and ANAlysis & Computational Humanities (ALMAnaCH), Inria de Paris, Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria), Université de Genève = University of Geneva (UNIGE)
Předmět:
Zdroj: HAL
EADH 2021-Second International Conference of the European Association for Digital Humanities
EADH 2021-Second International Conference of the European Association for Digital Humanities, Sep 2021, Krasnoyarsk, Russia
Popis: International audience; Catalogues, which have been published for centuries, are an extremely precious resource for scholars. Using the Artl@s database as an example, where exhibition catalogues are transformed into a georeferenced database, we question the possibility of an (almost) automatic transformation of pdfs into semantically annotated data. To do so, we present and analyse the graphic organisation of exhibition catalogues, before exploring a possible modeling into TEI (involving possible enhancement of the guidelines).
Databáze: OpenAIRE