Author: |
Romein, C. Annemieke, Hodel, Tobias, Gordijn, Femke, Zundert, Joris J. van, Chagué, Alix, Lange, Milan van, Jensen, Helle Strandgaard, Stauder, Andy, Purcell, Jake, Terras, Melissa M., Heuvel, Pauline van den, Keijzer, Carlijn, Rabus, Achim, Sitaram, Chantal, Bhatia, Aakriti, Depuydt, Katrien, Afolabi-Adeolu, Mary Aderonke, Anikina, Anastasiia, Bastianello, Elisa, Benzinger, Lukas Vincent, Bosse, Arno, Brown, David, Charlton, Ash, Dannevig, André Nilsson, Gelder, Klaas van, Go, Sabine C.P.J., Goh, Marcus J.C., Gstrein, Silvia, Hasan, Sewa, Heide, Stefan von der, Hindermann, Maximilian, Huff, Dorothee, Huysman, Ineke, Idris, Ali, Keijzer, Liesbeth, Kemper, Simon, Koenders, Sanne, Kuijpers, Erika, Rønsig Larsen, Lisette, Lepa, Sven, Link, Tommy O., Nispen, Annelies van, Nockels, Joe, Noort, Laura M. van, Oosterhuis, Joost Johannes, Popken, Vivien, Estrella Puertollano, María, Puusaag, Joosep J., Sheta, Ahmed, Stoop, Lex, Strutzenbladh, Ebba, Sijs, Nicoline van der, Spek, Jan Paul van der, Trouw, Barry Benaissa, Van Synghel, Geertrui, Vučković, Vladimir, Wilbrink, Heleen, Weiss, Sonia, Wrisley, David Joseph, Zweistra, Riet |
Contributors: |
Politieke Cultuur en Geschiedenis (HI), Computationele Literatuurwetenschap (HI), NIOD Institute for War, Holocaust and Genocide studies, Digital Infrastructure, International Institute of Social History (IISH), NL-Lab, Geschiedenis (HI), Meertens Institute |
Language: |
English |
Year of publication: |
2023 |
Subject: |
|
Source: |
Zenodo |
Description: |
This paper discusses best practices for sharing and reusing Ground Truth in Handwritten Text Recognition infrastructures, as well as ways to reference and acknowledge contributions to the creation and enrichment of data within these systems. We discuss how one can place Ground Truth data in a repository and, subsequently, inform others through HTR-United. Furthermore, we want to suggest appropriate citation methods for HTR data, models, and contributions made by volunteers. Moreover, when using digitised sources (digital facsimiles), it becomes increasingly important to distinguish between the physical object and the digital collection. These topics all relate to the proper acknowledgement of labour put into digitising, transcribing, and sharing Ground Truth HTR data. This also points to broader issues surrounding the use of machine learning in archival and library contexts, and how the community should begin to acknowledge and record both contributions and data provenance. |
Database: |
OpenAIRE |
External link: |
|