Newspapers & photos connected. Exploring the use of computer vision to gain better access to large heritage collections

Author: De Gruijter, Michel
Language: English
Year of publication: 2022
Subject:
DOI: 10.5281/zenodo.7331359
Description: Poster presented at the Clariah-NDE Conference, 24 November 2022, Utrecht. Related to Newspaper and Photos (2021-2022), a project by Picturae B.V., Sioux Technologies, Groninger Archieven, Noord-Hollands Archief and KB, national library of the Netherlands. For more information about the project and its results, see https://www.krant-en-fotos.nl/ (website in Dutch).
About the poster
Question: How could we connect press photos to newspaper photos?
Method & techniques: An algorithm based on the convolutional neural network VGG16. Training set: pretrained on 1M+ images from ImageNet. Test set: 0.5M press photos, 0.25M newspaper photos.
Results: Software for image recognition (GitHub). Dataset with newspaper photos linked to press photos. 4% of photos connected with an accuracy of 99% = 30,000 connections. Demo at www.krant-en-fotos.nl with interactive connections between photos. Lessons learned, published in a whitepaper for the cultural heritage sector.
Future work: Higher link accuracy requires further development of the algorithm, among other things by using metadata. Scaling up: 5 archives with press photo collections have already shown interest in a follow-up. Digitisation of more newspapers is taking place right now. Implementing the functionality in existing newspaper and image repositories will make it more sustainable.
Poster design: Dorien Haagsma.
Database: OpenAIRE
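
The record names VGG16 as the basis of the algorithm but does not say how photos are matched. The sketch below illustrates one common approach and is an assumption, not the project's documented method: each photo is represented by a feature vector (for example, activations from a pretrained VGG16 layer), and a newspaper photo is linked to its most similar press photo only when the cosine similarity passes a threshold. The function names, the threshold value, and the toy vectors are all hypothetical.

```python
import numpy as np


def l2_normalize(features):
    """Scale each row to unit length so dot products equal cosine similarity."""
    norms = np.linalg.norm(features, axis=1, keepdims=True)
    return features / np.clip(norms, 1e-12, None)


def link_photos(newspaper_features, press_features, threshold=0.9):
    """Link each newspaper photo to its nearest press photo, if similar enough.

    Both inputs are 2-D arrays with one feature vector per row.
    The threshold is a hypothetical value, not one reported by the project.
    Returns a list of (newspaper_index, press_index, similarity) tuples.
    """
    news = l2_normalize(np.asarray(newspaper_features, dtype=np.float64))
    press = l2_normalize(np.asarray(press_features, dtype=np.float64))

    # Rows: newspaper photos; columns: press photos.
    similarity = news @ press.T
    best = similarity.argmax(axis=1)

    links = []
    for i, j in enumerate(best):
        score = float(similarity[i, j])
        if score >= threshold:
            links.append((i, int(j), score))
    return links


# Toy feature vectors standing in for real image features.
press = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
newspaper = [[0.9, 0.1, 0.0], [0.5, 0.5, 0.5]]
links = link_photos(newspaper, press, threshold=0.9)
```

A strict threshold trades coverage for precision: fewer photos get linked, but the links that remain are more likely to be correct, which is consistent in spirit with the small share of highly accurate connections the poster reports.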