impresso Text Reuse at Scale. An interface for the exploration of text reuse data in semantically enriched historical newspapers.
Author: | Düring M; Digital History & Historiography, Luxembourg Centre for Contemporary and Digital History, Esch-sur-Alzette, Luxembourg., Romanello M; Institute of Archeology and Classical Studies (ASA), University of Lausanne, Lausanne, Switzerland., Ehrmann M; DHLAB, École Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland., Beelen K; Digital Humanities Research Hub, School of Advanced Study, University of London, London, United Kingdom., Guido D; Digital Research Infrastructure, Luxembourg Centre for Contemporary and Digital History, Esch-sur-Alzette, Luxembourg., Deseure B; Royal Library of Belgium, Brussels, Belgium., Bunout E; Contemporary History of Luxembourg, Luxembourg Centre for Contemporary and Digital History, Esch-sur-Alzette, Luxembourg., Keck J; German Historical Institute Washington, Washington, DC, United States., Apostolopoulos P; Digital History & Historiography, Luxembourg Centre for Contemporary and Digital History, Esch-sur-Alzette, Luxembourg. |
---|---|
Language: | English |
Source: | Frontiers in big data [Front Big Data] 2023 Nov 03; Vol. 6, pp. 1249469. Date of Electronic Publication: 2023 Nov 03 (Print Publication: 2023). |
DOI: | 10.3389/fdata.2023.1249469 |
Abstract: | Text reuse reveals meaningful reiterations of text in large corpora. Humanities researchers use text reuse, e.g., to study the later reception of influential texts or to reveal evolving publication practices of historical media. This research is often supported by interactive visualizations which highlight relations and differences between text segments. In this paper, we build on earlier work in this domain. We present impresso Text Reuse at Scale, to our knowledge the first interface which integrates text reuse data with other forms of semantic enrichment to enable a versatile and scalable exploration of intertextual relations in historical newspaper corpora. The Text Reuse at Scale interface was developed as part of the impresso project and combines powerful search and filter operations with close and distant reading perspectives. We integrate text reuse data with enrichments derived from topic modeling, named entity recognition and classification, and language and document type detection, as well as a rich set of newspaper metadata. We report on historical research objectives and common user tasks for the analysis of historical text reuse data and present the prototype interface together with the results of a user evaluation. Competing Interests: The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest. (Copyright © 2023 Düring, Romanello, Ehrmann, Beelen, Guido, Deseure, Bunout, Keck and Apostolopoulos.) |
Database: | MEDLINE |