Výsledky vyhledávání - "Marco Büchler"

Akademický článek

A teljesség minőségjelzőként való mérése az Europeanában

Publikováno v: Digitális Bölcsészet, Iss 2 (2019)

Az Europeana – a kulturális örökség európai digitális platformja – több, mint 3200 adatszolgáltatótól beérkező metaadatrekord gyűjteménye a rekordok jellemzőit tekintve meglehetősen heterogén. A rekordok eredeti típusa és konte

Externí odkaz: https://doaj.org/article/946c0aee2410435c96b73ec6d9eeac70

Zobrazit plný text záznamu

Optical character recognition of typeset Coptic text with neural networks

Autor: Kirill Bulert, Heike Behlmer, So Miyagawa, Marco Büchler

Publikováno v: Digital Scholarship in the Humanities. 34:i135-i141

Digital Humanities (DH) within Coptic Studies, an emerging field of development, will be much aided by the digitization of large quantities of typeset Coptic texts. Until recently, the only Optical Character Recognition (OCR) analysis of printed Copt

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::d0741e4a9c6ec69f27e76fd025d78462
https://doi.org/10.1093/llc/fqz023

Zobrazit plný text záznamu

Towards Big Religious Data: RESILIENCE research infrastructure for data on religion in the digital age

Autor: Sarah Riegert, Francesca Cadeddu, Marco Büchler, Federico Alpi

Data in and for religion is arguably as old as humanity. Religious significance has been attached to an immense variety of artifacts and documents, often in written form, in nearly all spoken and written languages over the past millennia. The rise of

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e8e7b38f915a8d21f8334115dffcb10d
https://hdl.handle.net/11380/1236710

Zobrazit plný text záznamu

Optical Character Recognition for Coptic fonts

Autor: Eliese-Sophia Lincke, Kirill Bulert, Marco Büchler

Publikováno v: DATeCH

In this paper, we show that the OCR engine Ocropy can be trained for fonts used in rather complex and varied Coptic typeset. For each of the three fonts presented in this paper, we used a number of texts from scholarly editions with different philolo

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::2531b94f894b7f2ffa89f6d01bdbc4fb
https://doi.org/10.1145/3322905.3322931

Zobrazit plný text záznamu

Measuring completeness as metadata quality metric in Europeana

Autor: Péter Király, Marco Büchler

Publikováno v: IEEE BigData

Europeana, the European digital platform for cultural heritage, has a heterogeneous collection of metadata records ingested from more than 3200 data providers. The original nature and context of these records were different. In order to create effect

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::035f3a5aff1a6870e7b3a81d5f81dbbb
https://doi.org/10.1109/bigdata.2018.8622487

Zobrazit plný text záznamu

Using and evaluating TRACER for an Index fontium computatus of the Summa contra Gentiles of Thomas Aquinas

Autor: Marco Büchler, Marco Carlo Passarotti, Maria Moritz, Greta Franzini

Publikováno v: Proceedings of the Fifth Italian Conference on Computational Linguistics CLiC-it 2018 ISBN: 9788831978682
CLiC-it
Scopus-Elsevier

This article describes a computational text reuse study on Latin texts designed to evaluate the performance of TRACER, a language-agnostic text reuse detection engine. As a case study, we use the Index Thomisticus as a gold standard to measure the pe

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::cee5c88d1f4bcbc41a0d94aff530ee62
http://hdl.handle.net/10807/127871

Zobrazit plný text záznamu

Mining and Analysing One Billion Requests to Linguistic Services

Autor: Thomas Eckart, Greta Franzini, Marco Büchler, Emily Franzini

Publikováno v: IEEE BigData

From 2004 to 2016 the Leipzig Linguistic Services (LLS) existed as a SOAP-based cyberinfrastructure of atomic micro-services for the Wortschatz project, which covered different-sized textual corpora in more than 230 languages. The LLS were developed

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::772a774d24d1a8848d10cb0c3370b497
http://hdl.handle.net/10807/127327

Zobrazit plný text záznamu

Non-Literal Text Reuse in Historical Texts: An Approach to Identify Reuse Transformations and its Application to Bible Reuse

Autor: Andreas Wiederhold, Yuri Bizzoni, Marco Büchler, Maria Moritz, Barbara Pavlek

Publikováno v: EMNLP

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::dc664a2c8b0d45da14426c6e79826213
https://doi.org/10.18653/v1/d16-1190

Zobrazit plný text záznamu

Is it research or is it spying? Thinking-through ethics in Big Data AI and other knowledge sciences

Autor: Marco Büchler, Geoffrey Rockwell, Bettina Berendt

“How to be a knowledge scientist after the Snowden revelations?” is a question we all have to ask as it becomes clear that our work and our students could be involved in the building of an unprecedented surveillance society. In this essay, we arg

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d1b4d080e4feb053241352ce610e1578
https://lirias.kuleuven.be/handle/123456789/504185

Zobrazit plný text záznamu

Designing Close and Distant Reading Visualizations for Text Re-use

Autor: Stefan Jänicke, Thomas Efer, Marco Büchler, Gerik Scheuermann

Publikováno v: Communications in Computer and Information Science ISBN: 9783319251165
VISIGRAPP (Selected Papers)

We present various visualizations for the Text Re-use found among texts of a collection to support answering a broad palette of research questions in the humanities. When juxtaposing all texts of a corpus in form of tuples, we propose the Text Re-use

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::89acf6d020a3862587813b76d2375400
https://doi.org/10.1007/978-3-319-25117-2_10

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání