Zobrazeno 1 - 10
of 14
pro vyhledávání: '"Marco Büchler"'
Autor:
Péter Király, Marco Büchler
Publikováno v:
Digitális Bölcsészet, Iss 2 (2019)
Az Europeana – a kulturális örökség európai digitális platformja – több, mint 3200 adatszolgáltatótól beérkező metaadatrekord gyűjteménye a rekordok jellemzőit tekintve meglehetősen heterogén. A rekordok eredeti típusa és konte
Externí odkaz:
https://doaj.org/article/946c0aee2410435c96b73ec6d9eeac70
Publikováno v:
Digital Scholarship in the Humanities. 34:i135-i141
Digital Humanities (DH) within Coptic Studies, an emerging field of development, will be much aided by the digitization of large quantities of typeset Coptic texts. Until recently, the only Optical Character Recognition (OCR) analysis of printed Copt
Data in and for religion is arguably as old as humanity. Religious significance has been attached to an immense variety of artifacts and documents, often in written form, in nearly all spoken and written languages over the past millennia. The rise of
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::e8e7b38f915a8d21f8334115dffcb10d
https://hdl.handle.net/11380/1236710
https://hdl.handle.net/11380/1236710
Publikováno v:
DATeCH
In this paper, we show that the OCR engine Ocropy can be trained for fonts used in rather complex and varied Coptic typeset. For each of the three fonts presented in this paper, we used a number of texts from scholarly editions with different philolo
Autor:
Péter Király, Marco Büchler
Publikováno v:
IEEE BigData
Europeana, the European digital platform for cultural heritage, has a heterogeneous collection of metadata records ingested from more than 3200 data providers. The original nature and context of these records were different. In order to create effect
Publikováno v:
Proceedings of the Fifth Italian Conference on Computational Linguistics CLiC-it 2018 ISBN: 9788831978682
CLiC-it
Scopus-Elsevier
CLiC-it
Scopus-Elsevier
This article describes a computational text reuse study on Latin texts designed to evaluate the performance of TRACER, a language-agnostic text reuse detection engine. As a case study, we use the Index Thomisticus as a gold standard to measure the pe
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::cee5c88d1f4bcbc41a0d94aff530ee62
http://hdl.handle.net/10807/127871
http://hdl.handle.net/10807/127871
Publikováno v:
IEEE BigData
From 2004 to 2016 the Leipzig Linguistic Services (LLS) existed as a SOAP-based cyberinfrastructure of atomic micro-services for the Wortschatz project, which covered different-sized textual corpora in more than 230 languages. The LLS were developed
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::772a774d24d1a8848d10cb0c3370b497
http://hdl.handle.net/10807/127327
http://hdl.handle.net/10807/127327
Publikováno v:
EMNLP
“How to be a knowledge scientist after the Snowden revelations?” is a question we all have to ask as it becomes clear that our work and our students could be involved in the building of an unprecedented surveillance society. In this essay, we arg
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::d1b4d080e4feb053241352ce610e1578
https://lirias.kuleuven.be/handle/123456789/504185
https://lirias.kuleuven.be/handle/123456789/504185
Publikováno v:
Communications in Computer and Information Science ISBN: 9783319251165
VISIGRAPP (Selected Papers)
VISIGRAPP (Selected Papers)
We present various visualizations for the Text Re-use found among texts of a collection to support answering a broad palette of research questions in the humanities. When juxtaposing all texts of a corpus in form of tuples, we propose the Text Re-use
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::89acf6d020a3862587813b76d2375400
https://doi.org/10.1007/978-3-319-25117-2_10
https://doi.org/10.1007/978-3-319-25117-2_10