Výsledky vyhledávání - "Armin Hoenen"

Akademický článek

A Manual for Web Corpus Crawling of Low Resource Languages

Autor: Armin Hoenen, Cemre Koc, Marc Daniel Rahn

Publikováno v: Umanistica Digitale, Vol 4, Iss 8 (2020)

Since the seminal publication of “Web as Corpus” [1], the potential of creating corpora from the web has been realized for good for the creation of both online and offline corpora: noisy vs. clean, balanced vs. convenient, annotated vs. raw, smal

Externí odkaz: https://doaj.org/article/cec6496f960d4555b4cfb3d7fa2f42b3

Zobrazit plný text záznamu

Akademický článek

A practitioner’s view: a survey and comparison of lemmatization and morphological tagging in German and Latin

Autor: Rüdiger Gleim, Steffen Eger, Alexander Mehler, Tolga Uslu, Wahed Hemati, Andy Lücking, Alexander Henlein, Sven Kahlsdorf, Armin Hoenen

Publikováno v: Journal of Language Modelling, Vol 7, Iss 1, Pp 1–52-1–52 (2019)

The challenge of POS tagging and lemmatization in morphologically rich languages is examined by comparing German and Latin. We start by defining an NLP evaluation roadmap to model the combination of tools and resources guiding our experiments. We foc

Externí odkaz: https://doaj.org/article/8a624d9a53a84a728580d8fdf5f62880

Zobrazit plný text záznamu

Akademický článek

An open problem in computational stemmatology - a model for contamination

Autor: Armin Hoenen

Publikováno v: Umanistica Digitale, Vol 3, Iss 5 (2019)

In this contribution, two open problems in computational stemmatology are being considered. The first one is contamination, an umbrella term referring to all phenomena of admixture of text variants resulting from scribes considering more than one man

Externí odkaz: https://doaj.org/article/320b06437aad4f3ca95c50b063ac41a4

Zobrazit plný text záznamu

Hanne Martine Eckhoff, Silvia Luraghi u. Marco Passarotti (Hgg.): Diachronic Treebanks for Historical Linguistics, Amsterdam u. Philadelphia: John Benjamins 2020, 154 S. (Benjamins Current Topics 113)

Autor: Armin Hoenen

Publikováno v: Beiträge zur Geschichte der deutschen Sprache und Literatur. 143:276-279

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::197e4cd32d7eb637430903ec8fbb9059
https://doi.org/10.1515/bgsl-2021-0017

Zobrazit plný text záznamu

Migration of Small and Endangered Languages into the Wikipedia

Autor: Armin Hoenen, Marc Daniel Rahn

Publikováno v: Proceedings of the Workshop on Computational Methods for Endangered Languages. 2

This paper reviews the Wikipedias for the smallest languages it is available for. We compute the most frequent shared articles (by automatically extracting their translation links), categories and other features. By analysing these data and aligning

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::b3737e04ec18afbd57fea826504b9215
https://doi.org/10.33011/computel.v2i.987

Zobrazit plný text záznamu

8 Evolutionary models in other disciplines

Autor: Armin Hoenen

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::373b9d748f8e5eb722fe13956a8cfc1c
https://doi.org/10.1515/9783110684384-009

Zobrazit plný text záznamu

Gepi: An Epigraphic Corpus for Old Georgian and a Tool Sketch for Aiding Reconstruction

Autor: Armin Hoenen, Lela Samushia

Publikováno v: Journal for Language Technology and Computational Linguistics. 31:25-38

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::27458c9cd0844941ef9026d384c16116
https://doi.org/10.21248/jlcl.31.2016.210

Zobrazit plný text záznamu

Elektronická kniha

Der Fremdschrifterwerb. Eine Analyse verschiedener Sprachlehrbücher und Sprachkombinationen

Autor: Armin Hoenen

Examensarbeit aus dem Jahr 2011 im Fachbereich Sprachwissenschaft / Sprachforschung (fachübergreifend), Note: 1,3, Johannes Gutenberg-Universität Mainz (ZWW), Veranstaltung: Sprachandragogik, Sprache: Deutsch, Abstract: Diese Abschlussarbeit des Zu

Zobrazit plný text záznamu

How Many Stemmata with Root Degree k?

Autor: Ralf Gehrke, Armin Hoenen, Steffen Eger

Publikováno v: MOL

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::71fd7d427052dfbe5c855744b992cf7d
https://doi.org/10.18653/v1/w17-3402

Zobrazit plný text záznamu

Using Word Embeddings for Computing Distances Between Texts and for Authorship Attribution

Autor: Armin Hoenen

Publikováno v: Natural Language Processing and Information Systems ISBN: 9783319595689
NLDB

In this paper, word embeddings are used for the task of supervised authorship attribution. While previous methods have for instance been looking at characters (n-grams), syntax and most importantly token frequencies, the method presented focusses on

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::725dde3f404686dc3a40db435faa4f0f
https://doi.org/10.1007/978-3-319-59569-6_33

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání