Zobrazeno 1 - 10
of 17
pro vyhledávání: '"Armin Hoenen"'
Publikováno v:
Umanistica Digitale, Vol 4, Iss 8 (2020)
Since the seminal publication of “Web as Corpus” [1], the potential of creating corpora from the web has been realized for good for the creation of both online and offline corpora: noisy vs. clean, balanced vs. convenient, annotated vs. raw, smal
Externí odkaz:
https://doaj.org/article/cec6496f960d4555b4cfb3d7fa2f42b3
Autor:
Rüdiger Gleim, Steffen Eger, Alexander Mehler, Tolga Uslu, Wahed Hemati, Andy Lücking, Alexander Henlein, Sven Kahlsdorf, Armin Hoenen
Publikováno v:
Journal of Language Modelling, Vol 7, Iss 1, Pp 1–52-1–52 (2019)
The challenge of POS tagging and lemmatization in morphologically rich languages is examined by comparing German and Latin. We start by defining an NLP evaluation roadmap to model the combination of tools and resources guiding our experiments. We foc
Externí odkaz:
https://doaj.org/article/8a624d9a53a84a728580d8fdf5f62880
Autor:
Armin Hoenen
Publikováno v:
Umanistica Digitale, Vol 3, Iss 5 (2019)
In this contribution, two open problems in computational stemmatology are being considered. The first one is contamination, an umbrella term referring to all phenomena of admixture of text variants resulting from scribes considering more than one man
Externí odkaz:
https://doaj.org/article/320b06437aad4f3ca95c50b063ac41a4
Autor:
Armin Hoenen
Publikováno v:
Beiträge zur Geschichte der deutschen Sprache und Literatur. 143:276-279
Autor:
Armin Hoenen, Marc Daniel Rahn
Publikováno v:
Proceedings of the Workshop on Computational Methods for Endangered Languages. 2
This paper reviews the Wikipedias for the smallest languages it is available for. We compute the most frequent shared articles (by automatically extracting their translation links), categories and other features. By analysing these data and aligning
Autor:
Armin Hoenen, Lela Samushia
Publikováno v:
Journal for Language Technology and Computational Linguistics. 31:25-38
Autor:
Armin Hoenen
Examensarbeit aus dem Jahr 2011 im Fachbereich Sprachwissenschaft / Sprachforschung (fachübergreifend), Note: 1,3, Johannes Gutenberg-Universität Mainz (ZWW), Veranstaltung: Sprachandragogik, Sprache: Deutsch, Abstract: Diese Abschlussarbeit des Zu
Autor:
Armin Hoenen
Publikováno v:
Natural Language Processing and Information Systems ISBN: 9783319595689
NLDB
NLDB
In this paper, word embeddings are used for the task of supervised authorship attribution. While previous methods have for instance been looking at characters (n-grams), syntax and most importantly token frequencies, the method presented focusses on
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::725dde3f404686dc3a40db435faa4f0f
https://doi.org/10.1007/978-3-319-59569-6_33
https://doi.org/10.1007/978-3-319-59569-6_33