Výsledky vyhledávání - "Marc Kupietz"

Akademický článek

Introducing DeReKoGram: A Novel Frequency Dataset with Lemma and Part-of-Speech Information for German

Autor: Sascha Wolfer, Alexander Koplenig, Marc Kupietz, Carolin Müller-Spitzer

Publikováno v: Data, Vol 8, Iss 11, p 170 (2023)

We introduce DeReKoGram, a novel frequency dataset containing lemma and part-of-speech (POS) information for 1-, 2-, and 3-grams from the German Reference Corpus. The dataset contains information based on a corpus of 43.2 billion tokens and is divide

Externí odkaz: https://doaj.org/article/dc7024147ee8482ca8ab9949e102628e

Zobrazit plný text záznamu

Elektronická kniha

Korpuslinguistik

Autor: Marc Kupietz, Thomas Schmidt

Der Band nimmt eine Bestandsaufnahme zu Grundlagen, Methodik, Werkzeugen und Anwendungsfeldern der Korpuslinguistik mit Fokus auf die germanistische Sprachwissenschaft vor. Die Beiträge stellen den aktuellen Forschungsstand sowohl im Bereich schrift

Zobrazit plný text záznamu

Das Gesamtkonzept des Deutschen Referenzkorpus DeReKo

Autor: Marc Kupietz, Harald Lüngen, Nils Diewald

Publikováno v: Korpora in der germanistischen Sprachwissenschaft ISBN: 9783111085708

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::717547d73e840006dbcbb0eaac381c76
https://doi.org/10.1515/9783111085708-002

Zobrazit plný text záznamu

Building Paths to Corpus Data

Autor: Marc Kupietz, Nils Diewald, Eliza Margaretha

Publikováno v: CLARIN ISBN: 9783110767377

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::b7b0d3512a3870c5cc686eb1602e9f36
https://doi.org/10.1515/9783110767377-007

Zobrazit plný text záznamu

Autor: Pawel Kamocki, Vanessa Hannesschläger, Esther Hoorn, Aleksei Kelli, Marc Kupietz, Krister Lindén, Andrius Puksas

Publikováno v: University of Helsinki

Twitter data is used in a wide variety of research disciplines in Social Sciences and Humanities. Although most Twitter data is publicly available, its re-use and sharing raise many legal questions related to intellectual property and personal data p

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::a583adb24c8b903574f917acab07c784
http://hdl.handle.net/10138/350015

Zobrazit plný text záznamu

Testing the Relationship between Word Length, Frequency, and Predictability Based on the German Reference Corpus

Autor: Alexander Koplenig, Marc Kupietz, Sascha Wolfer

Publikováno v: Cognitive scienceReferences. 46(6)

In a recent article, Meylan and Griffiths (MeylanGriffiths, 2021, henceforth, MG) focus their attention on the significant methodological challenges that can arise when using large-scale linguistic corpora. To this end, MG revisit a well-known result

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7f807b51d5198c0cd7a2a67f07689256
https://pubmed.ncbi.nlm.nih.gov/35661231

Zobrazit plný text záznamu

Von monolingualen Korpora über Parallelund Vergleichskorpora zum Europäischen Referenzkorpus EuReCo

Autor: Beata Trawiński, Marc Kupietz

Publikováno v: Deutsch in Europa ISBN: 9783110731514
Deutsch in Europa

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::7fa7bce29b338a09588a066a4bd3ab55
https://doi.org/10.1515/9783110731514-012

Zobrazit plný text záznamu

KorAP und EuReCo – Recherchieren in mehrsprachigen vergleichbaren Korpora

Autor: Helge Stallkamp, Marc Kupietz, Franck Bodmer, Eliza Margaretha, Elena Irimia, Nils Diewald, Peter Harders

Publikováno v: Deutsch in Europa ISBN: 9783110731514

Die Korpusanalyseplattform KorAP ist von Grund auf sprachenunabhangig konzipiert. Dies gilt sowohl in Bezug auf die Lokalisierung der Benutzeroberflache als auch hinsichtlich unterschiedlicher Anfragesprachen und der Unterstutzung fremdsprachiger Kor

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::95ced0ca456c8b2cdef43589b1ea273e
https://doi.org/10.1515/9783110731514-014

Zobrazit plný text záznamu

Data-driven Identification of Idioms in Song Lyrics

Autor: Roman Schneider, Miriam Amin, Marc Kupietz, Peter Fankhauser

Publikováno v: Proceedings of the 17th Workshop on Multiword Expressions (MWE 2021).

The automatic recognition of idioms poses a challenging problem for NLP applications. Whereas native speakers can intuitively handle multiword expressions whose compositional meanings are hard to trace back to individual word semantics, there is stil

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::09e510467f22e6ca0b02113849249473
https://doi.org/10.18653/v1/2021.mwe-1.3

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání