Zobrazeno 1 - 10
of 33
pro vyhledávání: '"Çetinoğlu, Özlem"'
This paper presents a comprehensive survey of corpora and lexical resources available for Turkish. We review a broad range of resources, focusing on the ones that are publicly available. In addition to providing information about the available lingui
Externí odkaz:
http://arxiv.org/abs/2204.05042
Autor:
Sanguinetti, Manuela, Cassidy, Lauren, Bosco, Cristina, Çetinoğlu, Özlem, Cignarella, Alessandra Teresa, Lynn, Teresa, Rehbein, Ines, Ruppenhofer, Josef, Seddah, Djamé, Zeldes, Amir
This article presents a discussion on the main linguistic phenomena which cause difficulties in the analysis of user-generated texts found on the web and in social media, and proposes a set of annotation guidelines for their treatment within the Univ
Externí odkaz:
http://arxiv.org/abs/2011.02063
Canonical morphological segmentation consists of dividing words into their standardized morphemes. Here, we are interested in approaches for the task when training data is limited. We compare model performance in a simulated low-resource setting for
Externí odkaz:
http://arxiv.org/abs/2010.02804
Autor:
van der Goot, Rob, Çetinoğlu, Özlem
Lexical normalization, the translation of non-canonical data to standard language, has shown to improve the performance of manynatural language processing tasks on social media. Yet, using multiple languages in one utterance, also called code-switchi
Externí odkaz:
http://arxiv.org/abs/2006.01175
Language identification for code-switching (CS), the phenomenon of alternating between two or more languages in conversations, has traditionally been approached under the assumption of a single language per token. However, if at least one language is
Externí odkaz:
http://arxiv.org/abs/1904.01989
Publikováno v:
In Array December 2021 12
This paper addresses challenges of Natural Language Processing (NLP) on non-canonical multilingual data in which two or more languages are mixed. It refers to code-switching which has become more popular in our daily life and therefore obtains an inc
Externí odkaz:
http://arxiv.org/abs/1610.02213
Autor:
Çetinoğlu, Özlem
Publikováno v:
Encyclopedia of Turkic Languages and Linguistics Online
Externí odkaz:
https://doi.org/10.1163/2667-3029_ETLO_SIM_032592
Publikováno v:
LANGUAGE RESOURCES AND EVALUATION
This paper presents a comprehensive survey of corpora and lexical resources available for Turkish. We review a broad range of resources, focusing on the ones that are publicly available. In addition to providing information about the available lingui
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=od_______330::0d01c3a174becdf0023eb53f38b367f1
https://biblio.ugent.be/publication/8765352/file/8765354
https://biblio.ugent.be/publication/8765352/file/8765354
Autor:
Sanguinetti, Manuela, Bosco, Cristina, Cassidy, Lauren, Çetinoğlu, Özlem, Cignarella, Alessandra Teresa, Lynn, Teresa, Rehbein, Ines, Ruppenhofer, Josef, Seddah, Djamé, Zeldes, Amir
Publikováno v:
Language Resources & Evaluation; Jun2023, Vol. 57 Issue 2, p493-544, 52p