Showing 1 - 10 of 23 for search: '"Özlem Çetinoğlu"'
Published in:
Array, Vol 12, Iss , Pp 100104- (2021)
Multilingual speakers tend to mix different languages in text and speech, a phenomenon referred to by linguists as “code-switching” (CS). Speakers also switch between morphemes from various languages within the same word (intra-word CS). User-genera
External link:
https://doaj.org/article/a5ec6db515d345e290c61a851789e27e
Author:
Özlem Çetinoğlu, Çağrı Çöltekin
Published in:
Language Resources and Evaluation. 57:545-579
This paper presents the SAGT Turkish–German code-switching treebank, and observations and annotation challenges we encountered during its development. The treebank consists of transcriptions of bilingual conversations annotated with several layers:
Published in:
Language resources and evaluation.
This paper presents a comprehensive survey of corpora and lexical resources available for Turkish. We review a broad range of resources, focusing on the ones that are publicly available. In addition to providing information about the available lingui
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::02cf995cf6ca8abcf37ecf297bc688df
Published in:
Findings of the Association for Computational Linguistics: NAACL 2022.
Published in:
Language Resources and Evaluation. 57:489-489
Published in:
Journal of Intelligent & Fuzzy Systems. 36:4921-4929
Author:
Rob van der Goot, Özlem Çetinoğlu
Published in:
van der Goot, R & Çetinoğlu, Ö 2021, Lexical Normalization for Code-switched Data and its Effect on POS Tagging. In Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics: Main Volume. Association for Computational Linguistics, pp. 2352-2365. <https://www.aclweb.org/anthology/2021.eacl-main.200.pdf>
EACL
Lexical normalization, the translation of non-canonical data to standard language, has been shown to improve the performance of many natural language processing tasks on social media. Yet, using multiple languages in one utterance, also called code-switchi
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c60ab9962d3141da8d5e6fd9736373c0
https://pure.itu.dk/ws/files/85931715/2021.eacl_main.200.pdf
Author:
Agnieszka Falenska, Özlem Çetinoğlu
Published in:
Proceedings of the 3rd Workshop on Gender Bias in Natural Language Processing.
Potential gender biases existing in Wikipedia’s content can contribute to biased behaviors in a variety of downstream NLP systems. Yet, efforts in understanding what inequalities in portraying women and men occur in Wikipedia focused so far only on
Author:
Şaziye Betül Özateş, Özlem Çetinoğlu
Published in:
Proceedings of the Fifth Workshop on Computational Approaches to Linguistic Code-Switching.
Morphological tagging of code-switching (CS) data becomes more challenging especially when language pairs composing the CS data have different morphological representations. In this paper, we explore a number of ways of implementing a language-aware