Showing 1 - 10 of 32 for search: '"Houda Bouamor"'
Published in:
New York University Scholars
We present the findings and results of the Second Nuanced Arabic Dialect Identification Shared Task (NADI 2021). This Shared Task includes four subtasks: country-level Modern Standard Arabic (MSA) identification (Subtask 1.1), country-level dialect i
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::5e2daf52dcb32a287ac578d417c2c1ba
Authors:
Karim Bouzoubaa, Nizar Habash, Wassim El-Hajj, Samhaa R. El-Beltagy, Huseein T. Al-Natsheh, Mourad Abbas, Hamdy Mubarak, Houda Bouamor, Hend S. Al-Khalifa, Mustafa Jarrar, Violetta Cavalli-Sforza, Kareem Darwish
Published in:
New York University Scholars
The term natural language refers to any system of symbolic communication (spoken, signed or written) without intentional human planning and design. This distinguishes natural languages such as Arabic and Japanese from artificially constructed languag
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::b129996f740e71c0d74fe30b93a1ca22
Published in:
New York University Scholars
We present the results and findings of the First Nuanced Arabic Dialect Identification Shared Task (NADI). This Shared Task includes two subtasks: country-level dialect identification (Subtask 1) and province-level sub-dialect identification (Subtask
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::312b80c25b4800d8fd0670218b25ea2d
Published in:
Proceedings of the First Workshop on Gender Bias in Natural Language Processing.
The impressive progress in many Natural Language Processing (NLP) applications has increased the awareness of some of the biases these NLP systems have with regard to gender identities. In this paper, we propose an approach to extend biased single-o
Published in:
WANLP@ACL 2019
Scopus-Elsevier
New York University Scholars
In this paper, we present the results and findings of the MADAR Shared Task on Arabic Fine-Grained Dialect Identification. This shared task was organized as part of The Fourth Arabic Natural Language Processing Workshop, collocated with ACL 2019. The
Published in:
NAACL-HLT (Demonstrations)
This demo paper describes ADIDA, a web-based system for automatic dialect identification for Arabic text. The system distinguishes among the dialects of 25 Arab cities (from Rabat to Muscat) in addition to Modern Standard Arabic. The results are pres
Published in:
Proceedings of the 16th Workshop on Computational Research in Phonetics, Phonology, and Morphology.
We present de-lexical segmentation, a linguistically motivated alternative to greedy or other unsupervised methods, requiring only minimal language specific input. Our technique involves creating a small grammar of closed-class affixes which can be w
Published in:
New York University Scholars
We present the second ever evaluated Arabic dialect-to-dialect machine translation effort, and the first to leverage external resources beyond a small parallel corpus. The subject has not previously received serious attention due to lack of naturally
External link:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::88535ae60d9077cde3b064db8dec2a3f
Published in:
ACM Transactions on Intelligent Systems and Technology. 4:1-27
This work uses parallel monolingual corpora for a detailed study of the task of sub-sentential paraphrase acquisition. We argue that the scarcity of this type of resource is compensated by the fact that it is the most suited type for studies on parap
Authors:
Ossama Obeid, Mona Diab, Abdelati Hawwari, Kemal Oflazer, Wajdi Zaghouani, Houda Bouamor, Mahmoud Ghoneim
Published in:
Qatar Foundation Annual Research Conference Proceedings Volume 2016 Issue 1.
One of the characteristics of writing in Modern Standard Arabic (MSA) is that the commonly used orthography is mostly consonantal and does not provide full vocalization of the text. It sometimes includes optional diacritical marks (henceforth, diacri