Zobrazeno 1 - 10
of 97
pro vyhledávání: '"Zaraket, Fadi"'
Autor:
Kiulian, Artur, Polishko, Anton, Khandoga, Mykola, Kostiuk, Yevhen, Gabrielli, Guillermo, Gagała, Łukasz, Zaraket, Fadi, Obaida, Qusai Abu, Garud, Hrishikesh, Mak, Wendy Wing Yee, Chaplynskyi, Dmytro, Amor, Selma Belhadj, Peradze, Grigol
In this paper, we propose a model-agnostic cost-effective approach to developing bilingual base large language models (LLMs) to support English and any target language. The method includes vocabulary expansion, initialization of new embeddings, model
Externí odkaz:
http://arxiv.org/abs/2410.18836
Autor:
Haidar, Nawal, Zaraket, Fadi A.
Back-of-the-book indexes are crucial for book readability. Their manual creation is laborious and error prone. In this paper, we consider automating back-of-the-book index extraction for Arabic books to help simplify both the creation and review task
Externí odkaz:
http://arxiv.org/abs/2410.10286
This paper presents Nabra, a corpora of Syrian Arabic dialects with morphological annotations. A team of Syrian natives collected more than 6K sentences containing about 60K words from several sources including social media posts, scripts of movies a
Externí odkaz:
http://arxiv.org/abs/2310.17315
This article presents morphologically-annotated Yemeni, Sudanese, Iraqi, and Libyan Arabic dialects Lisan corpora. Lisan features around 1.2 million tokens. We collected the content of the corpora from several social media platforms. The Yemeni corpu
Externí odkaz:
http://arxiv.org/abs/2212.06468
Publikováno v:
In Proceedings of the International Conference on Language Resources and Evaluation (LREC 2022), Marseille, France. (2022)
The processing of the Arabic language is a complex field of research. This is due to many factors, including the complex and rich morphology of Arabic, its high degree of ambiguity, and the presence of several regional varieties that need to be proce
Externí odkaz:
http://arxiv.org/abs/2205.09692
We present CFAAR, a program repair assistance technique that operates by selectively altering the outcome of suspicious predicates in order to yield expected behavior. CFAAR is applicable to defects that are repairable by negating predicates under sp
Externí odkaz:
http://arxiv.org/abs/1808.09229
Autor:
Jaber, Amin, Zaraket, Fadi A.
Publikováno v:
Traitement Automatique des Langues (TAL). 58.3 (2017): 97-121
Rule-based techniques to extract relational entities from documents allow users to specify desired entities with natural language questions, finite state automata, regular expressions and structured query language. They require linguistic and program
Externí odkaz:
http://arxiv.org/abs/1709.05700
Oracles used for testing graphical user interface (GUI) programs are required to take into consideration complicating factors such as variations in screen resolution or color scheme when comparing observed GUI elements to expected GUI elements. Resea
Externí odkaz:
http://arxiv.org/abs/1607.01723
Autor:
Zaraket, Fadi A.
Thesis (Ph. D.)--University of Texas at Austin, 2007.
Vita. Includes bibliographical references.
Vita. Includes bibliographical references.