Is Arabic punctuation rule-governed?
Autor: | Sane Yagi, Shehdeh Fareh, Ashraf Elnagar, Mariam Balajeed, Abdalla El-mneizel, Mohammad Al-Badawi |
---|---|
Jazyk: | angličtina |
Rok vydání: | 2024 |
Předmět: |
Arabic punctuation
sentence boundary identification text disambiguation machine translation punctuation standardization Jeroen van de Weijer College of International Studies Shenzhen University Shenzhen Guangdong China Fine Arts Arts in general NX1-820 General Works History of scholarship and learning. The humanities AZ20-999 |
Zdroj: | Cogent Arts & Humanities, Vol 11, Iss 1 (2024) |
Druh dokumentu: | article |
ISSN: | 23311983 2331-1983 |
DOI: | 10.1080/23311983.2024.2303818 |
Popis: | This paper investigates the extent to which Arabic punctuation is rule-governed, with the aim of improving text comprehension, disambiguation, and machine translation. The study highlights the lack of systematic punctuation in Arabic written discourse, which may be attributed to difficulties in sentence boundary identification or inadequate differentiation between various conjunctions. The punctuation behavior of Arabic speakers is examined in relation to sentence boundary identification and the level of agreement among Arabic specialists is assessed. A quantitative analysis of paragraph and sentence lengths across genres, categories of writers, and in comparison to English is conducted using five corpora specifically compiled for this study. Additionally, a punctuation survey is carried out to evaluate specialists’ agreement on sentence boundary identification. The results indicate that writers of Arabic interpret punctuation rules differently and that Arabic punctuation practice is irregular. The study suggests that standardization of Arabic punctuation rules is necessary to facilitate comprehension and automatic text processing. |
Databáze: | Directory of Open Access Journals |
Externí odkaz: |