Statistical machine translation proposal for Uzbek to English

Autor: Alisher Shakirovich Ismailov, Gulshoda Shamsiyeva, Nilufar Abdurakhmonova
Jazyk: angličtina
Rok vydání: 2021
Předmět:
Zdroj: Science and Education, Vol 2, Iss 12, Pp 212-219 (2021)
ISSN: 2181-0842
Popis: The machine translation means is a translating one natural language to another natural language automatically [1]. The machine translation is one of the major and the most active areas in natural language processing. The last decade have seen the rise of the use of statistical approaches to the machine translation. The statistical machine translation approaches learn translation parameters automatically from alignment text rather than relying on rule-based approaches. There has been quite extensive work in statistical machine translation area for some language pairs. However, there are very limited research sources available for the Uzbek to English language pair [2]. In this paper, we propose statistical machine translation algorithm for Uzbek to English. The developing English to Uzbek statistical machine translation algorithm is an interesting obstacle from a number of perspectives. The most important challenge is that English and Uzbek are typologically distant languages. The English language has very limited morphology and Uzbek is an agglutinative language with a very rich and productive derivational and inflectional morphology. The Uzbek word structures that can correspond to complete phrases of several words in English when translated. In this paper, propose that will achieve Uzbek to English statistical machine translation algorithm using phrase-base model. Moreover, in order to achieve statistical machine translation we need to develop English-Uzbek corpora. In this paper, we present briefly about English-Uzbek corpora development.
Databáze: OpenAIRE