English to Urdu transliteration: An application of Soundex algorithm
Autor: | Adil Masood Siddiqui, Muhammad Adeel Zahid, Naveed Iqbal Rao |
---|---|
Rok vydání: | 2010 |
Předmět: |
Computer science
business.industry Speech recognition computer.software_genre language.human_language Romanization Soundex Stress (linguistics) ComputingMethodologies_DOCUMENTANDTEXTPROCESSING Transliteration language Urdu Artificial intelligence Language translation business Algorithm computer Natural language processing Orthography Word (computer architecture) |
Zdroj: | 2010 International Conference on Information and Emerging Technologies. |
DOI: | 10.1109/iciet.2010.5625681 |
Popis: | Transliteration algorithms are used to convert Romanized form of Urdu in Urdu script. But the accuracy of such systems is greatly reduced by presence of English words like weak, next etc. in online conversations. In this paper we present dictionary based solution to convert English word to Urdu script. In doing so accent conversion problem may arise that is handled through Soundex based algorithm where relative positions of transcriptions and Urdu language rules are combined to assign codes to English words which are then mapped to Urdu script. We have integrated our work with an existing roman Urdu transliteration system and experimental results have proved the significance of our work both for standalone English transliteration and as a part of roman Urdu transliteration framework. |
Databáze: | OpenAIRE |
Externí odkaz: |