A convolutional recursive deep architecture for unconstrained Urdu handwriting recognition
Author: | Syed Muhammad Kumail Raza, Faisal Shafait, Muhammad Mubasher Khan, Adnan Ul-Hasan, Noor ul Sehr Zia, Muhammad Ferjad Naeem |
---|---|
Year of publication: | 2021 |
Subject: |
Character (computing); Computer science; Generalization; Population; Word error rate; ComputingMethodologies_PATTERNRECOGNITION; Artificial intelligence; Handwriting recognition; ComputingMethodologies_DOCUMENTANDTEXTPROCESSING; Urdu; Language model; Architecture; Software; Natural language processing |
Source: | Neural Computing and Applications. 34:1635-1648 |
ISSN: | 1433-3058 0941-0643 |
DOI: | 10.1007/s00521-021-06498-2 |
Description: | Offline handwriting recognition for Urdu, a language with around 200 million users that is written in the Nastaleeq script, has long been a challenge for the research community. The key problems are the recognition of complex ligature shapes and the lack of publicly available datasets. This paper addresses both problems by (i) proposing an end-to-end handwriting recognition system based on a new CNN-RNN architecture with n-gram language modeling, and (ii) presenting a new unconstrained dataset called NUST-UHWR. We compiled the first unconstrained Urdu handwritten dataset from around 1000 writers of diverse backgrounds, ages, and genders. The text in this dataset was carefully selected from seven different fields to ensure the presence of words commonly used in different domains. The model architecture is capable of incorporating the fine-grained features necessary for handwritten text recognition in languages with complex ligatures. Our method addresses the limitations of existing architectures and achieves a minimum character error rate of 5.28% on Urdu handwriting recognition (UHWR), establishing a new state of the art. The paper further demonstrates the generalization ability of the proposed model by training it on English and bilingual (Urdu and English) handwritten data. |
Database: | OpenAIRE |