Autor: |
Khan Bahadar, Riaz Ahmad, Khursheed Aurangzeb, Siraj Muhammad, Khalil Ullah, Ibrar Hussain, Ikram Syed, Muhammad Shahid Anwar |
Jazyk: |
angličtina |
Rok vydání: |
2024 |
Předmět: |
|
Zdroj: |
PeerJ Computer Science, Vol 10, p e2089 (2024) |
Druh dokumentu: |
article |
ISSN: |
2376-5992 |
DOI: |
10.7717/peerj-cs.2089 |
Popis: |
Layout analysis is the main component of a typical Document Image Analysis (DIA) system and plays an important role in pre-processing. However, regarding the Pashto language, the document images have not been explored so far. This research, for the first time, examines Pashto text along with graphics and proposes a deep learning-based classifier that can detect Pashto text and graphics per document. Another notable contribution of this research is the creation of a real dataset, which contains more than 1,000 images of the Pashto documents captured by a camera. For this dataset, we applied the convolution neural network (CNN) following a deep learning technique. Our intended method is based on the development of the advanced and classical variant of Faster R-CNN called Single-Shot Detector (SSD). The evaluation was performed by examining the 300 images from the test set. Through this way, we achieved a mean average precision (mAP) of 84.90%. |
Databáze: |
Directory of Open Access Journals |
Externí odkaz: |
|