Enhancing Object Detection for VIPs Using YOLOv4_Resnet101 and Text-to-Speech Conversion Model

Autor:	Tahani Jaser Alahmadi, Atta Ur Rahman, Hend Khalid Alkahtani, Hisham Kholidy
Jazyk:	angličtina
Rok vydání:	2023
Předmět:	object detection recognition tracking text-to-speech conversion YOLOv4_Resnet101 visual impairments Technology Science
Zdroj:	Multimodal Technologies and Interaction, Vol 7, Iss 8, p 77 (2023)
Druh dokumentu:	article
ISSN:	2414-4088
DOI:	10.3390/mti7080077
Popis:	Vision impairment affects an individual’s quality of life, posing challenges for visually impaired people (VIPs) in various aspects such as object recognition and daily tasks. Previous research has focused on developing visual navigation systems to assist VIPs, but there is a need for further improvements in accuracy, speed, and inclusion of a wider range of object categories that may obstruct VIPs’ daily lives. This study presents a modified version of YOLOv4_Resnet101 as backbone networks trained on multiple object classes to assist VIPs in navigating their surroundings. In comparison to the Darknet, with a backbone utilized in YOLOv4, the ResNet-101 backbone in YOLOv4_Resnet101 offers a deeper and more powerful feature extraction network. The ResNet-101’s greater capacity enables better representation of complex visual patterns, which increases the accuracy of object detection. The proposed model is validated using the Microsoft Common Objects in Context (MS COCO) dataset. Image pre-processing techniques are employed to enhance the training process, and manual annotation ensures accurate labeling of all images. The module incorporates text-to-speech conversion, providing VIPs with auditory information to assist in obstacle recognition. The model achieves an accuracy of 96.34% on the test images obtained from the dataset after 4000 iterations of training, with a loss error rate of 0.073%.
Databáze:	Directory of Open Access Journals
Externí odkaz:	https://doaj.org/article/3e2203c5496148d6aea6d7983f5d8209 Zobrazit plný text záznamu View record in DOAJ