Content-Aware Video Analysis to Guide Visually Impaired Walking on the Street

Authors:	Timothy K. Shih, Chih-Yang Lin, Ervin Yohannes
Publication year:	2019
Subject:	Class (computer programming); Visually impaired; Computer science; Deep learning; Networking & telecommunications; Engineering and technology; Object (computer science); Human–computer interaction; Face (geometry); Assistive technology; Electrical engineering, electronic engineering, information engineering; Artificial intelligence & image processing; Segmentation; Artificial intelligence
Source:	Advances in Visual Informatics (IVIC), ISBN: 9783030340315
DOI:	10.1007/978-3-030-34032-2_1
Description:	Although many researchers have developed systems and tools to assist blind and visually impaired people, these people continue to face many obstacles in daily life, especially in outdoor environments. When people with visual impairments walk outdoors, they must be informed of objects in their surroundings. However, it is challenging to develop a system that can handle the related tasks. In recent years, deep learning has enabled the development of many architectures with more accurate results than traditional machine learning. One popular model for instance segmentation is Mask-RCNN, which can segment and recognize objects rapidly. We use Mask-RCNN to develop a content-aware video analysis that can help blind and visually impaired people recognize objects in their surroundings. Moreover, we use the Mask-RCNN outputs to provide the distance between the subject and the object, as well as the object's relative speed and direction. The results of our content-aware video analysis include the name of the object, the object's class score, the distance between the person and the object, the speed of the object, and the object's direction.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_________::5422bdae484b8fdaac4be2534f1a9614 https://doi.org/10.1007/978-3-030-34032-2_1