Autonomous Navigation in Search and Rescue Simulated Environment using Deep Reinforcement Learning

Author:	Fatih Abut, Fatih Akay, Mohammed Abdeh
Publication year:	2021
Subject:	Computer Science, Artificial Intelligence; business.industry; Computer science; Deep Reinforcement Learning; Autonomous Navigation; Autonomous Search and Rescue; Simulation; Reinforcement learning; Building and Construction; Artificial intelligence; Electrical and Electronic Engineering; business; Search and rescue
Source:	Balkan Journal of Electrical and Computer Engineering, Volume: 9, Issue: 2, pp. 92-98
ISSN:	2147-284X
DOI:	10.17694/bajece.781162
Description:	Human-assisted search and rescue (SAR) robots are increasingly being used in zones of natural disasters, industrial accidents, and civil wars. Because of complex terrain, obstacles, and uncertainty about the time available, these robots need a certain level of autonomy to carry out some SAR tasks independently. One of these tasks is autonomous navigation. Previous approaches to developing autonomous or semi-autonomous SAR navigation robots use heuristics-based methods. These algorithms, however, require prior knowledge of the environment and sufficient sensing capabilities, which are hard to provide given the size and weight restrictions imposed by highly unstructured environments such as collapsed buildings. This study approaches the problem of autonomous navigation using a modified version of the Deep Q-Network algorithm. Unlike the classical approach of training the agent on images of the entire game screen, our approach uses only the images captured by the agent's low-resolution camera to train the agent to navigate through an arena, avoid obstacles, and reach a victim. This is a much more relevant way of making decisions in complex, uncertain contexts, since in real-world SAR scenarios it is almost impossible for SAR teams to have full information about the area. We simulated a SAR scenario consisting of an arena full of randomly generated obstacles, a victim, and an autonomous SAR robot. The simulation results show that the agent was able to reach the victim in 56% of the evaluation episodes after 400 episodes of training.
Database:	OpenAIRE
External link:	https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1dd9382b3c871a8c110dc5b6ee26018b https://doi.org/10.17694/bajece.781162