Learning safe cooperative policies in autonomous multi-UAV navigation

Please use this identifier to cite or link to this item: http://dspace.iitrpr.ac.in:8080/xmlui/handle/123456789/3900

Title:	Learning safe cooperative policies in autonomous multi-UAV navigation
Authors:	Singh, A. Jha, S.S.
Keywords:	Multi-agent system Policy gradient Reinforcement learning Safe navigation UAV Webots
Issue Date:	25-Aug-2022
Abstract:	The deployment of multiple Unmanned Aerial Vehicles (UAV) in constrained environments has various challenges concerning trajectory optimization with the target(s) reachability and collisions. In this paper, we formulate multi-UAV navigation in constrained environments as a multi-agent learning problem. Further, we propose a reinforcement learning based Safe-MADDPG method to learn safe and cooperative multi-UAV navigation policies in a constrained environment. The safety constraints to handle inter-UAV collisions during navigation are modeled through action corrections of the learned autonomous navigation policies using an additional safety layer. We have implemented our proposed approach on the Webots Simulator and provided a detailed analysis of the proposed solution. The results demonstrate that the proposed Safe-MADDPG approach is effective in learning safe actions for multi-UAV navigation in constrained environments.
URI:	http://localhost:8080/xmlui/handle/123456789/3900
Appears in Collections:	Year-2021

Files in This Item:

File	Description	Size	Format
Full Text.pdf		620.1 kB	Adobe PDF	View/Open Request a copy

DSpace JSPUI