dc.contributor.author | Singh, A. | |
dc.contributor.author | Jha, S.S. | |
dc.date.accessioned | 2022-08-25T15:13:36Z | |
dc.date.available | 2022-08-25T15:13:36Z | |
dc.date.issued | 2022-08-25 | |
dc.identifier.uri | http://localhost:8080/xmlui/handle/123456789/3900 | |
dc.description.abstract | The deployment of multiple Unmanned Aerial Vehicles (UAVs) in constrained environments poses various challenges concerning trajectory optimization, target reachability, and collision avoidance. In this paper, we formulate multi-UAV navigation in constrained environments as a multi-agent learning problem. Further, we propose a reinforcement-learning-based Safe-MADDPG method to learn safe and cooperative multi-UAV navigation policies in a constrained environment. The safety constraints that handle inter-UAV collisions during navigation are modeled through action corrections of the learned autonomous navigation policies using an additional safety layer. We have implemented our proposed approach in the Webots simulator and provide a detailed analysis of the proposed solution. The results demonstrate that the proposed Safe-MADDPG approach is effective in learning safe actions for multi-UAV navigation in constrained environments. | en_US |
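The safety-layer idea summarized in the abstract (correcting each UAV's proposed action so that inter-UAV separation constraints hold before the action is executed) can be illustrated with a minimal sketch. The code below is an assumption-laden illustration, not the authors' implementation: the single-integrator motion model, the function name safe_correct, the min_separation parameter, and the iterative pairwise repair are all hypothetical choices made for clarity.

    # Hedged sketch of a safety layer for action correction (illustrative only).
    # Assumes velocity actions and a single-integrator motion model.
    import numpy as np

    def safe_correct(positions, actions, min_separation=1.0, dt=0.1, iters=20):
        """Adjust proposed velocity actions so that predicted next positions
        keep every UAV pair at least `min_separation` apart."""
        actions = actions.copy()
        n = positions.shape[0]
        for _ in range(iters):
            next_pos = positions + dt * actions      # predicted next positions
            violated = False
            for i in range(n):
                for j in range(i + 1, n):
                    diff = next_pos[i] - next_pos[j]
                    dist = np.linalg.norm(diff)
                    if dist < min_separation:
                        violated = True
                        # push the pair apart along their separating direction
                        direction = diff / (dist + 1e-8)
                        push = 0.5 * (min_separation - dist) / dt
                        actions[i] += push * direction
                        actions[j] -= push * direction
            if not violated:
                break
        return actions

    # Example: two UAVs on a head-on course; the corrected actions keep them apart.
    pos = np.array([[0.0, 0.0], [2.0, 0.0]])
    act = np.array([[1.0, 0.0], [-1.0, 0.0]])
    print(safe_correct(pos, act, min_separation=1.0, dt=0.8))

In the paper's setting, the uncorrected actions would come from the MADDPG actor networks; here they are supplied directly to keep the example self-contained.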
dc.language.iso | en_US | en_US |
dc.subject | Multi-agent system | en_US |
dc.subject | Policy gradient | en_US |
dc.subject | Reinforcement learning | en_US |
dc.subject | Safe navigation | en_US |
dc.subject | UAV | en_US |
dc.subject | Webots | en_US |
dc.title | Learning safe cooperative policies in autonomous multi-UAV navigation | en_US |
dc.type | Article | en_US |