INSTITUTIONAL DIGITAL REPOSITORY

Learning safe cooperative policies in autonomous multi-UAV navigation

Show simple item record

dc.contributor.author Singh, A.
dc.contributor.author Jha, S.S.
dc.date.accessioned 2022-08-25T15:13:36Z
dc.date.available 2022-08-25T15:13:36Z
dc.date.issued 2022-08-25
dc.identifier.uri http://localhost:8080/xmlui/handle/123456789/3900
dc.description.abstract The deployment of multiple Unmanned Aerial Vehicles (UAV) in constrained environments has various challenges concerning trajectory optimization with the target(s) reachability and collisions. In this paper, we formulate multi-UAV navigation in constrained environments as a multi-agent learning problem. Further, we propose a reinforcement learning based Safe-MADDPG method to learn safe and cooperative multi-UAV navigation policies in a constrained environment. The safety constraints to handle inter-UAV collisions during navigation are modeled through action corrections of the learned autonomous navigation policies using an additional safety layer. We have implemented our proposed approach on the Webots Simulator and provided a detailed analysis of the proposed solution. The results demonstrate that the proposed Safe-MADDPG approach is effective in learning safe actions for multi-UAV navigation in constrained environments. en_US
dc.language.iso en_US en_US
dc.subject Multi-agent system en_US
dc.subject Policy gradient en_US
dc.subject Reinforcement learning en_US
dc.subject Safe navigation en_US
dc.subject UAV en_US
dc.subject Webots en_US
dc.title Learning safe cooperative policies in autonomous multi-UAV navigation en_US
dc.type Article en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse

My Account