Towards Safe Deep Reinforcement Learning for Autonomous Airborne Collision Avoidance Systems

Christos Panoutsakopoulos, Burak Yuksek, Gokhan Inalhan, Antonios Tsourdos

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

7 Citations (Scopus)

Abstract

In this paper we consider the application of Safe Deep Reinforcement Learning in the context of a trustworthy autonomous Airborne Collision Avoidance System. A simple 2D airspace model is defined, in which a hypothetical air vehicle attempts to fly to a given waypoint while autonomously avoiding Near Mid-Air collisions (NMACs) with non-cooperative traffic. We use Proximal Policy Optimisation for our learning agent and we propose a reward engineering approach based on a combination of sparse terminal rewards at natural termination points and dense step rewards providing the agent with continuous feedback on its actions, based on relative geometry and motion attributes of its trajectory with respect to the traffic and the target waypoint. The performance of our trained agent is evaluated through Monte-Carlo simulations and it is demonstrated that it achieves to master the collision avoidance task with respect to safety for a reasonable trade-off in mission performance.

Original languageEnglish
Title of host publicationAIAA SciTech Forum 2022
PublisherAmerican Institute of Aeronautics and Astronautics Inc, AIAA
ISBN (Print)9781624106316
DOIs
Publication statusPublished - 2022
Externally publishedYes
EventAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022 - San Diego, United States
Duration: 3 Jan 20227 Jan 2022

Publication series

NameAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022

Conference

ConferenceAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022
Country/TerritoryUnited States
CitySan Diego
Period3/01/227/01/22

Bibliographical note

Publisher Copyright:
© 2022, American Institute of Aeronautics and Astronautics Inc.. All rights reserved.

Fingerprint

Dive into the research topics of 'Towards Safe Deep Reinforcement Learning for Autonomous Airborne Collision Avoidance Systems'. Together they form a unique fingerprint.

Cite this