An Evolutionary Reinforcement Learning Approach for Autonomous Maneuver Decision in One-to-One Short-Range Air Combat

Yasin Baykal*, Baris Baspinar

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

4 Citations (Scopus)

Abstract

This paper presents an evolutionary reinforcement learning approach based on Deep Q Networks to address the maneuver decision challenge of unmanned aerial vehicles (UAV) in short-range aerial combat. The proposed approach aims to improve the UAVs' autonomous maneuver decision process and generate a robust policy against alternative enemy strategies. The training process involves parallel training of multiple workers, evaluation of models at regular intervals, selection of the best model, testing the best model against enemy policies, and updating the pool of enemy strategies. The proposed method continuously improves the trained models and generates more robust policies with higher win rates than standard reinforcement learning techniques or k-level learning approaches.

Original languageEnglish
Title of host publicationDASC 2023 - Digital Avionics Systems Conference, Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350333572
DOIs
Publication statusPublished - 2023
Event42nd IEEE/AIAA Digital Avionics Systems Conference, DASC 2023 - Barcelona, Spain
Duration: 1 Oct 20235 Oct 2023

Publication series

NameAIAA/IEEE Digital Avionics Systems Conference - Proceedings
ISSN (Print)2155-7195
ISSN (Electronic)2155-7209

Conference

Conference42nd IEEE/AIAA Digital Avionics Systems Conference, DASC 2023
Country/TerritorySpain
CityBarcelona
Period1/10/235/10/23

Bibliographical note

Publisher Copyright:
© 2023 IEEE.

Keywords

  • air combat
  • decision-making
  • reinforcement learning

Fingerprint

Dive into the research topics of 'An Evolutionary Reinforcement Learning Approach for Autonomous Maneuver Decision in One-to-One Short-Range Air Combat'. Together they form a unique fingerprint.

Cite this