Reinforcement Learning Based Optimization of Autonomous Aircraft Performance

Metin Sari, Fikret Caliskan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

This research introduces an advanced flight control system for optimizing autonomous aircraft performance, leveraging deep reinforcement learning (DRL) to address the complexities of nonlinear flight dynamics. Using a six-degree-of-freedom (6-DoF) rigid aircraft flight dynamics model, we develop a Deep Deterministic Policy Gradient (DDPG) controller tailored for waypoint navigation and attitude stabilization tasks. A custom reward framework and extensive hyperparameter tuning enable effective training within a high-fidelity MATLAB/Simulink environment, achieving high rewards and precise control. Although computationally intensive, the simulations demonstrate robust performance across diverse flight conditions, with potential for real-world applications and future extensions to multi-agent scenarios.

Original languageEnglish
Title of host publication2025 11th International Conference on Mechatronics and Robotics Engineering, ICMRE 2025
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages181-186
Number of pages6
ISBN (Electronic)9798331509293
DOIs
Publication statusPublished - 2025
Event11th International Conference on Mechatronics and Robotics Engineering, ICMRE 2025 - Lille, France
Duration: 24 Feb 202526 Feb 2025

Publication series

Name2025 11th International Conference on Mechatronics and Robotics Engineering, ICMRE 2025

Conference

Conference11th International Conference on Mechatronics and Robotics Engineering, ICMRE 2025
Country/TerritoryFrance
CityLille
Period24/02/2526/02/25

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Keywords

  • deep deterministic policy gradient
  • formation flying
  • modeling
  • navigation and control of unmanned autonomous vehicles
  • reinforcement learning control

Fingerprint

Dive into the research topics of 'Reinforcement Learning Based Optimization of Autonomous Aircraft Performance'. Together they form a unique fingerprint.

Cite this