High fidelity progressive reinforcement learning for agile maneuvering uavs

U. Can Bekar, Burak Yuksek, Gokhan Inalhan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Citations (Scopus)

Abstract

In this work, we present a high fidelity model based progressive reinforcement learning method for control system design for an agile maneuvering UAV. Our work relies on a simulation-based training and testing environment for doing software-in-the-loop (SIL), hardware-in-the-loop (HIL) and integrated flight testing within photo-realistic virtual reality (VR) environment. Through progressive learning with the high fidelity agent and environment models, the guidance and control policies build agile maneuvering based on fundamental control laws. First, we provide insight on development of high fidelity mathematical models using frequency domain system identification. These models are later used to design reinforcement learning based adaptive flight control laws allowing the vehicle to be controlled over a wide range of operating conditions covering model changes on operating conditions such as payload, voltage and damage to actuators and electronic speed controllers (ESCs). We later design outer flight guidance and control laws. Our current work and progress is summarized in this work.

Original languageEnglish
Title of host publicationAIAA Scitech 2020 Forum
PublisherAmerican Institute of Aeronautics and Astronautics Inc, AIAA
ISBN (Print)9781624105951
DOIs
Publication statusPublished - 2020
Externally publishedYes
EventAIAA Scitech Forum, 2020 - Orlando, United States
Duration: 6 Jan 202010 Jan 2020

Publication series

NameAIAA Scitech 2020 Forum
Volume1 PartF

Conference

ConferenceAIAA Scitech Forum, 2020
Country/TerritoryUnited States
CityOrlando
Period6/01/2010/01/20

Bibliographical note

Publisher Copyright:
© 2020, American Institute of Aeronautics and Astronautics Inc, AIAA. All rights reserved.

Fingerprint

Dive into the research topics of 'High fidelity progressive reinforcement learning for agile maneuvering uavs'. Together they form a unique fingerprint.

Cite this