Swarm Intelligence in Cooperative Environments: Introducing the N-Step Dynamic Tree Search Algorithm

Marc Espinós Longa, Antonios Tsourdos, Gokhan Inalhan

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

3 Citations (Scopus)

Abstract

Uncertainty and partial or unknown information about environment dynamics have led reward-based methods to play a key role in the Single-Agent and Multi-Agent Learning problem. Tree-based planning approaches such as Monte Carlo Tree Search algorithm have been a striking success in single-agent domains where a perfect simulator model is available, e.g., Go and chess strategic board games. This paper presents a decentralized tree-based planning scheme, that combines forward planning with direct reinforcement learning temporal-difference updates applied to the multi-agent setting. Forward planning requires an engine model which is learned from experience and represented via function approximation. Evaluation and validation are carried out in the Hunter-Prey Pursuit cooperative environment and performance is compared with state-of-the-art RL techniques. N-Step Dynamic Tree Search (NSDTS) pretends to adapt the most successful single-agent learning methods to the multi-agent boundaries in a decentralized system structure, dealing with scalability issues and exponential growth of computational resources suffered by centralized systems. NSDTS demonstrates to be a remarkable advance compared to the conventional Q-Learning temporal-difference method.

Original languageEnglish
Title of host publicationAIAA SciTech Forum 2022
PublisherAmerican Institute of Aeronautics and Astronautics Inc, AIAA
ISBN (Print)9781624106316
DOIs
Publication statusPublished - 2022
Externally publishedYes
EventAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022 - San Diego, United States
Duration: 3 Jan 20227 Jan 2022

Publication series

NameAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022

Conference

ConferenceAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022
Country/TerritoryUnited States
CitySan Diego
Period3/01/227/01/22

Bibliographical note

Publisher Copyright:
© 2022, American Institute of Aeronautics and Astronautics Inc.. All rights reserved.

Funding

This work is sponsored by the Engineering and Physical Sciences Research Council (EPSRC) and BAE Systems under the project reference no. 2454254.

FundersFunder number
BAE Systems2454254
Engineering and Physical Sciences Research Council

    Fingerprint

    Dive into the research topics of 'Swarm Intelligence in Cooperative Environments: Introducing the N-Step Dynamic Tree Search Algorithm'. Together they form a unique fingerprint.

    Cite this