Swarm Intelligence in Cooperative Environments: Introducing the N-Step Dynamic Tree Search Algorithm

Marc Espinós Longa, Antonios Tsourdos, Gokhan Inalhan

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

3 Atıf (Scopus)

Özet

Uncertainty and partial or unknown information about environment dynamics have led reward-based methods to play a key role in the Single-Agent and Multi-Agent Learning problem. Tree-based planning approaches such as Monte Carlo Tree Search algorithm have been a striking success in single-agent domains where a perfect simulator model is available, e.g., Go and chess strategic board games. This paper presents a decentralized tree-based planning scheme, that combines forward planning with direct reinforcement learning temporal-difference updates applied to the multi-agent setting. Forward planning requires an engine model which is learned from experience and represented via function approximation. Evaluation and validation are carried out in the Hunter-Prey Pursuit cooperative environment and performance is compared with state-of-the-art RL techniques. N-Step Dynamic Tree Search (NSDTS) pretends to adapt the most successful single-agent learning methods to the multi-agent boundaries in a decentralized system structure, dealing with scalability issues and exponential growth of computational resources suffered by centralized systems. NSDTS demonstrates to be a remarkable advance compared to the conventional Q-Learning temporal-difference method.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığıAIAA SciTech Forum 2022
YayınlayanAmerican Institute of Aeronautics and Astronautics Inc, AIAA
ISBN (Basılı)9781624106316
DOI'lar
Yayın durumuYayınlandı - 2022
Harici olarak yayınlandıEvet
EtkinlikAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022 - San Diego, United States
Süre: 3 Oca 20227 Oca 2022

Yayın serisi

AdıAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022
Ülke/BölgeUnited States
ŞehirSan Diego
Periyot3/01/227/01/22

Bibliyografik not

Publisher Copyright:
© 2022, American Institute of Aeronautics and Astronautics Inc.. All rights reserved.

Parmak izi

Swarm Intelligence in Cooperative Environments: Introducing the N-Step Dynamic Tree Search Algorithm' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Alıntı Yap