Ana gezinime geç Aramaya geç Ana içeriğe geç

Swarm Intelligence in Cooperative Environments: Introducing the N-Step Dynamic Tree Search Algorithm

  • Cranfield University

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

3 Atıf (Scopus)

Özet

Uncertainty and partial or unknown information about environment dynamics have led reward-based methods to play a key role in the Single-Agent and Multi-Agent Learning problem. Tree-based planning approaches such as Monte Carlo Tree Search algorithm have been a striking success in single-agent domains where a perfect simulator model is available, e.g., Go and chess strategic board games. This paper presents a decentralized tree-based planning scheme, that combines forward planning with direct reinforcement learning temporal-difference updates applied to the multi-agent setting. Forward planning requires an engine model which is learned from experience and represented via function approximation. Evaluation and validation are carried out in the Hunter-Prey Pursuit cooperative environment and performance is compared with state-of-the-art RL techniques. N-Step Dynamic Tree Search (NSDTS) pretends to adapt the most successful single-agent learning methods to the multi-agent boundaries in a decentralized system structure, dealing with scalability issues and exponential growth of computational resources suffered by centralized systems. NSDTS demonstrates to be a remarkable advance compared to the conventional Q-Learning temporal-difference method.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığıAIAA SciTech Forum 2022
YayınlayanAmerican Institute of Aeronautics and Astronautics Inc, AIAA
ISBN (Basılı)9781624106316
DOI'lar
Yayın durumuYayınlandı - 2022
Harici olarak yayınlandıEvet
EtkinlikAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022 - San Diego, United States
Süre: 3 Oca 20227 Oca 2022

Yayın serisi

AdıAIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022
Ülke/BölgeUnited States
ŞehirSan Diego
Periyot3/01/227/01/22

Bibliyografik not

Publisher Copyright:
© 2022, American Institute of Aeronautics and Astronautics Inc.. All rights reserved.

Finansman

This work is sponsored by the Engineering and Physical Sciences Research Council (EPSRC) and BAE Systems under the project reference no. 2454254.

FinansörlerFinansör numarası
BAE Systems2454254
Engineering and Physical Sciences Research Council

    Parmak izi

    Swarm Intelligence in Cooperative Environments: Introducing the N-Step Dynamic Tree Search Algorithm' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

    Alıntı Yap