Özet
Uncertainty and partial or unknown information about environment dynamics have led reward-based methods to play a key role in the Single-Agent and Multi-Agent Learning problem. Tree-based planning approaches such as Monte Carlo Tree Search algorithm have been a striking success in single-agent domains where a perfect simulator model is available, e.g., Go and chess strategic board games. This paper presents a decentralized tree-based planning scheme, that combines forward planning with direct reinforcement learning temporal-difference updates applied to the multi-agent setting. Forward planning requires an engine model which is learned from experience and represented via function approximation. Evaluation and validation are carried out in the Hunter-Prey Pursuit cooperative environment and performance is compared with state-of-the-art RL techniques. N-Step Dynamic Tree Search (NSDTS) pretends to adapt the most successful single-agent learning methods to the multi-agent boundaries in a decentralized system structure, dealing with scalability issues and exponential growth of computational resources suffered by centralized systems. NSDTS demonstrates to be a remarkable advance compared to the conventional Q-Learning temporal-difference method.
| Orijinal dil | İngilizce |
|---|---|
| Ana bilgisayar yayını başlığı | AIAA SciTech Forum 2022 |
| Yayınlayan | American Institute of Aeronautics and Astronautics Inc, AIAA |
| ISBN (Basılı) | 9781624106316 |
| DOI'lar | |
| Yayın durumu | Yayınlandı - 2022 |
| Harici olarak yayınlandı | Evet |
| Etkinlik | AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022 - San Diego, United States Süre: 3 Oca 2022 → 7 Oca 2022 |
Yayın serisi
| Adı | AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022 |
|---|
???event.eventtypes.event.conference???
| ???event.eventtypes.event.conference??? | AIAA Science and Technology Forum and Exposition, AIAA SciTech Forum 2022 |
|---|---|
| Ülke/Bölge | United States |
| Şehir | San Diego |
| Periyot | 3/01/22 → 7/01/22 |
Bibliyografik not
Publisher Copyright:© 2022, American Institute of Aeronautics and Astronautics Inc.. All rights reserved.
Finansman
This work is sponsored by the Engineering and Physical Sciences Research Council (EPSRC) and BAE Systems under the project reference no. 2454254.
| Finansörler | Finansör numarası |
|---|---|
| BAE Systems | 2454254 |
| Engineering and Physical Sciences Research Council |
Parmak izi
Swarm Intelligence in Cooperative Environments: Introducing the N-Step Dynamic Tree Search Algorithm' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.Alıntı Yap
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver