Ana gezinime geç Aramaya geç Ana içeriğe geç

Swarm Intelligence in Cooperative Environments: n-Step Dynamic Tree Search Algorithm Overview

Araştırma sonucu: Dergiye katkıMakalebilirkişi

2 Atıf (Scopus)

Özet

Reinforcement learning tree-based planningmethods have been gaining popularity in the last few years due to their success in single-agent domains,where a perfect simulatormodel is available: for example,Go and chess strategic board games. This paper pretends to extend tree search algorithms to the multiagent setting in a decentralized structure, dealing with scalability issues and exponential growth of computational resources. The n-step dynamic tree search combines forward planning and direct temporal-difference updates, outperforming markedly conventional tabular algorithms such asQ learning and state-action-reward-state-action (SARSA). Future state transitions and rewards are predicted with a model built and learned from real interactions between agents and the environment. This paper analyzes the developed algorithmin the hunter–pursuit cooperative game against stochastic and intelligent evaders.The n-step dynamic tree search aims to adapt single-agent tree search learningmethods to themultiagent boundaries and is demonstrated to be a remarkable advance as compared to conventional temporal-difference techniques.

Orijinal dilİngilizce
Sayfa (başlangıç-bitiş)418-425
Sayfa sayısı8
DergiJournal of Aerospace Information Systems
Hacim20
Basın numarası7
DOI'lar
Yayın durumuYayınlandı - Tem 2023
Harici olarak yayınlandıEvet

Bibliyografik not

Publisher Copyright:
© 2023 by the American Institute of Aeronautics and Astronautics, Inc. All rights reserved.

Finansman

This research is sponsored by the Engineering and Physical Sciences Research Council and BAE Systems under project reference number 2454254.

FinansörlerFinansör numarası
BAE Systems2454254
Engineering and Physical Sciences Research Council

    Parmak izi

    Swarm Intelligence in Cooperative Environments: n-Step Dynamic Tree Search Algorithm Overview' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

    Alıntı Yap