Ana gezinime geç Aramaya geç Ana içeriğe geç

Bayesian learning for policy search in trajectory control of a planar manipulator

  • Vahid Tavakol Aghaei
  • , Arda Agababaoglu
  • , Ahmet Onat
  • , Sinan Yildirim

Araştırma sonucu: Kitap/Rapor/Konferans Bildirisinde BölümKonferans katkısıbilirkişi

1 Atıf (Scopus)

Özet

Application of learning algorithms to robotics and control problems with highly nonlinear dynamics to obtain a plausible control policy in a continuous state space is expected to greatly facilitate the design process. Recently, policy search methods such as policy gradient in Reinforcement Learning (RL) have succeeded in coping with such complex systems. Nevertheless, they are slow in convergence speed and are prone to get stuck in local optima. To alleviate this, a Bayesian inference method based on Markov Chain Monte Carlo (MCMC), utilizing a multiplicative reward function, is proposed. This study aims to compare eNAC, a popular gradient based RL method, with the proposed Bayesian learning method, where the objective is trajectory control of a complex model of a 2-DOF planar manipulator. The results obtained for the convergence speed of the proposed algorithm and time response performance, illustrate that the proposed MCMC algorithm is qualified for complex problems in robotics.

Orijinal dilİngilizce
Ana bilgisayar yayını başlığı2019 IEEE 9th Annual Computing and Communication Workshop and Conference, CCWC 2019
EditörlerSatyajit Chakrabarti, Himadri Nath Saha
YayınlayanInstitute of Electrical and Electronics Engineers Inc.
Sayfalar240-246
Sayfa sayısı7
ISBN (Elektronik)9781728105543
DOI'lar
Yayın durumuYayınlandı - 12 Mar 2019
Harici olarak yayınlandıEvet
Etkinlik9th IEEE Annual Computing and Communication Workshop and Conference, CCWC 2019 - Las Vegas, United States
Süre: 7 Oca 20199 Oca 2019

Yayın serisi

Adı2019 IEEE 9th Annual Computing and Communication Workshop and Conference, CCWC 2019

???event.eventtypes.event.conference???

???event.eventtypes.event.conference???9th IEEE Annual Computing and Communication Workshop and Conference, CCWC 2019
Ülke/BölgeUnited States
ŞehirLas Vegas
Periyot7/01/199/01/19

Bibliyografik not

Publisher Copyright:
© 2019 IEEE.

Parmak izi

Bayesian learning for policy search in trajectory control of a planar manipulator' araştırma başlıklarına git. Birlikte benzersiz bir parmak izi oluştururlar.

Alıntı Yap