Bayesian learning for policy search in trajectory control of a planar manipulator

Vahid Tavakol Aghaei, Arda Agababaoglu, Ahmet Onat, Sinan Yildirim

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Citation (Scopus)

Abstract

Application of learning algorithms to robotics and control problems with highly nonlinear dynamics to obtain a plausible control policy in a continuous state space is expected to greatly facilitate the design process. Recently, policy search methods such as policy gradient in Reinforcement Learning (RL) have succeeded in coping with such complex systems. Nevertheless, they are slow in convergence speed and are prone to get stuck in local optima. To alleviate this, a Bayesian inference method based on Markov Chain Monte Carlo (MCMC), utilizing a multiplicative reward function, is proposed. This study aims to compare eNAC, a popular gradient based RL method, with the proposed Bayesian learning method, where the objective is trajectory control of a complex model of a 2-DOF planar manipulator. The results obtained for the convergence speed of the proposed algorithm and time response performance, illustrate that the proposed MCMC algorithm is qualified for complex problems in robotics.

Original languageEnglish
Title of host publication2019 IEEE 9th Annual Computing and Communication Workshop and Conference, CCWC 2019
EditorsSatyajit Chakrabarti, Himadri Nath Saha
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages240-246
Number of pages7
ISBN (Electronic)9781728105543
DOIs
Publication statusPublished - 12 Mar 2019
Externally publishedYes
Event9th IEEE Annual Computing and Communication Workshop and Conference, CCWC 2019 - Las Vegas, United States
Duration: 7 Jan 20199 Jan 2019

Publication series

Name2019 IEEE 9th Annual Computing and Communication Workshop and Conference, CCWC 2019

Conference

Conference9th IEEE Annual Computing and Communication Workshop and Conference, CCWC 2019
Country/TerritoryUnited States
CityLas Vegas
Period7/01/199/01/19

Bibliographical note

Publisher Copyright:
© 2019 IEEE.

Keywords

  • Bayesian Learning
  • Control Systems
  • Markov Chain Monte Carlo
  • Reinforcement Learning
  • Robotics

Fingerprint

Dive into the research topics of 'Bayesian learning for policy search in trajectory control of a planar manipulator'. Together they form a unique fingerprint.

Cite this