Çift Kollu Robotlarda Derin Deterministik Politika Gradyanı ile Hata Önleme

Translated title of the contribution: Failure Prevention in Bimanual Robots using Deep Deterministic Policy Gradient

Asel Menekşe*, Abdullah Cihan Ak, Sanem Sarıel

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

One of the key challenges in learning robot object interaction behavior is to obtain an optimal policy that accomplishes the goal safely. In reinforcement learning, representing both the goal and safety constraints with a reward function may result in learning a policy that either omits safety or is too conservative; because states that yield high rewards for achieving the goal are often closer to unsafe zones. To overcome this problem, a dual-arm cooperation method is proposed, which decomposes the overall task into two subtasks assigned to each robot arm. This approach ensures that the goal is achieved while considering safety constraints and preventing potential unsafe situations. In this method, while one arm learns to achieve the goal successfully, the other arm learns to interact with the environment to ensure safety by defining an environmental constraint. The proposed method was tested in a simulation environment on 3 different scenarios adapted for a humanoid robot to perform the task of stirring a bowl, and it was seen that the second arm-assisted stirring scenario showed the best performance.

Translated title of the contributionFailure Prevention in Bimanual Robots using Deep Deterministic Policy Gradient
Original languageTurkish
Title of host publication32nd IEEE Conference on Signal Processing and Communications Applications, SIU 2024 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798350388961
DOIs
Publication statusPublished - 2024
Event32nd IEEE Conference on Signal Processing and Communications Applications, SIU 2024 - Mersin, Turkey
Duration: 15 May 202418 May 2024

Publication series

Name32nd IEEE Conference on Signal Processing and Communications Applications, SIU 2024 - Proceedings

Conference

Conference32nd IEEE Conference on Signal Processing and Communications Applications, SIU 2024
Country/TerritoryTurkey
CityMersin
Period15/05/2418/05/24

Bibliographical note

Publisher Copyright:
© 2024 IEEE.

Fingerprint

Dive into the research topics of 'Failure Prevention in Bimanual Robots using Deep Deterministic Policy Gradient'. Together they form a unique fingerprint.

Cite this