No Cover Image

Journal article 21 views 4 downloads

GCTD3: Modeling of Bipedal Locomotion by Combination of TD3 Algorithms and Graph Convolutional Network

Khoi Phan Bui Orcid Logo, GIANG NGUYEN, Dat Nguyen Ngoc Orcid Logo

Applied Sciences, Volume: 12, Issue: 6, Start page: 2948

Swansea University Author: GIANG NGUYEN

  • 71636.VoR.pdf

    PDF | Version of Record

    © 2022 by the authors. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.

    Download (5.04MB)

Check full text

DOI (Published version): 10.3390/app12062948

Abstract

In recent years, there has been a lot of research using reinforcement learning algorithms to train 2-legged robots to move, but there are still many challenges. The authors propose the GCTD3 method, which takes the idea of using Graph Convolutional Networks to represent the kinematic link features o...

Full description

Published in: Applied Sciences
ISSN: 2076-3417
Published: MDPI AG 2022
Online Access: Check full text

URI: https://cronfa.swan.ac.uk/Record/cronfa71636
Abstract: In recent years, there has been a lot of research using reinforcement learning algorithms to train 2-legged robots to move, but there are still many challenges. The authors propose the GCTD3 method, which takes the idea of using Graph Convolutional Networks to represent the kinematic link features of the robot, and combines this with the Twin-Delayed Deep Deterministic Policy Gradient algorithm to train the robot to move. Graph Convolutional Networks are very effective in graph-structured problems such as the connection of the joints of the human-like robots. The GCTD3 method shows better results on the motion trajectories of the bipedal robot joints compared with other reinforcement learning algorithms such as Twin-Delayed Deep Deterministic Policy Gradient, Deep Deterministic Policy Gradient and Soft Actor Critic. This research is implemented on a 2-legged robot model with six independent joint coordinates through the Robot Operating System and Gazebo simulator.
Keywords: GCTD3; GCN; TD3; ROS; reward function; bipedal robot
College: Faculty of Science and Engineering
Funders: N/A
Issue: 6
Start Page: 2948