GCTD3: Modeling of Bipedal Locomotion by Combination of TD3 Algorithms and Graph Convolutional Network

Bui, Khoi Phan; NGUYEN, GIANG; Ngoc, Dat Nguyen

doi:10.3390/app12062948

Journal article 21 views 4 downloads

GCTD3: Modeling of Bipedal Locomotion by Combination of TD3 Algorithms and Graph Convolutional Network

Khoi Phan Bui

, GIANG NGUYEN, Dat Nguyen Ngoc

Applied Sciences, Volume: 12, Issue: 6, Start page: 2948

Swansea University Author: GIANG NGUYEN

PDF | Version of Record

© 2022 by the authors. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Download (5.04MB)

Check full text

DOI (Published version): 10.3390/app12062948

Abstract

In recent years, there has been a lot of research using reinforcement learning algorithms to train 2-legged robots to move, but there are still many challenges. The authors propose the GCTD3 method, which takes the idea of using Graph Convolutional Networks to represent the kinematic link features o...

Full description

Published in:	Applied Sciences
ISSN:	2076-3417
Published:	MDPI AG 2022
Online Access:	Check full text
URI:	https://cronfa.swan.ac.uk/Record/cronfa71636

Abstract:	In recent years, there has been a lot of research using reinforcement learning algorithms to train 2-legged robots to move, but there are still many challenges. The authors propose the GCTD3 method, which takes the idea of using Graph Convolutional Networks to represent the kinematic link features of the robot, and combines this with the Twin-Delayed Deep Deterministic Policy Gradient algorithm to train the robot to move. Graph Convolutional Networks are very effective in graph-structured problems such as the connection of the joints of the human-like robots. The GCTD3 method shows better results on the motion trajectories of the bipedal robot joints compared with other reinforcement learning algorithms such as Twin-Delayed Deep Deterministic Policy Gradient, Deep Deterministic Policy Gradient and Soft Actor Critic. This research is implemented on a 2-legged robot model with six independent joint coordinates through the Robot Operating System and Gazebo simulator.
Keywords:	GCTD3; GCN; TD3; ROS; reward function; bipedal robot
College:	Faculty of Science and Engineering
Funders:	N/A
Issue:	6
Start Page:	2948

GCTD3: Modeling of Bipedal Locomotion by Combination of TD3 Algorithms and Graph Convolutional Network

Similar Items